Visual Blocks: Visual Prototyping of AI Pipelines

The project Rapsai, a.k.a. Visual Blocks for ML, aims to make the prototyping of machine learning (ML) based multimedia applications more efficient and accessible. In recent years, there has been a proliferation of multimedia applications that leverage machine learning (ML) for interactive experiences. Prototyping ML-based applications is, however, still challenging, given complex workflows that are not ideal for design and experimentation. To better understand these challenges, we conducted a formative study with seven ML practitioners to gather insights about common ML evaluation workflows. \n\nThe study helped us derive six design goals, which informed Rapsai, a visual programming platform for rapid and iterative development of end-to-end ML-based multimedia applications. Rapsai features a node-graph editor to facilitate interactive characterization and visualization of ML model performance. Rapsai streamlines end-to-end prototyping with interactive data augmentation and model comparison capabilities in its no-coding environment. Our evaluation of Rapsai in four real-world case studies (N=15) suggests that practitioners can accelerate their workflow, make more informed decisions, analyze strengths and weaknesses, and holistically evaluate model behavior with real-world input. Try our live demo at Visual Blocks for ML and [let us know if you find it useful in your classes or project!

Publications

InstructPipe: Building Visual Programming Pipelines in Visual Blocks with Human Instructions Using LLMs🎖️ Honorable Mentions Award

Zhongyi Zhou, Jing Jin, Vrushank Phadnis, Xiuxiu Yuan, Jun Jiang, Xun Qian, Jingtao Zhou, Yiyi Huang, Zheng Xu, Yinda Zhang, Kristen Wright, Jason Mayes, Mark Sherwood, Johnny Lee, Alex Olwal, David Kim, Ram Iyengar, Na Li, and Ruofei Du

Proceedings of the 2025 CHI Conference on Human Factors in Computing Systems (CHI), 2025.

Keywords: Visual Programming; Large Language Models; Visual Prototyping; Nodegraph Editor; Graph Compiler; Low-code Development; Deep Neural Networks; Deep Learning; Visual Analytics; Interactive Perception

pdf, doi | project, video, demo, news | abstract | cited by, cite

Experiencing InstructPipe: Building Multi-modal AI Pipelines Via Prompting LLMs and Visual Programming

Extended Abstracts of the 2024 CHI Conference on Human Factors in Computing Systems (CHI), 2024.

Keywords: Visual Programming; Large Language Models; Visual Prototyping;Node-graph Editor; Graph Compiler; Low-code Development; DeepNeural Networks; Deep Learning; Visual Analytics

pdf, doi | project, video | abstract | cited by, cite

teaser image of Rapsai: Accelerating Machine Learning Prototyping of Multimedia Applications Through Visual Programming

Rapsai: Accelerating Machine Learning Prototyping of Multimedia Applications Through Visual Programming🎖️ Honorable Mentions Award, 170K+ views

Ruofei Du, Na Li, Jing Jin, Michelle Carney, Scott Miles, Maria Kleiner, Xiuxiu Yuan, Yinda Zhang, Anuva Kulkarni, Xingyu Bruce Liu, Ahmed Sabie, Sergio Orts-Escolano, Abhishek Kar, Ping Yu, Ram Iyengar, Adarsh Kowdle, and Alex Olwal

Proceedings of the 2023 CHI Conference on Human Factors in Computing Systems (CHI), 2023.

Keywords: visual programming, node-graph editor, deep neural networks, data augmentation, deep learning, model comparison, visual analytics, interactive perception

pdf, lowres, doi | website, project, video, slides, code, demo, news | abstract | cited by, cite

Experiencing Visual Blocks for ML: Visual Prototyping of AI Pipelines

Ruofei Du, Na Li, Jing Jin, Michelle Carney, Xiuxiu Yuan, Kristen Wright, Mark Sherwood, Jason Mayes, Lin Chen, Jun Jiang, Jingtao Zhou, Zhongyi Zhou, Ping Yu, Adarsh Kowdle, Ram Iyengar, and Alex Olwal

Adjunct Proceedings of the 36th Annual ACM Symposium on User Interface Software and Technology (UIST), 2023.

Keywords: visual programming, large language models, visual prototyping, multi-modal models, node-graph editor, deep neural networks, data augmentation, deep learning, visual analytics

pdf, doi | project, slides, code | abstract | cited by, cite

Experiencing Rapid Prototyping of Machine Learning Based Multimedia Applications in Rapsai

Ruofei Du, Na Li, Jing Jin, Michelle Carney, Xiuxiu Yuan, Ping Yu, Ram Iyengar, Adarsh Kowdle, and Alex Olwal

Extended Abstracts of the 2023 CHI Conference on Human Factors in Computing Systems (CHI EA), 2023.

Keywords: visual programming, node-graph editor, deep neural networks, data augmentation, deep learning, model comparison, visual analytics, interactive perception

pdf, doi | website, project, slides, code, demo, news | abstract | cited by, cite

Videos

InstructPipe: Building Visual Programming Pipelines in Visual Blocks with Human Instructions Using LLMs

youtube | cite

Rapsai: Accelerating Machine Learning Prototyping of Multimedia Applications Through Visual Programming

youtube, mp4 | cite

Preview of Rapsai

youtube

Visual Blocks: Ridiculously rapid ML/AI prototyping and deployment to production

youtube

How to create effects with models and shaders using visualblocks

youtube

How to compare models from web using visualblocks

youtube

How to use models and build pipelines in Colab with visualblock

youtube

Talks

Interactive Perception & Graphics for a Universally Accessible XR

Ruofei Du

CVPR 2025 , Nashville, TN, USA.

pdf |

Visual Blocks for ML: Visual Prototyping of AI Pipelines

Ruofei Du

ECE188 , UCLA Remote Lecture.

pdf | gSlides

Networking and System Challenges: Interactive Perception & Graphics for a Universally Accessible XR

Ruofei Du

NSF ImmerCon 2025 , George Mason University, Fairfax, VA.

pdf |

Computational Interaction for a Universally Accessible Metaverse

Ruofei Du

Invited Talk at KAIST by Prof. Sang Ho Yoon , Remote Talk.

pdf |

Computational Interaction for a Universally Accessible Metaverse

Ruofei Du

Invited Talk at University of Minnesota by Prof. Zhu-Tian Chen, Minneapolis, MN, USA.

pdf |

Experiencing InstructPipe: Building Multi-modal AI Pipelines via Prompting LLMs and Visual Programming

Zhongyi Zhou

CHI 2024, Hawaii, USA.

pdf | | cite

Visual Blocks for ML: Visual Prototyping of AI Pipelines

Ruofei Du

CS139: Human-Centered AI @ Stanford , Stanford, Palo Alto.

pdf | gSlides

Interactive Perception & Graphics for a Universally Accessible Metaverse

Ruofei Du

University of Virginia CS Fall 2023 Distinguished Speakers, Charlottesville, Virginia, USA.

pdf | source, gSlides

Rapsai: Accelerating Machine Learning Prototyping of Multimedia Applications through Visual Programming

Ruofei Du

CHI 2023, Hamburg, Germany.

pdf | talk (onsite), video, video (short) | gSlides | cite

Interactive Perception & Graphics for a Universally Accessible Metaverse

Ruofei Du

Invited Talk at UCLA by Prof. Yang Zhang , Remote Talk.

pdf | gSlides

Interactive Graphics for a Universally Accessible Metaverse

Ruofei Du

Invited Talk at ECL Seminar Series by Dr. Alaeddin Nassani , Remote Talk.

pdf | gSlides

Interactive Graphics for a Universally Accessible Metaverse

Ruofei Du

Invited Talk at UICC 2023 sponsored by University of Iowa ACM Student Chapter, Iowa City, Iowa.

pdf | gSlides

Interactive Graphics for a Universally Accessible Metaverse

Ruofei Du

Invited Talk at Empathic Computing Lab , Remote Talk.

pdf |

Cited By

Creating Design Resources to Scaffold the Ideation of AI Concepts. Proceedings of the 2023 ACM Designing Interactive Systems Conference.Nur Yildirim, Changhoon Oh, Deniz Sayar, Kayla Br, Supritha Challa, Violet Turri, Nina Crosby Walton, Anna Elise Wong, Jodi Forlizzi, James McCann, and John Zimmerman. source | cite | search

LLMR: Real-time Prompting of Interactive Worlds Using Large Language Models. Proceedings of the CHI Conference on Human Factors in Computing Systems.Fernanda De La Torre, Cathy Fang, Han Huang, Andrzej Banburski-Fahey, Judith Amores Fernandez, and Jaron Lanier. source | cite | search

IPA: Inference Pipeline Adaptation to Achieve High Accuracy and Cost-Efficiency. arXiv.2308.12871.Saeid Ghafouri, Kamran Razavi, Mehran Salmani, Alireza Sanaee, Tania Lorido-Botran, Lin Wang, Joseph Doyle, and Pooyan Jamshidi. source | cite | search

Branching Preferences: Visualizing Non-linear Topic Progression in Conversational Recommender Systems. Adjunct Proceedings of the 31st ACM Conference on User Modeling, Adaptation and Personalization.Lovis Bero Suchmann, Nicole Krämer, and Jürgen Ziegler. source | cite | search

Jigsaw: Supporting Designers in Prototyping Multimodal Applications by Assembling AI Foundation Models. arXiv.2310.08574.David Lin and Nikolas Martelaro. source | cite | search

Adaptation of Enterprise Modeling Methods for Large Language Models. The Practice of Enterprise Modeling.Balbir S. Barn, Souvik Barat, and Kurt Sandkuhl. source | cite | search

Canvil: Designerly Adaptation for LLM-Powered User Experiences. arXiv.2401.09051.K. J. Kevin Feng, Q. Vera Liao, Ziang Xiao, Jennifer Wortman Vaughan, Amy Zhang, and David W. McDonald. source | cite | search

Spatial-Temporal Transformer Network for Human Mocap Data Recovery. 2023 IEEE International Conference on Image Processing (ICIP).Jijin Zhang, Jingliang Peng, and Na Lv. source | cite | search

Not Just Novelty: A Longitudinal Study on Utility and Customization of an AI Workflow. Designing Interactive Systems Conference.Tao Long, Katy Ilonka Gero, and Lydia Chilton. source | cite | search

Wordflow: Social Prompt Engineering for Large Language Models. arXiv.2401.14447.Zijie J. Wang, Aishwarya Chakravarthy, David Munechika, and Duen Horng Chau. source | cite | search

Vision-Based Multimodal Interfaces: A Survey and Taxonomy for Enhanced Context-Aware System Design. arXiv.2501.13443.Yongquan Hu, Jingyu Tang, Xinya Gong, Zhongyi Zhou, Shuning Zhang, Don Samitha Elvitigala, Florian 'Floyd' Mueller, Wen Hu, and Aaron J. Quigley. source | cite | search

ProInterAR: A Visual Programming Platform for Creating Immersive AR Interactions. Proceedings of the CHI Conference on Human Factors in Computing Systems.Hui Ye, Jiaye Leng, Pengfei Xu, Karan Singh, and Hongbo Fu. source | cite | search

Navigate to Projects

Stay In Touch

{{Email}}
click to reveal
{{Phone}}
click to reveal
Last Updated
by Ruofei Du
(杜若飞)

Publications

InstructPipe: Building Visual Programming Pipelines in Visual Blocks with Human Instructions Using LLMs🎖️ Honorable Mentions Award

Experiencing InstructPipe: Building Multi-modal AI Pipelines Via Prompting LLMs and Visual Programming

Rapsai: Accelerating Machine Learning Prototyping of Multimedia Applications Through Visual Programming🎖️ Honorable Mentions Award, 170K+ views

Experiencing Visual Blocks for ML: Visual Prototyping of AI Pipelines

Experiencing Rapid Prototyping of Machine Learning Based Multimedia Applications in Rapsai

Videos

Talks

Interactive Perception & Graphics for a Universally Accessible XR

Visual Blocks for ML: Visual Prototyping of AI Pipelines

Networking and System Challenges: Interactive Perception & Graphics for a Universally Accessible XR

Computational Interaction for a Universally Accessible Metaverse

Computational Interaction for a Universally Accessible Metaverse

Experiencing InstructPipe: Building Multi-modal AI Pipelines via Prompting LLMs and Visual Programming

Visual Blocks for ML: Visual Prototyping of AI Pipelines

Interactive Perception & Graphics for a Universally Accessible Metaverse

Rapsai: Accelerating Machine Learning Prototyping of Multimedia Applications through Visual Programming

Interactive Perception & Graphics for a Universally Accessible Metaverse

Interactive Graphics for a Universally Accessible Metaverse

Interactive Graphics for a Universally Accessible Metaverse

Interactive Graphics for a Universally Accessible Metaverse

Cited By

Related Work

Stay In Touch

{{Email}}

{{Phone}}

Last Updated