CS298 Proposal

Integrating ChatGPT with A-Frame for User-Driven 3D Modeling

Ivan Hernandez (ivan.a.hernandez@sjsu.edu)

Advisor: Dr. Chris Pollett

Committee Members: Dr. Genya Ishigaki, Prof. Kevin Smith

Abstract:

The objective of this project is to develop a system that integrates ChatGPT into an A-Frame-based virtual reality (VR) environment, thus enabling users to create, modify, and interact with 3D models based on user-provided natural language descriptions. Currently, VR and modeling applications often rely on traditional input devices and graphical user interfaces, which can be complex and unintuitive for users. By leveraging ChatGPT's natural language understanding capabilities, this system can offer a more engaging and immersive experience for users to guide and modify the 3D modeling process in real-time. Furthermore, we will extend this system’s capabilities by implementing scene serialization to save and load VR scenes, custom A-Frame component generation using JavaScript, and more advanced 3D modeling techniques (bevel, booleans, etc.) by directly accessing the underlying Three.js libraries.

CS297 Results

Created a demo VR environment with A-Frame and directly compared its workflow with that of Unity3D.
Successfully integrated ChatGPT with an A-Frame VR environment for real-time 3D modeling.
Developed an interpreter layer to translate ChatGPT's responses into A-Frame modeling commands.
Implemented voice interactions for easier communication with the system.

Proposed Schedule

Week 1: Jan 30 - Feb 6	Finish CS 298 proposal
Week 2: Feb 6 - Feb 13	Research serialization strategies and begin implementing A-Frame scene serialization
Week 3: Feb 13 - Feb 20	Finish serialization functionality and test different scenarios
Week 4-6: Feb 20 - Mar 12	Implement custom A-Frame component generation with Javascript
Week 7-10: Mar 12 - Apr 9	Implement Three.js integration for more 3D modeling features
Week 11-12: Apr 9 - Apr 23	Conduct comprehensive testing to evaluate the system's effectiveness.
Week 13: Apr 23 - Apr 30	Work on CS 298 Report and begin preparation for Master’s Defense
Week 14: Apr 30 - May 7	Continue working on CS298 Report and Presentation
Week 15: May 7 - May 14	Finish CS298 Report and Presentation

Key Deliverables:

Software
- Implement an A-Frame serialization module for saving and loading A-Frame VR scenes.
- Design a system that creates A-Frame Javascript components with customizable functionality which can be attached to A-Frame objects. Then, implement a parser that ensures that the component is functional and compatible within the VR environment.
- Extend the project’s capabilities by implementing modeling features, such as bevel, extrude, and booleans using the underlying Three.js library that is not natively available in A-Frame.
- Implement a structured test series covering simple to complex modeling scenarios to evaluate the effectiveness and usability of the system.
Report
- CS 298 Report
- CS 298 Presentation

Innovations and Challenges

One of the challenges for this project will be to effectively design a system that can generate A-Frame components that provide accurate behavior and functionality for objects in the scene. The generated Javascript components must use working code that adheres to the newest version of A-Frame standards and correctly interprets the users intentions.
There is little to no research that explores the combination of NLP with VR for 3D modeling. Therefore, as this project requires the integration of various technologies, including VR, NLP, and 3D Graphics, it will be challenging to develop a system that merges these fields effectively while ensuring a seamless and immersive 3D modeling experience.
Integrating the Three.js libraries into the A-Frame project will bring about more 3D modeling capabilities that not only need to be easy to use but also intuitive for the user to incorporate into their modeling process. Therefore, as more complexity is added to the system, it will make it more challenging to maintain a consistent level of accessibility and user-friendly interactions while modeling with voice commands.
Conducting comprehensive testing for this project will also prove to be challenging as it needs to cover a diverse case of modeling scenarios, including simple to complex cases as well as extreme and invalid test cases. A clear and balanced testing methodology needs to be established so that we can properly evaluate the quantitative performance metrics as well as the qualitative user interactions.

References:

[1] Web-based Virtual Reality with A-Frame. S. G. Santos and J. C. S. Cardoso. 2019 14th Iberian Conference on Information Systems and Technologies (CISTI). 2019.

[2] ChatGPT: Fundamentals, Applications and Social Impacts. M. Abdullah, A. Madain and Y. Jararweh. 2022 Ninth International Conference on Social Networks Analysis, Management and Security (SNAMS). 2022.

[3] Consistency Analysis of ChatGPT. Jang, Myeongjun and Thomas Lukasiewicz. ArXiv. 2023.

[4] Research on Voice Interaction Technology in VR Environment. C. Li and B. Tang. 2019 International Conference on Electronic Engineering and Informatics (EEI). 2019.

[5] Collaboration between ChatGPT API and A-Frame. Takashi Yoshinaga. Youtube. 2023.