VOLUME 15, ISSUE 3, MARCH 2026
This work is licensed under a Creative Commons Attribution 4.0 International License.
3D HUMANOID AI-SMART “AW8R”
Prof. CHANDRESH BHANGE, PRATYUSH.G. JANBHANDHU, SAMIKSHA A. GAJBHIYE, PRAKRUTI R. GAIMUKHE, TANU R. PATALE, RAVISHANKAR V. SHAHU, JAYASHRI M. MARBATE
DOI: 10.17148/IJARCCE.2026.15381
Abstract: The 3D Humanoid AI-Smart “AW8R” is an advanced portable AI-based voice assistant system designed to enhance natural human-machine interaction by integrating speech recognition, emotion detection, and a 3D holographic humanoid avatar. Unlike traditional voice assistants that rely solely on audio responses, the proposed system introduces synchronized facial expressions and lip movements to create a more engaging and realistic user experience.
The system is built around the ESP32-S3 microcontroller with integrated Wi-Fi for cloud connectivity. Audio input is captured using an INMP441 MEMS microphone and transmitted to cloud-based AI services such as Gemini or OpenAI for speech-to-text conversion, intent detection, and contextual response generation. The backend also performs emotion analysis and text-to-speech synthesis. The processed response is delivered to the user through a 3 W speaker driven by a MAX98357A amplifier, while a 1.3-inch TFT display renders a 3D humanoid avatar. A holographic acrylic pyramid creates a floating 3D visual effect.
The system is powered by a 2400 mAh Li-ion battery with integrated charging and boost regulation. Experimental evaluation demonstrates stable operation, synchronized animation, and improved user engagement compared to traditional voice-only systems. The project provides a cost-effective and scalable solution for next-generation AI assistants.
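The abstract does not state how lip movement is synchronized with the synthesized speech. As a minimal sketch under that caveat, one common approach is to drive the avatar's mouth openness from the short-time RMS amplitude of the text-to-speech audio. The Python below illustrates the idea only; the function names, the 16 kHz sample rate, the 20 fps animation rate, and the four mouth levels are illustrative assumptions, not details taken from the paper.

```python
import math
from typing import List, Sequence


def rms(samples: Sequence[int]) -> float:
    """Root-mean-square amplitude of a block of PCM samples (0.0 if empty)."""
    if not samples:
        return 0.0
    return math.sqrt(sum(s * s for s in samples) / len(samples))


def mouth_openness(
    pcm: Sequence[int],
    sample_rate: int = 16000,    # assumed TTS output rate
    fps: int = 20,               # assumed avatar animation rate
    full_scale: float = 32768.0, # 16-bit signed PCM full scale
    levels: int = 4,             # assumed number of mouth sprites (0 = closed)
) -> List[int]:
    """Return one mouth-level index per animation frame.

    The audio is cut into windows of one animation frame each; the RMS of
    every window is normalized to [0, 1] and quantized to `levels` steps.
    """
    window = max(1, sample_rate // fps)
    frames: List[int] = []
    for start in range(0, len(pcm), window):
        level = rms(pcm[start:start + window]) / full_scale
        frames.append(min(levels - 1, int(level * levels)))
    return frames


if __name__ == "__main__":
    # Synthetic input: 0.1 s of silence followed by 0.1 s of a loud square wave.
    silence = [0] * 1600
    loud = [32767, -32767] * 800
    print(mouth_openness(silence + loud))
```

In a design like this, the per-frame indices would select which mouth sprite the display draws while the same audio buffer is played through the amplifier, so animation and sound share one timeline. Real speech rarely approaches full scale, so a gain or smoothing stage would likely be needed in practice.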
How to Cite:
[1] Prof. CHANDRESH BHANGE, PRATYUSH.G. JANBHANDHU, SAMIKSHA A. GAJBHIYE, PRAKRUTI R. GAIMUKHE, TANU R. PATALE, RAVISHANKAR V. SHAHU, JAYASHRI M. MARBATE, “3D HUMANOID AI-SMART ‘AW8R’,” International Journal of Advanced Research in Computer and Communication Engineering (IJARCCE), DOI: 10.17148/IJARCCE.2026.15381
