Department of CSE (Data Science), ACE Engineering College, Telangana, India.
World Journal of Advanced Engineering Technology and Sciences, 2025, 15(02), 073-081
Article DOI: 10.30574/wjaets.2025.15.2.0512
Received on 18 March 2025; revised on 29 April 2025; accepted on 01 May 2025
Wave Talk is a multimodal human-computer interaction system that combines real-time hand gesture recognition with voice command processing to provide touchless control of digital devices. It uses OpenCV and MediaPipe for gesture tracking, and Speech Recognition and pyttsx3 for voice interaction, offering an intuitive interface for users in diverse environments, including users with physical disabilities and those working in hygiene-sensitive settings. Because it runs on standard webcams and microphones, Wave Talk is cost-effective and broadly usable. The methodology covers data acquisition, preprocessing, model integration, and action execution, and system testing confirmed high accuracy and low latency. With applications in smart homes, healthcare, education, and public spaces, Wave Talk demonstrates the potential of multimodal interaction systems to improve accessibility, efficiency, and user experience in next-generation smart technologies.
Gesture Recognition; Voice Assistant; Multimodal Interface; MediaPipe; OpenCV; Speech Recognition; Touchless Control; Real-Time Interaction
Parwateeswar Gollapalli, Sana Tabasum, Sidhartha Tadaboina, Sai Kumar Ganta and Aishwarya Gottipamula. Wave Talk: A smart gesture and voice assistant. World Journal of Advanced Engineering Technology and Sciences, 2025, 15(02), 073-081. Article DOI: https://doi.org/10.30574/wjaets.2025.15.2.0512.