📞 +91-7667918914 | ✉️ ijarcce@gmail.com
International Journal of Advanced Research in Computer and Communication Engineering
International Journal of Advanced Research in Computer and Communication Engineering A monthly Peer-reviewed & Refereed journal
ISSN Online 2278-1021ISSN Print 2319-5940Since 2012
IJARCCE adheres to the suggestive parameters outlined by the University Grants Commission (UGC) for peer-reviewed journals, upholding high standards of research quality, ethical publishing, and academic excellence.
← Back to VOLUME 15, ISSUE 4, APRIL 2026

A Hybrid Signal-Processing and Neural Network Pipeline for Low-Latency Acoustic Event Detection at the Edge

Jatin Verma, Abhishek Jatla, Thanuja Reddy, Ms. Charulatha R.T

👁 33 views📥 5 downloads
Share: 𝕏 f in
Abstract: This paper presents an edge-computed AI framework that detects a specific acoustic event — a human handclap — and actuates a wireless IoT device in response, without touching a cloud server at any point. Audio is captured through a professional condenser microphone, streamed into a Python processing pipeline via a non-blocking callback queue, and subjected to two lightweight deterministic gates before any neural computation occurs. The first gate discards frame whose peak amplitude falls below a fixed noise floor, keeping the classifier idle during silence. The second gate enforces a post-detection bypass window, preventing room echoes from generating spurious follow-on triggers. Frames that clear both gates pass through a two-stage neural pipeline: Google's YAMNet model extracts 1024- dimensional acoustic embeddings, and a purpose-trained Keras Sequential classifier maps those embeddings to a binary confidence score. When confidence exceeds 0.28, the system sends a UDP packet over the local Wi-Fi network to an ESP32 microcontroller, whose on-board LED toggles as hardware confirmation of successful end-to-end delivery. The entire loop — from live audio capture, through AI inference, to physical actuation — runs within a sub-second latency budget on commodity CPU hardware, with no internet dependency. The result is a privacy-preserving, network- independent architecture that generalises to any latency-critical acoustic IoT trigger application.

Keywords: Edge Computing; Acoustic Event Detection; YAMNet; Transfer Learning; Keras; ESP32; Real-Time Signal Processing; Deterministic Frame-Skipping; IoT Actuation.

How to Cite:

[1] Jatin Verma, Abhishek Jatla, Thanuja Reddy, Ms. Charulatha R.T, “A Hybrid Signal-Processing and Neural Network Pipeline for Low-Latency Acoustic Event Detection at the Edge,” International Journal of Advanced Research in Computer and Communication Engineering (IJARCCE), DOI: 10.17148/IJARCCE.2026.154125

Creative Commons License This work is licensed under a Creative Commons Attribution 4.0 International License.