A voice detector is a device designed to detect the sounds that people make when they speak or sing.
Computer scientists have been searching for ways to enable computers to record, interpret and understand human speech since the 1960s. This has been a daunting task throughout the decades. Even the most rudimentary problem, such as sampling voice, was a huge challenge in the early years. It took until the 1980s before the first systems arrived that could actually decipher speech (Goel and Singh, 2014).
Furthermore, in anticipation of advances in sound-processing techniques, inventors and engineers built the first voice recognition system in the 1950s, which could only recognize digits (Pinola, 2011). “Audrey”, the first voice recognition system, introduced in 1952, was able to recognize spoken digits (Warren, 2014). In other words, “Audrey” could only distinguish between the ten digits from zero to nine.
The IBM Shoebox was the most advanced voice recognition machine of its time, able to understand 16 words spoken in English when it was revealed at the Seattle World’s Fair in 1962 (Kane, 2015). Further improvement in voice recognition technology can be seen in the 1970s with the Harpy system. Harpy is a voice recognition system developed at Carnegie-Mellon University that resulted from a performance analysis of the design choices in two earlier speech recognition systems, the Hearsay-I system and the Dragon system (Lowerre, 1976). According to Pinola (2011), the Harpy system could understand 1,011 words, approximately the vocabulary of a three-year-old child.
In the 1980s, the Hidden Markov Model (HMM) marked the turning point from voice recognition to voice prediction (Gales and Young, 2007). Rather than matching sounds against fixed templates, an HMM estimates the most probable sequence of words for a given sound input, which allows speech to be converted into written words more accurately. In the 1990s, the first voice recognition product for consumers, Dragon Dictate, was developed. This new product could recognize continuous speech at about 100 words per minute (Pinola, 2011). In the late 2000s, Google introduced voice recognition software that would serve as a foundation for the company’s later Voice Search product (Huang, Baker, and Reddy, 2014). Besides that, according to Martins, Trancoso, Abad, and Meinedo (2009), current voice detector technology can recognize the gender of a speaker by analyzing the detected voice.
Voice detectors are also used to detect unusual vocal sounds for nursing system purposes (Wilson et al., 2009). Examples of such sounds include coughs, groans, wheezes and cries. In addition, voice/non-voice (VNV) detection, which is used for determining the regions of vocal fold activity in the speech signal, is widely used in speech processing applications such as speech enhancement, speech coding and speech recognition (Kumar and Rao, 2016). With the implementation of Artificial Intelligence (AI) in sound technology, interaction between humans and machines such as computers or smartphones has become possible. For instance, Siri on Apple smartphones and iPads, Google search and Cortana in Windows 10 allow interaction between humans and smart devices.
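To make the idea of voice/non-voice (VNV) detection mentioned above more concrete, the following is a minimal sketch of a classic textbook heuristic. It is not the method of Kumar and Rao (2016). It assumes that voiced frames (where the vocal folds vibrate) have relatively high short-time energy and a low zero-crossing rate. The function names, frame sizes and thresholds are illustrative assumptions, not values taken from any cited work.

```python
import math

def frame_signal(samples, frame_len, hop):
    """Split a list of samples into overlapping frames of equal length."""
    return [samples[i:i + frame_len]
            for i in range(0, len(samples) - frame_len + 1, hop)]

def short_time_energy(frame):
    """Mean squared amplitude of one frame."""
    return sum(x * x for x in frame) / len(frame)

def zero_crossing_rate(frame):
    """Fraction of adjacent sample pairs whose signs differ."""
    crossings = sum(1 for a, b in zip(frame, frame[1:])
                    if (a >= 0) != (b >= 0))
    return crossings / (len(frame) - 1)

def detect_voiced(samples, frame_len=400, hop=200,
                  energy_thresh=0.01, zcr_thresh=0.25):
    """Return one boolean per frame: True if the frame looks voiced.

    The thresholds are assumed values for signals scaled to [-1, 1];
    a real system would tune them or use a trained classifier.
    """
    return [short_time_energy(f) > energy_thresh
            and zero_crossing_rate(f) < zcr_thresh
            for f in frame_signal(samples, frame_len, hop)]

# Synthetic demo at an assumed 16 kHz sample rate:
# 0.1 s of silence followed by 0.1 s of a 150 Hz tone standing in for a vowel.
sample_rate = 16000
silence = [0.0] * 1600
tone = [0.5 * math.sin(2 * math.pi * 150 * n / sample_rate)
        for n in range(1600)]
flags = detect_voiced(silence + tone)
print(flags)
```

In this demo the frames that contain only silence are marked as non-voiced, and the frames that contain the tone are marked as voiced. A loud but rapidly alternating signal, similar to a hiss, would be rejected by the zero-crossing test even though its energy is high.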