Helped by the voiceprint recognition method, theAlibaba-developed voice recognition system canidentify multiple languages such as Chinese, Japanese, English and Russian, as well as Chinesedialects from different provinces such as Hunan, Hubei, Henan, Sichuan and Guangdong.
Transforming voice into , the system compares the s with keywords in its lexiconand anti-spam audio models to determine if something is pornographic.
The lexicon and anti-spam audio models collect tens of thousands of pornographic words withthe same or similar pronunciations, according to Alibaba.
The system monitors both online and offline voice files. It also has the ability to adapt and"learn" through constant use. For example, its Cantonese recognition ability was cultivatedby watching TV series.