Hisanori Makinae
National Research Institute of Police Science
Network
Latest external collaboration on country level. Dive into details by clicking on the dots.
Publication
Featured researches published by Hisanori Makinae.
Archive | 2012
Kanae Amino; Takashi Osanai; Toshiaki Kamada; Hisanori Makinae; Takayuki Arai
This chapter introduces experiments on speaker recognition where we focus on two of the factors that affect speaker recognition accuracy: phonological contents of the speech materials used for identifying speakers and the transmission channel difference in automatic speaker verification. Through the experiments, we show that nasal sounds are effective for forensic speaker recognition despite the differences in speaker sets and recording channels. Also we show that performance degradation by the channel difference, in this study air- and bone-conduction, can be improved by devising normalisation methods and acoustic parameters.
Archive | 2012
Kanae Amino; Takashi Osanai; Toshiaki Kamada; Hisanori Makinae; Takayuki Arai
Forensic phonetics and acoustics are nowadays widely used regarding police and legal use of acoustic samples. Among many tasks included in this area, forensic speaker recognition is considered as one of the most complex problems. Forensic speaker recognition, sometimes called forensic speaker comparison, is a process for making judgments on whether or not two speech samples are from the same speaker. This chapter introduces the historical backgrounds of forensic speaker recognition including “voiceprint” controversy, human-based visual and auditory forensic speaker recognition, and automatic forensic speaker recognition. Procedural considerations in forensic speaker recognition processes and factors that affect recognition performances are also presented. Finally, we will give a summary of the progress and developments made in the forensic automatic speaker recognition.
IEICE Transactions on Information and Systems | 2008
Toshiaki Kamada; Nobuaki Minematsu; Takashi Osanai; Hisanori Makinae; Masumi Tanimoto
In forensic voice telephony speaker verification, we may be requested to identify a speaker in a very noisy environment, unlike the conditions in general research. In a noisy environment, we process speech first by clarifying it. However, the previous study of speaker verification from clarified speech did not yield satisfactory results. In this study, we experimented on speaker verification with clarification of speech in a noisy environment, and we examined the relationship between improving acoustic quality and speaker verification results. Moreover, experiments with realistic noise such as a crime prevention alarm and power supply noise was conducted, and speaker verification accuracy in a realistic environment was examined. We confirmed the validity of speaker verification with clarification of speech in a realistic noisy environment.
international conference on electrical computer and communication engineering | 2017
Shiming Zhang; Yosuke Sugiura; Tetsuya Shimamura; Hisanori Makinae
Air-conducted speech is currently used for fundamental frequency extraction in noisy environment. However, the results are severely affected by noise. In this paper, additionally bone-conducted speech is combined with air-conducted speech and a novel pitch extraction method is derived. Experiments indicate that the proposed method provides an accurate estimate of fundamental frequency of speech in highly noisy environment.
Journal of the Acoustical Society of America | 2016
Tatsuya Kitamura; Hironori Takemoto; Hisanori Makinae; Tetsutaro Yamaguchi; Kotaro Maki
The shape of the nasal cavity and paranasal sinuses is complex and varies between individuals. Because the shape is almost stable during speech, the acoustic properties could constantly provide speaker specific information to speech sounds, that is, speaker individuality. In this preliminary analysis, the shape was extracted from cone-beam CT data for a subject using a machine learning technique and its acoustic properties were examined using finite-difference time-domain simulation. The transfer function from the glottis to the nostrils was calculated and the distribution pattern of the pressure anti-nodes was visualized at frequencies of major spectral peaks and dips. In addition, transfer functions were calculated when each of the paranasal sinuses other than the ethmoidal ones was occluded to identify which sinus caused which dip. As a result, the longitudinal resonance in the right or left half of the nasal cavity generated each peak, while the transverse resonance in the pharyngeal cavity caused the...
conference of the international speech communication association | 2008
Satoru Tsuge; Takashi Osanai; Hisanori Makinae; Toshiaki Kamada; Minoru Fukumi; Shingo Kuroiwa
Acoustical Science and Technology | 2018
Kanae Amino; Hisanori Makinae; Toshiaki Kamada
Acoustical Science and Technology | 2018
Kanae Amino; Hisanori Makinae; Toshiaki Kamada; Takashi Osanai
international symposium on intelligent signal processing and communication systems | 2017
Daiki Watanabe; Yosuke Sugiura; Tetsuya Shimamura; Hisanori Makinae
conference of the international speech communication association | 2017
Tatsuya Kitamura; Hironori Takemoto; Hisanori Makinae; Tetsutaro Yamaguchi; Koutaro Maki
Collaboration
Dive into the Hisanori Makinae's collaboration.
National Institute of Information and Communications Technology
View shared research outputs