Publication


Featured research published by Shunsuke Kozawa.


Archive | 2011

Advice Extraction from Web for Providing Prior Information Concerning Outdoor Activities

Shunsuke Kozawa; Masayuki Okamoto; Shinichi Nagano; Kenta Cho; Shigeki Matsubara

Conventional context-aware recommendation systems provide information based on users’ ongoing activity, but they do not provide information before users act. However, users planning outdoor activities such as climbing and sightseeing want prior information, such as how to reach their destination or obtain necessary items. Collecting this prior information takes time because it is not easy to find. This paper proposes a method for extracting prior advice from the web. The method first identifies whether a given sentence is advice. If it is, the method then identifies whether the sentence is prior advice. We show the effectiveness of the proposed method through experiments. We also developed a system that provides prior information using the proposed method.
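
The two-stage decision described in this abstract can be pictured as a simple cascade. The following is a minimal sketch only, not the authors' classifier: the abstract does not say which features or models are used, so the cue-word lists and the function names `is_advice`, `is_prior_advice`, and `classify` are hypothetical stand-ins.

```python
# Hypothetical cue words; the paper's actual features are not given in the abstract.
ADVICE_CUES = ("should", "recommend", "be sure to", "had better")
PRIOR_CUES = ("before", "in advance", "bring", "reserve")


def is_advice(sentence: str) -> bool:
    """Stage 1: decide whether the sentence is advice at all."""
    text = sentence.lower()
    return any(cue in text for cue in ADVICE_CUES)


def is_prior_advice(sentence: str) -> bool:
    """Stage 2: decide whether an advice sentence concerns preparation."""
    text = sentence.lower()
    return any(cue in text for cue in PRIOR_CUES)


def classify(sentence: str) -> str:
    """Run stage 2 only on sentences that pass stage 1."""
    if not is_advice(sentence):
        return "not advice"
    return "prior advice" if is_prior_advice(sentence) else "other advice"
```

Because stage 2 only ever sees sentences accepted by stage 1, each stage can be tuned or replaced separately, for example by swapping the keyword checks for trained classifiers.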


2014 17th Oriental Chapter of the International Committee for the Co-ordination and Standardization of Speech Databases and Assessment Techniques (COCOSDA) | 2014

Revised catalogue specifications of speech corpora with user-friendly visualization and search system

Shuichi Itahashi; Tomoko Ohsuga; Yuichi Ishimoto; Hiroaki Kojima; Kiyotaka Uchimoto; Shunsuke Kozawa

It is well known that speech corpora are indispensable to speech research. To meet this demand, several data centers that serve as repositories for various speech corpora have been set up worldwide. However, they use different specification systems for their corpora, so it is difficult for users to compare and select suitable corpora. It would be more convenient for users if each data center used a common specification system for describing its corpora. Based on this idea, we have already proposed a set of specification attributes and items as a first step towards standardization, but the scale of the retrieval system was limited. This paper introduces a revised version of the speech corpora specification attributes and items, connected with the large-scale metadata database “SHACHI” and combined with the “Concentric Ring View (CRV) System” to improve the user interface.


Archive | 2012

Automatic Collection of Useful Phrases for English Academic Writing

Shunsuke Kozawa; Yuta Sakai; Kenji Sugiki; Shigeki Matsubara

English academic writing is indispensable for researchers to present their research achievements. It is hard for non-native researchers to write research papers in English, and they often refer to phrase dictionaries for academic writing to find useful expressions. However, lexica available on the market do not have enough expressions and example sentences to serve this purpose, since they are created by hand. To respond to the demand for better lexica, this paper proposes a method for automatically extracting useful expressions from English research papers. The expressions are extracted based on four characteristics of such expressions. The extracted expressions are classified into five classes: “introduction”, “related work”, “proposed method”, “experiment”, and “conclusion”. In our experiment using 1,232 research papers, our proposed method achieved 57.5% precision and 51.9% recall. The f-measure was higher than those of the baselines, and we therefore confirmed the validity of our method. We developed a phrase search system using the extracted phrasal expressions to support English academic writing.
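
The abstract reports precision and recall but not the f-measure itself. As a rough worked example, assuming the balanced F1 variant (the abstract does not state which weighting was used), the value can be recomputed from the two reported figures. The function name `f_measure` and the `beta` parameter are illustrative choices, not taken from the paper.

```python
def f_measure(precision: float, recall: float, beta: float = 1.0) -> float:
    """F-beta score; beta = 1 gives the harmonic mean of precision and recall."""
    if precision == 0.0 and recall == 0.0:
        return 0.0
    b2 = beta * beta
    return (1 + b2) * precision * recall / (b2 * precision + recall)


# Precision and recall as reported in the abstract; F1 is an assumption.
f1 = f_measure(0.575, 0.519)
print(f"F1 = {f1:.3f}")
```

Under that assumption the score comes out at about 0.546, that is, roughly 54.6%.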


International Journal of Knowledge and Web Intelligence | 2012

Design and collection of ontological metadata for enhancing interoperability of language resources

Shunsuke Kozawa; Hitomi Tohyama; Kiyotaka Uchimoto; Shigeki Matsubara; Hitoshi Isahara

This paper describes the design and implementation of a large-scale ontological database named SHACHI, which stores detailed metadata on language resources (LRs) in Asian and Western countries. SHACHI has been constructed to enhance the interoperability of LRs, that is, to effectively combine LRs, to systematically store LR metadata, to provide a common infrastructure for web services, to investigate languages, tag sets, and formats compiled in LRs, and ultimately to utilise all these factors for more efficient development of LRs. This ontological metadata database, containing more than 2,000 compiled LRs such as corpora, dictionaries, thesauruses and lexicons, also serves as a large-scale archive of LR metadata, and its website is now open to the public and accessible to all internet users. The SHACHI metadata set is an extended version of the OLAC metadata set, which conforms to the Dublin Core metadata element set. This paper first presents the methodologies to systematically store LR metadata and efficiently LR catalogues, and then explains the structure of the ontological metadata database, as well as the realisation of the LR catalogue search tool. The usefulness of the ontology search function has been investigated.
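
To make the idea of a Dublin Core based metadata record with extensions concrete, here is a small sketch. It is not the SHACHI or OLAC schema: the fifteen Dublin Core element names are standard, but the extension fields `tag_set` and `usage`, the sample records, and the helpers `validate` and `search` are all hypothetical and exist only for illustration.

```python
# The 15 elements of the Dublin Core Metadata Element Set.
DUBLIN_CORE = {
    "title", "creator", "subject", "description", "publisher",
    "contributor", "date", "type", "format", "identifier",
    "source", "language", "relation", "coverage", "rights",
}

# Hypothetical extension fields, invented for this sketch only.
EXTENSION_FIELDS = {"tag_set", "usage"}


def validate(record: dict) -> list:
    """Return the sorted field names that belong to neither set."""
    allowed = DUBLIN_CORE | EXTENSION_FIELDS
    return sorted(k for k in record if k not in allowed)


def search(records: list, field: str, value: str) -> list:
    """Return titles of records whose field contains value (case-insensitive)."""
    return [r["title"] for r in records
            if value.lower() in str(r.get(field, "")).lower()]


# Made-up sample records, not entries from any real catalogue.
records = [
    {"title": "Example Speech Corpus", "language": "Japanese",
     "type": "corpus", "tag_set": "example-pos"},
    {"title": "Example Bilingual Dictionary", "language": "Japanese; English",
     "type": "dictionary"},
]
```

With these records, `search(records, "language", "english")` returns only the dictionary entry, and `validate` flags any field outside the agreed vocabulary. A shared element set is what lets records from different providers be compared in this way.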


Asia Information Retrieval Symposium | 2011

Acquisition of know-how information from web

Shunsuke Kozawa; Kiyotaka Uchimoto; Shigeki Matsubara

A variety of know-how, such as recipes and solutions to problems, is stored on the Web. However, it is not easy to find specific know-how information. If know-how could be detected appropriately, it would be much easier to learn how to tackle unforeseen situations such as accidents and disasters. This paper proposes a promising method for acquiring know-how information from the Web. First, we extract passages containing at least one target object and then extract know-how candidates from them. Then, passages containing know-how are discriminated from non-know-how information by considering each object and its typical usage.


Advances in Intelligent Decision Technologies: Proceedings of the Second KES International Symposium IDT 2010 | 2010

Automatic Extraction of Phrasal Expressions for Supporting English Academic Writing

Shunsuke Kozawa; Yuta Sakai; Kenji Sugiki; Shigeki Matsubara

English academic writing is not easy for non-native researchers. They often refer to phrase lexica for English research papers to find useful expressions for academic writing. However, commercially available lexica do not contain enough expressions. Therefore, we propose a method for automatically extracting useful expressions from English research papers. We found four characteristics of the expressions by analyzing an existing phrase lexicon for English research papers. The expressions are extracted from research papers based on statistical and syntactic information. In our experiment using 1,232 research papers, our proposed method achieved 57.5% precision and 51.9% recall. The f-measure was higher than those of the baselines, and we therefore confirmed the feasibility of our method.


Language Resources and Evaluation | 2008

Automatic Acquisition of Usage Information for Language Resources

Shunsuke Kozawa; Hitomi Tohyama; Kiyotaka Uchimoto; Shigeki Matsubara


Language Resources and Evaluation | 2008

Construction of a Metadata Database for Efficient Development and Use of Language Resources

Hitomi Tohyama; Shunsuke Kozawa; Kiyotaka Uchimoto; Shigeki Matsubara; Hitoshi Isahara


Journal of Natural Language Processing | 2014

Adaptation of Long-Unit-Word Analysis System to Different Part-Of-Speech Tagset

Shunsuke Kozawa; Kiyotaka Uchimoto; Yasuharu Den


International Conference on Computational Linguistics | 2008

Construction of an Infrastructure for Providing Users with Suitable Language Resources

Hitomi Tohyama; Shunsuke Kozawa; Kiyotaka Uchimoto; Shigeki Matsubara; Hitoshi Isahara

Collaboration


Dive into Shunsuke Kozawa's collaborations.

Top Co-Authors

Kiyotaka Uchimoto

National Institute of Information and Communications Technology

Hitoshi Isahara

Toyohashi University of Technology

Yuichi Ishimoto

National Institute of Informatics

Hiroaki Kojima

National Institute of Advanced Industrial Science and Technology
