Tsunenori Ishioka

Network

Latest external collaboration on country level. Dive into details by clicking on the dots.

Explore More

Hotspot

Dive into the research topics where Tsunenori Ishioka is active.

Explore More

Publication

Featured researches published by Tsunenori Ishioka.

meeting of the association for computational linguistics | 2006

Automated Japanese Essay Scoring System based on Articles Written by Experts

Tsunenori Ishioka; Masayuki Kameda

We have developed an automated Japanese essay scoring system called Jess. The system needs expert writings rather than expert raters to build the evaluation model. By detecting statistical outliers of predetermined aimed essay features compared with many professional writings for each prompt, our system can evaluate essays. The following three features are examined: (1) rhetoric --- syntactic variety, or the use of various structures in the arrangement of phases, clauses, and sentences, (2) organization --- characteristics associated with the orderly presentation of ideas, such as rhetorical features and linguistic cues, and (3) content --- vocabulary related to the topic, such as relevant information and precise or specialized vocabulary. The final evaluation score is calculated by deducting from a perfect score assigned by a learning process using editorials and columns from the Mainichi Daily News newspaper. A diagnosis for the essay is also given.

database and expert systems applications | 2004

Automated Japanese essay scoring system:jess

Tsunenori Ishioka; Masayuki Kameda

We have developed an automated Japanese essay scoring system named jess. The system evaluates an essay from three features: (1) rhetoric - ease of reading, diversity of vocabulary, percentage of big words (long, difficult words), and percentage of passive sentences; (2) organization - characteristics associated with the orderly presentation of ideas, such as rhetorical features and linguistic cues; (3) contents - vocabulary related to the topic, such as relevant information and precise or specialized vocabulary. The final evaluated score is calculated by deducting from a perfect score assigned by a learning process using editorials and columns from the Mainichi Daily News newspaper. A diagnosis for the essay is also given. Our system does not need any essays graded by human experts.

web intelligence | 2003

Evaluation of criteria for information retrieval

Tsunenori Ishioka

We investigate van Rijsbergens F-measure, break-even point, and 11-point averaged precision, all of which can be translated into 1-dimensional scalar quantity from the precision and the recall. These investigations can be done by comparing to tetrachoric (four-fold) correlation coefficient and phi (four-fold point) coefficient, which are often used as the index of statistical association in a 2/spl times/2 contingency table. The results show that when a fallout rate is less than 0.1, (1) the F/sub 1/ measure has similar properties of the phi coefficient, (2) the break-even point is almost equivalent to a phi coefficient, and (3) the 11-point averaged precision should be a measure, which is larger than a phi coefficient and has a value smaller than a tetrachoric correlation coefficient.

Proceedings of the International Conference on Web Intelligence | 2017

Overwritable automated japanese short-answer scoring and support system

Tsunenori Ishioka; Masayuki Kameda

We have developed an automated Japanese short-answer scoring and support machine for new National Center written test exams. Our approach is based on the fact that accurate recognizing textual entailment and/or synonymy has been almost impossible for several years. The system generates automated scores on the basis of evaluation criteria or rubrics, and human raters revise them. The system determines semantic similarity between the model answers and the actual written answers as well as a certain degree of semantic identity and implication. Owing to the need for the scoring results to be classified at multiple levels, we use random forests to utilize many predictors effectively rather than use support vector machines. An experimental prototype operates as a web system on a Linux computer. We compared human scores with the automated scores for a case in which 3--6 allotment points were placed in 8 categories of a social studies test as a trial examination. The differences between the scores were within one point for 70--90 percent of the data when high semantic judgment was not needed.

information integration and web-based applications & services | 2014

Investigations into Missing Values Imputation Using Random Forests for Semi-supervised Data

Tsunenori Ishioka

This paper presents a revised procedure that imputes missing values by using random forests on semi-supervised data. The method has a feature that not only allows missing data to be found in a response variable but in a predictive variable, and furthermore, it can now deal with any types of data, i.e., numerical values, categories and categories with an order. By evaluating this method using Titanic data and eleven UC Irvine repository datasets, we found that our method performed fairly well, and a method of naive median imputation was also suitable in these cases.

Archive | 2003

Text Segmentation by Latent Semantic Indexing

Tsunenori Ishioka

We point out that the determination of the document boundary is possible by using Singular Value Decomposition (SVD) based on the idea of LSI (Latent Semantic Indexing), and show that the conditional entropy method is replaceable by the SVD method using illustrations from several well-known plays. If we use this entropy model, the homoscedasticity test can be available to detect the document boundary, because the entropy depends on the variance of k-variables of LSI.

Systems and Computers in Japan | 2004

Evaluation of criteria on information retrieval

Tsunenori Ishioka

intelligent data engineering and automated learning | 2000

Extended K-means with an Efficient Estimation of the Number of Clusters

Tsunenori Ishioka

information integration and web-based applications & services | 2012

Imputation of missing values for semi-supervised data using the proximity in random forests

Tsunenori Ishioka

Japanese Journal of Applied Statistics | 1999

Document Retrieval Based on Words' Cooccurrences, the Algorithm and Its Application

Tsunenori Ishioka; Masayuki Kameda

Explore More

Collaboration

Dive into the Tsunenori Ishioka's collaboration.

Top Co-Authors

Masayuki Kameda

Ricoh

View shared research outputs

Top Co-Authors

Hideki Nagatsuka

Tokyo Metropolitan University

View shared research outputs

Top Co-Authors

Toshinari Kamakura

Chuo University

View shared research outputs

Explore More

National Center for University Entrance Examinations

Archive Network Publication Hotspot Collaboration

Network

Latest external collaboration on country level. Dive into details by clicking on the dots.

Hotspot

Dive into the research topics where Tsunenori Ishioka is active.

Publication

Featured researches published by Tsunenori Ishioka.

Automated Japanese Essay Scoring System based on Articles Written by Experts

Automated Japanese essay scoring system:jess

Evaluation of criteria for information retrieval

Overwritable automated japanese short-answer scoring and support system

Investigations into Missing Values Imputation Using Random Forests for Semi-supervised Data

Text Segmentation by Latent Semantic Indexing

Evaluation of criteria on information retrieval

Extended K-means with an Efficient Estimation of the Number of Clusters

Imputation of missing values for semi-supervised data using the proximity in random forests

Document Retrieval Based on Words' Cooccurrences, the Algorithm and Its Application

Collaboration

Dive into the Tsunenori Ishioka's collaboration.