Publication


Featured research published by Tuğba Yıldız.


international conference on computational linguistics | 2013

Extraction of part-whole relations from Turkish corpora

Tuğba Yıldız; Savaş Yıldırım; Banu Diri

In this work, we present a model for semi-automatically extracting part-whole relations from raw Turkish text. The model takes a list of manually prepared seeds, induces syntactic patterns from them, and estimates the reliability of each pattern. It then captures the variations of part-whole candidates in the corpus. To obtain precise meronymic relationships, the candidates are ranked and selected according to their reliability scores. We compare several metrics for evaluating the strength of association between a pattern and the pairs it matches. We conclude with a discussion of the results and show that the model gives promising results for Turkish text.
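The abstract does not name the association metrics it compares. As one illustration, pointwise mutual information (PMI) is a common choice for scoring how strongly a pattern is associated with a matched pair. The sketch below is an assumption, not the authors' method: the function `pmi`, the pattern labels, and all counts are hypothetical.

```python
import math

def pmi(counts, pair, pattern):
    """PMI between a (part, whole) pair and a pattern.

    counts maps (pair, pattern) to the number of times the pattern
    matched that pair in the corpus (hypothetical data).
    """
    total = sum(counts.values())
    joint = counts.get((pair, pattern), 0)
    if joint == 0:
        return float("-inf")  # never observed together
    pair_total = sum(c for (p, _), c in counts.items() if p == pair)
    pattern_total = sum(c for (_, t), c in counts.items() if t == pattern)
    return math.log2(joint * total / (pair_total * pattern_total))

# Hypothetical counts: "kapı" (door) and "ev" (house) under two patterns.
counts = {
    (("kapı", "ev"), "GENITIVE"): 8,
    (("kapı", "ev"), "CONJUNCTION"): 2,
    (("masa", "ev"), "CONJUNCTION"): 6,
}

strong = pmi(counts, ("kapı", "ev"), "GENITIVE")
weak = pmi(counts, ("kapı", "ev"), "CONJUNCTION")
```

With these toy counts the genitive pattern scores above zero and the conjunction pattern below zero, so a ranking by PMI would prefer the genitive match.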


international conference natural language processing | 2014

An Integrated Approach to Automatic Synonym Detection in Turkish Corpus

Tuğba Yıldız; Savaş Yıldırım; Banu Diri

In this study, we designed a model to determine synonymy. Our main assumption is that synonym pairs, by definition, show similar semantic and dependency relations: they share the same meronym/holonym and hypernym/hyponym relations. Unlike synonymy, hypernymy and meronymy relations can probably be acquired by applying lexico-syntactic patterns to a large corpus, and the relations acquired this way can be used to ease the detection of synonymy. Likewise, we used particular dependency relations, such as the object or subject of a verb. Machine learning algorithms were applied to all of these acquired features. The first aim is to find out which dependency and semantic features are the most informative and contribute most to the model. The performance of each feature is evaluated individually with cross-validation. The model that combines all features shows promising results and successfully detects the synonymy relation. The main contribution of the study is the integration of semantic and dependency relations within a distributional approach. The second contribution is that the study is considered the first major corpus-driven attempt at Turkish synonym identification.


international symposium on innovations in intelligent systems and applications | 2012

Association rule based acquisition of hyponym and hypernym relation from a Turkish corpus

Tuğba Yıldız; Savaş Yıldırım

In this paper, we propose a method for the automatic acquisition of hypernymy/hyponymy relations from raw Turkish text. Once the model has extracted prospective hyponyms using lexico-syntactic patterns, an Apriori algorithm is applied to eliminate faulty hyponyms and increase precision. We show that a model based on a particular lexico-syntactic pattern and association rules for the Turkish language can successfully retrieve many is-a relations with high precision.
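Association-rule mining scores a rule by its support and confidence, and Apriori prunes itemsets that fall below a minimum support. The following is a minimal sketch of those two measures only. How the paper builds its transactions is not stated in the abstract, so the transactions, words, and function names here are hypothetical.

```python
def support(transactions, itemset):
    """Fraction of transactions that contain every item in itemset."""
    itemset = set(itemset)
    return sum(itemset <= t for t in transactions) / len(transactions)

def confidence(transactions, antecedent, consequent):
    """Estimate of P(consequent | antecedent) over the transactions."""
    base = support(transactions, antecedent)
    if base == 0:
        return 0.0
    return support(transactions, set(antecedent) | set(consequent)) / base

# Hypothetical transactions: nouns that co-occur in one pattern match.
# elma = apple, armut = pear, meyve = fruit, masa = table
transactions = [
    {"elma", "armut", "meyve"},
    {"elma", "meyve"},
    {"elma", "masa"},
    {"armut", "meyve"},
]

sup = support(transactions, {"elma", "meyve"})
conf_good = confidence(transactions, {"elma"}, {"meyve"})
conf_bad = confidence(transactions, {"masa"}, {"meyve"})
```

A candidate such as "masa" for the hypernym "meyve" gets zero confidence on this toy data, so a confidence threshold would eliminate it.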


international conference on computational linguistics | 2012

Corpus-Driven hyponym acquisition for Turkish language

Savaş Yıldırım; Tuğba Yıldız

In this study, we propose a method for the acquisition of hyponymy relations for the Turkish language. This integrated method relies on both lexico-syntactic patterns and semantic similarity. Once the model has extracted the items using patterns, it applies similarity-based elimination of the incorrect ones in order to increase precision. We show that the algorithm, based on a particular lexico-syntactic pattern for the Turkish language, can retrieve many hyponymy relations, and we demonstrate that elimination based on semantic similarity gives promising results. We discuss how we measure the similarity between concepts. The objective is to obtain more relevant and more precise results. The experiments show that this approach gives successful results with high precision.


signal processing and communications applications conference | 2015

Pattern and semantic similarity based automatic extraction of hyponym-hypernym relation from Turkish corpus

Gurkan Sahin; Banu Diri; Tuğba Yıldız

Extraction of semantic relations from various resources (Wikipedia, the Web, corpora, etc.) is an important issue in natural language processing. This paper aims at the automatic extraction of hyponym-hypernym pairs from a Turkish corpus. Pattern-based and semantic-similarity-based methods are used together for the extraction. Patterns are extracted from initial hyponym-hypernym pairs and, using these patterns, hyponyms are extracted for various hypernyms. Incorrect candidate hyponyms are removed using elimination methods based on document frequency and semantic similarity. In experiments on 14 hypernyms, an average accuracy of 77% was obtained.


language and technology conference | 2009

Pronoun Resolution in Turkish Using Decision Tree and Rule-Based Learning Algorithms

Savaş Yıldırım; Yılmaz Kılıçaslan; Tuğba Yıldız

This paper reports on the results of pronoun resolution experiments performed by applying a decision tree and a rule-based algorithm to an annotated Turkish text. The text has been compiled semi-automatically, mostly from various popular children's stories. A knowledge-lean learning model has been devised using only the nine most commonly employed features. The performances achieved with the two algorithms are evaluated and compared in terms of recall, precision, and F-measure.


International Journal of Computational Intelligence Systems | 2018

Learning Turkish Hypernymy Using Word Embeddings

Savaş Yıldırım; Tuğba Yıldız

Recently, Neural Network Language Models have been effectively applied to many types of Natural Language Processing (NLP) tasks. One popular type of task is the discovery of semantic and syntactic regularities that support researchers in building a lexicon. Word embedding representations are notably good at discovering such linguistic regularities. We argue that two supervised learning approaches based on word embeddings can be successfully applied to the hypernym problem, namely, utilizing embedding offsets between word pairs and learning a semantic projection to link the words. The offset-based model classifies offsets as hypernym or not. The semantic projection approach trains a semantic transformation matrix that ideally maps a hyponym to its hypernym. A semantic projection model can learn a projection matrix provided that there is a sufficient number of training word pairs. However, we argue that such models tend to learn an is-a-particular-hypernym relation rather than generalize the is-a relation. The embeddings are trained with both the Continuous Bag-of-Words and the Skip-Gram training models on a huge corpus of Turkish text. The main contribution of the study is the development of a novel and efficient architecture that is well suited to applying word embedding approaches to the Turkish language domain. We report that both the projection and the offset classification models give promising and novel results for the Turkish language.
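The offset idea can be illustrated with a small sketch. The paper trains a supervised classifier on offsets; the code below does not do that. As a simplification, it only computes the offset (hypernym vector minus hyponym vector) and compares offsets by cosine similarity. The two-dimensional vectors are made up for illustration and are not real embeddings.

```python
import math

def offset(hyponym_vec, hypernym_vec):
    """Embedding offset: hypernym vector minus hyponym vector."""
    return [b - a for a, b in zip(hyponym_vec, hypernym_vec)]

def cosine(u, v):
    """Cosine similarity; returns 0.0 if either vector is all zeros."""
    dot = sum(a * b for a, b in zip(u, v))
    norm_u = math.sqrt(sum(a * a for a in u))
    norm_v = math.sqrt(sum(b * b for b in v))
    if norm_u == 0 or norm_v == 0:
        return 0.0
    return dot / (norm_u * norm_v)

# Hypothetical toy embeddings (kedi = cat, hayvan = animal,
# elma = apple, meyve = fruit).
emb = {
    "kedi": [1.0, 0.0],
    "hayvan": [1.0, 1.0],
    "elma": [0.0, 0.2],
    "meyve": [0.1, 1.1],
}

known = offset(emb["kedi"], emb["hayvan"])       # a known is-a pair
candidate = offset(emb["elma"], emb["meyve"])    # correct direction
reversed_pair = offset(emb["meyve"], emb["elma"])  # wrong direction

sim_candidate = cosine(known, candidate)
sim_reversed = cosine(known, reversed_pair)
```

On this toy data the correctly ordered pair has an offset close to the known one, while the reversed pair points the opposite way. A trained classifier, as described in the abstract, would replace the fixed cosine comparison.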


signal processing and communications applications conference | 2016

A hybrid method for extracting Turkish part-whole relation pairs from corpus

Gurkan Sahin; Banu Diri; Tuğba Yıldız

Extraction of various semantic relation pairs from different sources (dictionary definitions, corpora, etc.) with high accuracy is one of the most popular topics in natural language processing (NLP). In this study, a hybrid method is proposed to extract Turkish part-whole pairs from a corpus. Corpus statistics, WordNet similarities, and Word2Vec word vector similarities are used together. First, initial part-whole seeds are prepared and, using these seeds, part-whole patterns are extracted from the corpus. For each pattern, a reliability score is calculated, and reliable patterns are selected to produce new pairs from the corpus. Various reliability scores are used for the new pairs. To measure the success of the method, 19 target whole words are selected, and average precisions of 83% (first 10 pairs), 74% (first 20 pairs), and 68% (first 30 pairs) are obtained.
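The figures for the first 10, 20, and 30 pairs correspond to precision measured over the top of a ranked list. A minimal sketch of that measure follows, assuming a ranked candidate list and a gold set of correct pairs. The function name and the example pairs are hypothetical and are not data from the paper.

```python
def precision_at_k(ranked_pairs, gold, k):
    """Share of the first k ranked pairs that appear in the gold set.

    If fewer than k pairs are available, the share is taken over
    the pairs that exist.
    """
    top = ranked_pairs[:k]
    if not top:
        return 0.0
    return sum(pair in gold for pair in top) / len(top)

# Hypothetical ranked (part, whole) candidates for "araba" (car).
ranked = [
    ("tekerlek", "araba"),  # wheel
    ("motor", "araba"),     # engine
    ("yol", "araba"),       # road, not a part
    ("kapı", "araba"),      # door
]
gold = {("tekerlek", "araba"), ("motor", "araba"), ("kapı", "araba")}

p2 = precision_at_k(ranked, gold, 2)
p4 = precision_at_k(ranked, gold, 4)
```

Averaging such values over all target whole words gives one figure per cutoff.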


language and technology conference | 2013

A Study on Turkish Meronym Extraction Using a Variety of Lexico-Syntactic Patterns

Tuğba Yıldız; Savaş Yıldırım; Banu Diri

In this paper, we applied lexico-syntactic patterns to uncover the meronymy relation in a huge raw Turkish text. Once the system has taken a huge raw corpus and extracted the matches for a given pattern, it proposes a list of whole-part pairs based on their co-occurrence frequencies. For this purpose, we exploited and compared a list of pattern clusters. The clusters examined fall into three types: general patterns, dictionary-based patterns, and bootstrapped patterns. We evaluated how these patterns improve system performance, especially within a corpus-based approach and with the distributional features of words. Finally, we discuss all the experiments in a comparative analysis and show the advantages and disadvantages of the approaches, with promising results.


international conference on computational linguistics | 2012

Automatic Extraction of Turkish Hypernym-Hyponym Pairs From Large Corpus

Savaş Yıldırım; Tuğba Yıldız

Collaboration


Dive into Tuğba Yıldız's collaborations.

Top Co-Authors

Banu Diri

Yıldız Technical University

Gurkan Sahin

Yıldız Technical University
