Network


Latest external collaboration on country level. Dive into details by clicking on the dots.

Hotspot


Dive into the research topics where Arda Çelebi is active.

Publication


Featured researches published by Arda Çelebi.


language resources and evaluation | 2004

MEAD - A Platform for Multidocument Multilingual Text Summarization

Dragomir R. Radev; Timothy Allison; Sasha Blair-Goldensohn; John Blitzer; Arda Çelebi; Stanko Dimitrov; Elliott Franco Drábek; Ali Hakim; Wai Lam; Danyu Liu; Jahna Otterbacher; Hong Qi; Horacio Saggion; Simone Teufel; Michael Topper; Adam Winkel; Zhu Zhang

Abstract This paper describes the functionality of MEAD, a comprehensive, public domain, open source, multidocument multilingual summarization environment that has been thus far downloaded by more than 500 organizations. MEAD has been used in a variety of summarization applications ranging from summarization for mobile devices to Web page summarization within a search engine and to novelty detection.


meeting of the association for computational linguistics | 2003

Evaluation Challenges in Large-Scale Document Summarization

Dragomir R. Radev; Simone Teufel; Horacio Saggion; Wai Lam; John Blitzer; Hong Qi; Arda Çelebi; Danyu Liu; Elliott Franco Drábek

We present a large-scale meta evaluation of eight evaluation measures for both single-document and multi-document summarizers. To this end we built a corpus consisting of (a) 100 Million automatic summaries using six summarizers and baselines at ten summary lengths in both English and Chinese, (b) more than 10,000 manual abstracts and extracts, and (c) 200 Million automatic document and summary retrievals using 20 queries. We present both qualitative and quantitative results showing the strengths and draw-backs of all evaluation methods and how they rank the different summarizers.


international conference on acoustics, speech, and signal processing | 2012

Semi-supervised discriminative language modeling for Turkish ASR

Arda Çelebi; Hasim Sak; Erinç Dikici; Murat Saraclar; Maider Lehr; Emily Prud'hommeaux; Puyang Xu; Nathan Glenn; Damianos Karakos; Sanjeev Khudanpur; Brian Roark; Kenji Sagae; Izhak Shafran; Daniel M. Bikel; Chris Callison-Burch; Yuan Cao; Keith B. Hall; Eva Hasler; Philipp Koehn; Adam Lopez; Matt Post; Darcey Riley

We present our work on semi-supervised learning of discriminative language models where the negative examples for sentences in a text corpus are generated using confusion models for Turkish at various granularities, specifically, word, sub-word, syllable and phone levels. We experiment with different language models and various sampling strategies to select competing hypotheses for training with a variant of the perceptron algorithm. We find that morph-based confusion models with a sample selection strategy aiming to match the error distribution of the baseline ASR system gives the best performance. We also observe that substituting half of the supervised training examples with those obtained in a semi-supervised manner gives similar results.


Polibits | 2013

N-gram Parsing for Jointly Training a Discriminative Constituency Parser

Arda Çelebi; Arzucan Özgür

Syntactic parsers are designed to detect the complete syntactic structure of grammatically correct sentences. In this paper, we introduce the concept of n-gr...


Journal of the Association for Information Science and Technology | 2018

Segmenting hashtags and analyzing their grammatical structure

Arda Çelebi; Arzucan Özgür

Originated as a label to mark specific tweets, hashtags are increasingly used to convey messages that people like to see in the trending hashtags list. Complex noun phrases and even sentences can be turned into a hashtag. Breaking hashtags into their words is a challenging task due to the irregular and compact nature of the language used in Twitter. In this study, we investigate feature‐based machine learning and language model (LM)‐based approaches for hashtag segmentation. Our results show that LM alone is not successful at segmenting nontrivial hashtags. However, when the N‐best LM‐based segmentations are incorporated as features into the feature‐based approach, along with context‐based features proposed in this study, state‐of‐the‐art results in hashtag segmentation are achieved. In addition, we provide an analysis of over two million distinct hashtags, autosegmented by using our best configuration. The analysis reveals that half of all 60 million hashtag occurrences contain multiple words and 80% of sentiment is trapped inside multiword hashtags, justifying the need for hashtag segmentation. Furthermore, we analyze the grammatical structure of hashtags by parsing them and observe that 77% of the hashtags are noun‐based, whereas 11.9% are verb‐based.


Archive | 2003

Evaluation of Text Summarization in a Cross-lingual Information Retrieval Framework

Dragomir R. Radev; Simone Teufel; Horacio Saggion; Wai Lam; John Blitzer; Arda Çelebi; Danyu Liu


joint conference on lexical and computational semantics | 2013

BOUNCE: Sentiment Classification in Twitter using Rich Feature Sets

Nadin Kökciyan; Arda Çelebi; Arzucan Özgür; Suzan ÜsküdarlÄ


conference of the international speech communication association | 2012

Performance Comparison of Training Algorithms for Semi-Supervised Discriminative Language Modeling.

Erinç Dikici; Arda Çelebi; Murat Saraclar


Theory and Applications of Categories | 2017

Description of the BOUN System for the Trilingual Entity Detection and Linking Tasks at TAC KBP 2017.

Arda Çelebi; Arzucan Özgür


north american chapter of the association for computational linguistics | 2013

Semi-Supervised Discriminative Language Modeling with Out-of-Domain Text Data

Arda Çelebi; Murat Saraclar

Collaboration


Dive into the Arda Çelebi's collaboration.

Top Co-Authors

Avatar
Top Co-Authors

Avatar
Top Co-Authors

Avatar
Top Co-Authors

Avatar

Danyu Liu

University of Alabama

View shared research outputs
Top Co-Authors

Avatar
Top Co-Authors

Avatar
Top Co-Authors

Avatar
Top Co-Authors

Avatar

Wai Lam

The Chinese University of Hong Kong

View shared research outputs
Top Co-Authors

Avatar
Researchain Logo
Decentralizing Knowledge