Network


Latest external collaboration on country level. Dive into details by clicking on the dots.

Hotspot


Dive into the research topics where Chaya Liebeskind is active.

Publication


Featured researches published by Chaya Liebeskind.


ACM Journal on Computing and Cultural Heritage | 2013

Automatic thesaurus construction for cross generation corpus

Hadas Zohar; Chaya Liebeskind; Jonathan Schler; Ido Dagan

This article describes methods for semiautomatic thesaurus construction, for a cross generation, cross genre, and cross cultural corpus. Semiautomatic thesaurus construction is a complex task, and applying it on a cross generation corpus brings its own challenges. We used a Jewish juristic corpus containing documents and genres that were written across 2000 years, and contain a mix of different languages, dialects, geographies, and writing styles. We evaluated different first and second order methods, and introduced a special annotation scheme for this problem, which showed that first order methods performed surprisingly well. We found that in our case, improving the coverage is the more difficult task, for this we introduce a new algorithm to increase recall (coverage)—which is applicable to many other problems as well, and demonstrates significant improvement in our corpus.


ACM Journal on Computing and Cultural Heritage | 2016

Semiautomatic Construction of Cross-Period Thesaurus

Chaya Liebeskind; Ido Dagan; Jonathan Schler

A cross-period (diachronic) thesaurus enables users to search for information using modern terminology and obtain semantically related terms from earlier historical periods. The complex task of supporting the construction of a diachronic thesaurus by a domain expert lexicographer has hardly been addressed computationally until now. In this article, we introduce a semiautomatic iterative Query Expansion (QE) scheme for supporting diachronic thesaurus construction, which identifies candidate related terms based on statistical corpus-based measures. We use ancient-modern period classification to increase the performance of the statistical cooccurrence measures and extend our methods to deal with Multi-Word Expressions (MWEs). We demonstrate the empirical benefit of our scheme for a Jewish cross-period thesaurus and evaluate its impact on recall and on the effectiveness of the lexicographer’s manual efforts.


Archive | 2018

Comment Relevance Classification in Facebook

Chaya Liebeskind; Shmuel Liebeskind; Yaakov HaCohen-Kerner

Social posts and their comments are rich and interesting social data. In this study, we aim to classify comments as relevant or irrelevant to the content of their posts. Since the comments in social media are usually short, their bag-of-words (BoW) representations are highly sparse. We investigate four semantic vector representations for the relevance classification task. We investigate different types of large unlabeled data for learning the distributional representations. We also empirically demonstrate that expanding the input of the task to include the post text does not improve the classification performance over using only the comment text. We show that representing the comment in the post space is a cheap and good representation for comment relevance classification.


language resources and evaluation | 2015

Text Categorization from category name in an industry-motivated scenario

Chaya Liebeskind; Lili Kotlerman; Ido Dagan

In this work we suggest a novel Text Categorization (TC) scenario, motivated by an ad-hoc industrial need to assign documents to a set of predefined categories, while labeled training data for the categories is not available. The scenario is applicable in many industrial settings and is interesting from the academic perspective. We present a new dataset geared for the main characteristics of the scenario, and utilize it to investigate the name-based TC approach, which uses the category names as its only input and does not require training data. We evaluate and analyze the performance of state-of-the-art methods for this dataset to identify the shortcomings of these methods for our scenario, and suggest ways for overcoming these shortcomings. We utilize statistical correlation measured over a target corpus for improving the state-of-the-art, and offer a different classification scheme based on the characteristics of the setting. We evaluate our improvements and adaptations and show superior performance of our suggested method.


sighum workshop on language technology for cultural heritage social sciences and humanities | 2015

Integrating Query Performance Prediction in Term Scoring for Diachronic Thesaurus

Chaya Liebeskind; Ido Dagan

A diachronic thesaurus is a lexical resource that aims to map between modern terms and their semantically related terms in earlier periods. In this paper, we investigate the task of collecting a list of relevant modern target terms for a domain-specific diachronic thesaurus. We propose a supervised learning scheme, which integrates features from two closely related fields: Terminology Extraction and Query Performance Prediction (QPP). Our method further expands modern candidate terms with ancient related terms, before assessing their corpus relevancy with QPP measures. We evaluate the empirical benefit of our method for a thesaurus for a diachronic Jewish corpus.


language resources and evaluation | 2016

A Lexical Resource of Hebrew Verb-Noun Multi-Word Expressions.

Chaya Liebeskind; Yaakov HaCohen-Kerner


sighum workshop on language technology for cultural heritage social sciences and humanities | 2013

Semi-automatic Construction of Cross-period Thesaurus

Chaya Liebeskind; Ido Dagan; Jonathan Schler


joint conference on lexical and computational semantics | 2012

Statistical Thesaurus Construction for a Morphologically Rich Language

Chaya Liebeskind; Ido Dagan; Jonathan Schler


international conference on computational linguistics | 2016

Semantically Motivated Hebrew Verb-Noun Multi-Word Expressions Identification.

Chaya Liebeskind; Yaakov HaCohen-Kerner


language resources and evaluation | 2018

Automatic Thesaurus Construction for Modern Hebrew.

Chaya Liebeskind; Ido Dagan; Jonathan Schler

Collaboration


Dive into the Chaya Liebeskind's collaboration.

Top Co-Authors

Avatar
Top Co-Authors

Avatar
Top Co-Authors

Avatar

Yaakov HaCohen-Kerner

Jerusalem College of Technology

View shared research outputs
Top Co-Authors

Avatar
Top Co-Authors

Avatar
Top Co-Authors

Avatar

Karine Nahon

University of Washington

View shared research outputs
Researchain Logo
Decentralizing Knowledge