Marco Lui
University of Melbourne
Network
Latest external collaboration on country level. Dive into details by clicking on the dots.
Publication
Featured researches published by Marco Lui.
Proceedings of the 5th Workshop on Language Analysis for Social Media (LASM) | 2014
Marco Lui; Timothy Baldwin
We present an evaluation of “off-theshelf” language identification systems as applied to microblog messages from Twitter. A key challenge is the lack of an adequate corpus of messages annotated for language that reflects the linguistic diversity present on Twitter. We overcome this through a “mostly-automated” approach to gathering language-labeled Twitter messages for evaluating language identification. We present the method to construct this dataset, as well as empirical results over existing datasets and off-theshelf language identifiers. We also test techniques that have been proposed in the literature to boost language identification performance over Twitter messages. We find that simple voting over three specific systems consistently outperforms any specific system, and achieves state-of-the-art accuracy on the task.
international conference on computational linguistics | 2014
Marco Lui; Ned Letcher; Oliver Adams; Long Duong; Paul Cook; Timothy Baldwin
The Discriminating between Similar Languages (DSL) shared task at VarDial challenged participants to build an automatic language identification system to discriminate between 13 languages in 6 groups of highly-similar languages (or national varieties of the same language). In this paper, we describe the submissions made by team UniMelb-NLP, which took part in both the closed and open categories. We present the text representations and modeling techniques used, including cross-lingual POS tagging as well as fine-grained tags extracted from a deep grammar of English, and discuss additional data we collected for the open submissions, utilizing custombuilt web corpora based on top-level domains as well as existing corpora.
meeting of the association for computational linguistics | 2012
Marco Lui; Timothy Baldwin
international joint conference on natural language processing | 2013
Timothy Baldwin; Paul Cook; Marco Lui; Andrew MacKinlay; Li Wang
north american chapter of the association for computational linguistics | 2010
Timothy Baldwin; Marco Lui
international joint conference on natural language processing | 2011
Marco Lui; Timothy Baldwin
Transactions of the Association for Computational Linguistics | 2014
Marco Lui; Jey Han Lau; Timothy Baldwin
empirical methods in natural language processing | 2011
Li Wang; Marco Lui; Su Nam Kim; Joakim Nivre; Timothy Baldwin
Proceedings of the Australasian Language Technology Association Workshop 2013 (ALTA 2013) | 2013
Marco Lui; Paul Cook
north american chapter of the association for computational linguistics | 2010
Timothy Baldwin; David Martinez; Richard B. Penman; Su Nam Kim; Marco Lui; Li Wang; Andrew MacKinlay