Marta Arias
Polytechnic University of Catalonia
Network
Latest external collaboration on country level. Dive into details by clicking on the dots.
Publication
Featured researches published by Marta Arias.
ACM Transactions on Intelligent Systems and Technology | 2013
Marta Arias; Argimiro Arratia; Ramon Xuriguera
The dramatic rise in the use of social network platforms such as Facebook or Twitter has resulted in the availability of vast and growing user-contributed repositories of data. Exploiting this data by extracting useful information from it has become a great challenge in data mining and knowledge discovery. A recently popular way of extracting useful information from social network platforms is to build indicators, often in the form of a time series, of general public mood by means of sentiment analysis. Such indicators have been shown to correlate with a diverse variety of phenomena. In this article we follow this line of work and set out to assess, in a rigorous manner, whether a public sentiment indicator extracted from daily Twitter messages can indeed improve the forecasting of social, economic, or commercial indicators. To this end we have collected and processed a large amount of Twitter posts from March 2011 to the present date for two very different domains: stock market and movie box office revenue. For each of these domains, we build and evaluate forecasting models for several target time series both using and ignoring the Twitter-related data. If Twitter does help, then this should be reflected in the fact that the predictions of models that use Twitter-related data are better than the models that do not use this data. By systematically varying the models that we use and their parameters, together with other tuning factors such as lag or the way in which we build our Twitter sentiment index, we obtain a large dataset that allows us to test our hypothesis under different experimental conditions. Using a novel decision-tree-based technique that we call summary tree we are able to mine this large dataset and obtain automatically those configurations that lead to an improvement in the prediction power of our forecasting models. As a general result, we have seen that nonlinear models do take advantage of Twitter data when forecasting trends in volatility indices, while linear ones fail systematically when forecasting any kind of financial time series. In the case of predicting box office revenue trend, it is support vector machines that make best use of Twitter data. In addition, we conduct statistical tests to determine the relation between our Twitter time series and the different target time series.
acm symposium on parallel algorithms and architectures | 2003
Marta Arias; Lenore J. Cowen; Kofi A. Laing; Rajmohan Rajaraman; Orjeta Taka
This paper is concerned with compact routing in the name independent model first introduced by Awerbuch et al. [1] for adaptive routing in dynamic networks. A compact routing scheme that uses local routing tables of size <i>Õ(n<sup>1/2</sup>)</i>, <i>O(log<sup>2</sup> n)</i>-sized packet headers, and stretch bounded by 5 is obtained. Alternative schemes reduce the packet header size to <i>O(log n)</i> at cost of either increasing the stretch to 7, or increasing the table size to <i>Õ(n<sup>2/3</sup>)</i>. For smaller table-size requirements, the ideas in these schemes are generalized to a scheme that uses <i>O(log<sup>2</sup> n)</i>-sized headers, <i>Õ(k<sup>2</sup>n<sup>2/k</sup>)</i>-sized tables, and achieves a stretch of <i>min[1 + (k-1)(2<sup>k/2</sup>-2), 16k<sup>2</sup>+4k ]</i>, improving the best previously-known name-independent scheme due to Awerbuch and Peleg [3].
knowledge discovery and data mining | 2007
Hila Becker; Marta Arias
In many practical applications, one is interested in generating a ranked list of items using information mined from continuous streams of data. For example, in the context of computer networks, one might want to generate lists of nodes ranked according to their susceptibility to attack. In addition, real-world data streams often exhibit concept drift, making the learning task even more challenging. We present an online learning approach to ranking with concept drift, using weighted majority techniques. By continuously modeling different snapshots of the data and tuning our measure of belief in these models over time, we capture changes in the underlying concept and adapt our predictions accordingly. We measure the performance of our algorithm on real electricity data as well as asynthetic data stream, and demonstrate that our approach to ranking from stream data outperforms previously known batch-learning methods and other online methods that do not account for concept drift.
Machine Learning | 2011
Marta Arias; José L. Balcázar
We describe an alternative construction of an existing canonical representation for definite Horn theories, the Guigues-Duquenne basis (or GD basis), which minimizes a natural notion of implicational size. We extend the canonical representation to general Horn, by providing a reduction from definite to general Horn CNF. Using these tools, we provide a new, simpler validation of the classic Horn query learning algorithm of Angluin, Frazier, and Pitt, and we prove that this algorithm always outputs the GD basis regardless of the counterexamples it receives.
SIAM Journal on Discrete Mathematics | 2006
Marta Arias; Lenore J. Cowen; Kofi A. Laing; Rajmohan Rajaraman; Orjeta Taka
This paper is concerned with compact routing schemes for arbitrary undirected networks in the name-independent model first introduced by Awerbuch, Bar-Noy, Linial, and Peleg. A compact routing scheme that uses local routing tables of size
Information & Computation | 2002
Marta Arias; Roni Khardon
\~{O}(n^{1/2})
software engineering and knowledge engineering | 2007
Christian Murphy; Gail E. Kaiser; Marta Arias
,
algorithmic learning theory | 2009
Marta Arias; José L. Balcázar
O(\log^2 n)
Information & Computation | 2006
Marta Arias; Aaron Feigelson; Roni Khardon; Rocco A. Servedio
-sized packet headers, and stretch bounded by 5 is obtained, where
Annals of the New York Academy of Sciences | 2007
Anshul Kundaje; Steve Lianoglou; Xuejing Li; David Quigley; Marta Arias; Chris H. Wiggins; Li Zhang; Christina S. Leslie
n