Network


Latest external collaboration on country level. Dive into details by clicking on the dots.

Hotspot


Dive into the research topics where Julien Masanès is active.

Publication


Featured researches published by Julien Masanès.


Future Internet | 2014

ARCOMEM Crawling Architecture

Vassilis Plachouras; Florent Carpentier; Muhammad Faheem; Julien Masanès; Thomas Risse; Pierre Senellart; Patrick Siehndel; Yannis Stavrakas

The World Wide Web is the largest information repository available today. However, this information is very volatile and Web archiving is essential to preserve it for the future. Existing approaches to Web archiving are based on simple definitions of the scope of Web pages to crawl and are limited to basic interactions with Web servers. The aim of the ARCOMEM project is to overcome these limitations and to provide flexible, adaptive and intelligent content acquisition, relying on social media to create topical Web archives. In this article, we focus on ARCOMEM’s crawling architecture. We introduce the overall architecture and we describe its modules, such as the online analysis module, which computes a priority for the Web pages to be crawled, and the Application-Aware Helper which takes into account the type of Web sites and applications to extract structure from crawled content. We also describe a large-scale distributed crawler that has been developed, as well as the modifications we have implemented to adapt Heritrix, an open source crawler, to the needs of the project. Our experimental results from real crawls show that ARCOMEM’s crawling architecture is effective in acquiring focused information about a topic and leveraging the information from social media.


WWW '18 Companion Proceedings of the The Web Conference 2018 | 2018

TempWeb 2018 Chairs' Welcome and Organization

Marc Spaniol; Ricardo A. Baeza-Yates; Julien Masanès

Time is a key dimension to understand the Web. It is fair to say that it has not received yet all the attention it deserves and TempWeb is an attempt to help remedy this situation by putting time as the center of its reflection. Studying time in this context actually covers a large spectrum, from the extraction of temporal information and knowledge, to diachronic studies for the design of infrastructural and experimental settings enabling a proper observation of this dimension.


Cultural heritage on line | 2010

Turning Pure Web Page Storages into Living Web Archives

Thomas Risse; Marc Spaniol; András A. Benczúr; Julien Masanès

Web content plays an increasingly important role in the knowledge-based society, and the preservation and long-term accessibility of Web history has high value (e.g., for scholarly studies, market analyses, intellectual property disputes, etc.). There is strongly growing interest in its preservation by libraries and archival organizations as well as emerging industrial services. Web content characteristics (high dynamics, volatility, contributor and format variety) make adequate Web archiving a challenge. LiWA will look beyond the pure “freezing” of Web content snapshots for a long time, transforming pure snapshot storage into a “Living” Web Archive. In order to create Living Web Archives, the LiWA project will address R&D challenges in the three areas: Archive Fidelity, Archive coherence and Archive interpretability. The results of the project will be demonstrated within two application scenarios namely “Streaming Archive” and “Social Web Archive”. The Streaming Archive application will showcase the building of an audio-visual Web archive and how audio and video broadcast related web information can be preserved. The Social Web application will demonstrate how web archives can capture the dynamics and the different types of user interaction of the social web.


TempWeb '12 2nd Temporal Web Analytics Workshop | 2012

Proceedings of the 2nd Temporal Web Analytics Workshop

Ricardo A. Baeza-Yates; Julien Masanès; Marc Spaniol


TempWeb'17: The 7th Temporal Web Analytics Workshop | 2017

TempWeb 2017 Chairs’ Welcome & Organization

Marc Spaniol; Ricardo A. Baeza-Yates; Julien Masanès


Archive | 2016

TempWeb 2016 Chairs’ Welcome Message

Marc Spaniol; Ricardo A. Baeza-Yates; Julien Masanès


WWW (Companion Volume) | 2013

Preface for the TempWeb '13 Workshop.

Ricardo A. Baeza-Yates; Julien Masanès; Marc Spaniol


TempWeb'17: The 7th Temporal Web Analytics Workshop | 2013

Proceedings of the 3rd International Temporal Web Analytics Workshop (TempWeb 2013)

Marc Spaniol; Ricardo A. Baeza-Yates; Julien Masanès


Archive | 2012

The 3rd Temporal Web Analytics Workshop (TempWeb)

Ricardo A. Baeza-Yates; Julien Masanès; Marc Spaniol


1st International Temporal Web Analytics Workshop | 2011

Proceedings of the 1st International Temporal Web Analytics Workshop (TWAW 2011)

Ricardo A. Baeza-Yates; Julien Masanès; Marc Spaniol

Collaboration


Dive into the Julien Masanès's collaboration.

Top Co-Authors

Avatar
Top Co-Authors

Avatar
Top Co-Authors

Avatar

Andreas Rauber

Vienna University of Technology

View shared research outputs
Top Co-Authors

Avatar
Top Co-Authors

Avatar

Vassilis Plachouras

Association for Computing Machinery

View shared research outputs
Top Co-Authors

Avatar

Pierre Senellart

National University of Singapore

View shared research outputs
Top Co-Authors

Avatar

Yannis Stavrakas

National Technical University of Athens

View shared research outputs
Researchain Logo
Decentralizing Knowledge