Network


Latest external collaboration on country level. Dive into details by clicking on the dots.

Hotspot


Dive into the research topics where Dina Sukhobok is active.

Publication


Featured researches published by Dina Sukhobok.


Sprachwissenschaft | 2017

DataGraft: One-stop-shop for open data management

Dumitru Roman; Nikolay Nikolov; Antoine Pultier; Dina Sukhobok; Brian Elvesæter; Arne J. Berre; Xianglin Ye; Marin Dimitrov; Alex Simov; Momchill Zarev; Rick Moynihan; Bill Roberts; Ivan Berlocher; Seon-Ho Kim; Tony Lee; Amanda Smith; Tom Heath

This paper introduces DataGraft (https://datagraft.net/) – a cloud-based platform for data transformation and publishing. DataGraft was developed to provide better and easier to use tools for data workers and developers (e.g. open data publishers, linked data developers, data scientists) who consider existing approaches to data transformation, hosting, and access too costly and technically complex. DataGraft offers an integrated, flexible, and reliable cloud-based solution for hosted open data management. Key features include flexible management of data transformations (e.g. interactive creation, execution, sharing, reuse) and reliable data hosting services. This paper provides an overview of DataGraft focusing on the rationale, key features and components, and evaluation.


international semantic web conference | 2016

DataGraft: Simplifying Open Data Publishing

Dumitru Roman; Marin Dimitrov; Nikolay Nikolov; Antoine Putlier; Dina Sukhobok; Brian Elvesæter; Arne J. Berre; Xianglin Ye; Alex Simov; Yavor Petkov

In this demonstrator we introduce DataGraft – a platform for Open Data management. DataGraft provides data transformation, publishing and hosting capabilities that aim to simplify the data publishing lifecycle for data workers (i.e., Open Data publishers, Linked Data developers, data scientists). This demonstrator highlights the key features of DataGraft by exemplifying a data transformation and publishing use case with property-related data.


international semantic web conference | 2016

Tabular Data Cleaning and Linked Data Generation with Grafterizer

Dina Sukhobok; Nikolay Nikolov; Antoine Pultier; Xianglin Ye; Arne J. Berre; Rick Moynihan; Bill Roberts; Brian Elvesæter; Nivethika Mahasivam; Dumitru Roman

Over the past several years the amount of published open data has increased significantly. The majority of this is tabular data, that requires powerful and flexible approaches for data cleaning and preparation in order to convert it into Linked Data. This paper introduces Grafterizer – a software framework developed to support data workers and data developers in the process of converting raw tabular data into linked data. Its main components include Grafter, a powerful software library and DSL for data cleaning and RDF-ization, and Grafterizer, a user interface for interactive specification of data transformations along with a back-end for management and execution of data transformations. The proposed demonstration will focus on Grafterizer’s powerful features for data cleaning and RDF-ization in a scenario using data about the risk of failure of transport infrastructure components due to natural hazards.


OTM Confederated International Conferences "On the Move to Meaningful Internet Systems" | 2017

Norwegian State of Estate Report as Linked Open Data

Ling Shi; Dina Sukhobok; Nikolay Nikolov; Dumitru Roman

This paper presents the Norwegian State of Estate (SoE) dataset containing data about real estates owned by the central government in Norway. The dataset is produced by integrating cross-domain government datasets including data from sources such as the Norwegian business entity register, cadastral system, building accessibility register and the previous SoE report. The dataset is made available as Linked Data. The Linked Data generation process includes data acquisition, cleaning, transformation, annotation, publishing, augmentation and interlinking the annotated data as well as quality assessment of the interlinked datasets. The dataset is published under the Norwegian License for Open Government Data (NLOD) and serves as a reference point for applications using data on central government real estates, such as generation of the SoE report, searching properties suitable for asylum reception centres, risk assessment for state-owned buildings or a public building application for visitors.


european conference on service-oriented and cloud computing | 2017

Data Preparation as a Service Based on Apache Spark

Nivethika Mahasivam; Nikolay Nikolov; Dina Sukhobok; Dumitru Roman

Data preparation is the process of collecting, cleaning and consolidating raw datasets into cleaned data of certain quality. It is an important aspect in almost every data analysis process, and yet it remains tedious and time-consuming. The complexity of the process is further increased by the recent tendency to derive knowledge from very large datasets. Existing data preparation tools provide limited capabilities to effectively process such large volumes of data. On the other hand, frameworks and software libraries that do address the requirements of big data, require expert knowledge in various technical areas. In this paper, we propose a dynamic, service-based, scalable data preparation approach that aims to solve the challenges in data preparation on a large scale, while retaining the accessibility and flexibility provided by data preparation tools. Furthermore, we describe its implementation and integration with an existing framework for data preparation – Grafterizer. Our solution is based on Apache Spark, and exposes application programming interfaces (APIs) to integrate with external tools. Finally, we present experimental results that demonstrate the improvements to the scalability of Grafterizer.


OTM Confederated International Conferences "On the Move to Meaningful Internet Systems" | 2017

The InfraRisk Ontology: Enabling Semantic Interoperability for Critical Infrastructures at Risk from Natural Hazards

Dumitru Roman; Dina Sukhobok; Nikolay Nikolov; Brian Elvesæter; Antoine Pultier

Earthquakes, landslides, and other natural hazard events have severe negative socio-economic impacts. Among other consequences, those events can cause damage to infrastructure networks such as roads and railways. Novel methodologies and tools are needed to analyse the potential impacts of extreme natural hazard events and aid in the decision-making process regarding the protection of existing critical road and rail infrastructure as well as the development of new infrastructure. Enabling uniform, integrated, and reliable access to data on historical failures of critical transport infrastructure can help infrastructure managers and scientist from various related areas to better understand, prevent, and mitigate the impact of natural hazards on critical infrastructures. This paper describes the construction of the InfraRisk ontology for representing relevant information about natural hazard events and their impact on infrastructure components. Furthermore, we present a software prototype that visualizes data published using the proposed ontology.


international conference on big data | 2017

Tabular Data Anomaly Patterns

Dina Sukhobok; Nikolay Nikolov; Dumitru Roman


international semantic web conference | 2017

Linked Data for the Norwegian State of Estate Reporting Service.

Ling Shi; Bjørg Elsa Pettersen; Dina Sukhobok; Nikolay Nikolov; Dumitru Roman


international semantic web conference | 2017

Publishing Socio-Economic Territory Indices as Linked Data and their Visualization for Real Estate Valuation.

Dina Sukhobok; Divna Djordjevic; Diego Sanvito; Javier Paniagua; Dumitru Roman


international semantic web conference | 2017

Interacting with subterranean infrastructure linked data using augmented reality

Dina Sukhobok; Nikolay Nikolov; Till Christopher Lech; Arnt-Henning Moberg; Roar Frantsvaag; Helene Risti Bergaas; Dumitru Roman

Collaboration


Dive into the Dina Sukhobok's collaboration.

Researchain Logo
Decentralizing Knowledge