Network


Latest external collaboration on country level. Dive into details by clicking on the dots.

Hotspot


Dive into the research topics where Matthew Shardlow is active.

Publication


Featured researches published by Matthew Shardlow.


Journal of Biomedical Informatics | 2016

Analysis of the effect of sentiment analysis on extracting adverse drug reactions from tweets and forum posts

Ioannis Korkontzelos; Azadeh Nikfarjam; Matthew Shardlow; Abeed Sarker; Sophia Ananiadou; Graciela Gonzalez

Graphical abstract


Database | 2016

Text mining resources for the life sciences

Piotr Przybyła; Matthew Shardlow; Sophie Aubin; Robert Bossy; Richard Eckart de Castilho; Stelios Piperidis; John McNaught; Sophia Ananiadou

Text mining is a powerful technology for quickly distilling key information from vast quantities of biomedical literature. However, to harness this power the researcher must be well versed in the availability, suitability, adaptability, interoperability and comparative accuracy of current text mining resources. In this survey, we give an overview of the text mining resources that exist in the life sciences to help researchers, especially those employed in biocuration, to engage with text mining in their own work. We categorize the various resources under three sections: Content Discovery looks at where and how to find biomedical publications for text mining; Knowledge Encoding describes the formats used to represent the different levels of information associated with content that enable text mining, including those formats used to carry such information between processes; Tools and Services gives an overview of workflow management systems that can be used to rapidly configure and compare domain- and task-specific processes, via access to a wide range of pre-built tools. We also provide links to relevant repositories in each section to enable the reader to find resources relevant to their own area of interest. Throughout this work we give a special focus to resources that are interoperable—those that have the crucial ability to share information, enabling smooth integration and reusability.


north american chapter of the association for computational linguistics | 2016

NaCTeM at SemEval-2016 Task 1: Inferring sentence-level semantic similarity from an ensemble of complementary lexical and sentence-level features.

Piotr Przybyła; Nhung T. H. Nguyen; Matthew Shardlow; Georgios Kontonatsios; Sophia Ananiadou

We present a description of the system submitted to the Semantic Textual Similarity (STS) shared task at SemEval 2016. The task is to assess the degree to which two sentences carry the same meaning. We have designed two different methods to automatically compute a similarity score between sentences. The first method combines a variety of semantic similarity measures as features in a machine learning model. In our second approach, we employ training data from the Interpretable Similarity subtask to create a combined wordsimilarity measure and assess the importance of both aligned and unaligned words. Finally, we combine the two methods into a single hybrid model. Our best-performing run attains a score of 0.7732 on the 2015 STS evaluation data and 0.7488 on the 2016 STS evaluation data.


BMC Medical Informatics and Decision Making | 2018

Identification of research hypotheses and new knowledge from scientific literature

Matthew Shardlow; Riza Theresa Batista-Navarro; Paul Thompson; Raheel Nawaz; John McNaught; Sophia Ananiadou

BackgroundText mining (TM) methods have been used extensively to extract relations and events from the literature. In addition, TM techniques have been used to extract various types or dimensions of interpretative information, known as Meta-Knowledge (MK), from the context of relations and events, e.g. negation, speculation, certainty and knowledge type. However, most existing methods have focussed on the extraction of individual dimensions of MK, without investigating how they can be combined to obtain even richer contextual information. In this paper, we describe a novel, supervised method to extract new MK dimensions that encode Research Hypotheses (an author’s intended knowledge gain) and New Knowledge (an author’s findings). The method incorporates various features, including a combination of simple MK dimensions.MethodsWe identify previously explored dimensions and then use a random forest to combine these with linguistic features into a classification model. To facilitate evaluation of the model, we have enriched two existing corpora annotated with relations and events, i.e., a subset of the GENIA-MK corpus and the EU-ADR corpus, by adding attributes to encode whether each relation or event corresponds to Research Hypothesis or New Knowledge. In the GENIA-MK corpus, these new attributes complement simpler MK dimensions that had previously been annotated.ResultsWe show that our approach is able to assign different types of MK dimensions to relations and events with a high degree of accuracy. Firstly, our method is able to improve upon the previously reported state of the art performance for an existing dimension, i.e., Knowledge Type. Secondly, we also demonstrate high F1-score in predicting the new dimensions of Research Hypothesis (GENIA: 0.914, EU-ADR 0.802) and New Knowledge (GENIA: 0.829, EU-ADR 0.836).ConclusionWe have presented a novel approach for predicting New Knowledge and Research Hypothesis, which combines simple MK dimensions to achieve high F1-scores. The extraction of such information is valuable for a number of practical TM applications.


International Journal of Advanced Computer Science and Applications | 2014

A Survey of Automated Text Simplification

Matthew Shardlow


meeting of the association for computational linguistics | 2013

A Comparison of Techniques to Automatically Identify Complex Words.

Matthew Shardlow


language resources and evaluation | 2014

Out in the Open: Finding and Categorising Errors in the Lexical Simplification Pipeline

Matthew Shardlow


meeting of the association for computational linguistics | 2013

The CW Corpus: A New Resource for Evaluating the Identification of Complex Words

Matthew Shardlow


north american chapter of the association for computational linguistics | 2018

Manchester Metropolitan at SemEval-2018 Task 2: Random Forest with an Ensemble of Features for Predicting Emoji in Tweets

Luciano Gerber; Matthew Shardlow


language resources and evaluation | 2018

A New Corpus to Support Text Mining for the Curation of Metabolites in the ChEBI Database

Matthew Shardlow; Nhung T. H. Nguyen; Gareth Owen; Claire O'Donovan; Andrew R. Leach; John McNaught; Steve Turner; Sophia Ananiadou

Collaboration


Dive into the Matthew Shardlow's collaboration.

Top Co-Authors

Avatar
Top Co-Authors

Avatar

John McNaught

University of Manchester

View shared research outputs
Top Co-Authors

Avatar
Top Co-Authors

Avatar
Top Co-Authors

Avatar
Top Co-Authors

Avatar

Andrew R. Leach

European Bioinformatics Institute

View shared research outputs
Top Co-Authors

Avatar

Claire O'Donovan

European Bioinformatics Institute

View shared research outputs
Top Co-Authors

Avatar

Gareth Owen

European Bioinformatics Institute

View shared research outputs
Top Co-Authors

Avatar
Top Co-Authors

Avatar
Researchain Logo
Decentralizing Knowledge