Network


Latest external collaboration on country level. Dive into details by clicking on the dots.

Hotspot


Dive into the research topics where Anne E. Thessen is active.

Publication


Featured researches published by Anne E. Thessen.


PLOS Biology | 2015

Finding Our Way through Phenotypes

Andrew R. Deans; Suzanna E. Lewis; Eva Huala; Salvatore S. Anzaldo; Michael Ashburner; James P. Balhoff; David C. Blackburn; Judith A. Blake; J. Gordon Burleigh; Bruno Chanet; Laurel Cooper; Mélanie Courtot; Sándor Csösz; Hong Cui; Wasila M. Dahdul; Sandip Das; T. Alexander Dececchi; Agnes Dettai; Rui Diogo; Robert E. Druzinsky; Michel Dumontier; Nico M. Franz; Frank Friedrich; George V. Gkoutos; Melissa Haendel; Luke J. Harmon; Terry F. Hayamizu; Yongqun He; Heather M. Hines; Nizar Ibrahim

Imagine if we could compute across phenotype data as easily as genomic data; this article calls for efforts to realize this vision and discusses the potential benefits.


ZooKeys | 2011

Data issues in the life sciences

Anne E. Thessen; David J. Patterson

Abstract We review technical and sociological issues facing the Life Sciences as they transform into more data-centric disciplines - the “Big New Biology”. Three major challenges are: 1) lack of comprehensive standards; 2) lack of incentives for individual scientists to share data; 3) lack of appropriate infrastructure and support. Technological advances with standards, bandwidth, distributed computing, exemplar successes, and a strong presence in the emerging world of Linked Open Data are sufficient to conclude that technical issues will be overcome in the foreseeable future. While motivated to have a shared open infrastructure and data pool, and pressured by funding agencies in move in this direction, the sociological issues determine progress. Major sociological issues include our lack of understanding of the heterogeneous data cultures within Life Sciences, and the impediments to progress include a lack of incentives to build appropriate infrastructures into projects and institutions or to encourage scientists to make data openly available.


Environmental Research Letters | 2015

The influence of droplet size and biodegradation on the transport of subsurface oil droplets during the Deepwater Horizon spill: a model sensitivity study

Elizabeth W. North; E. Eric Adams; Anne E. Thessen; Zachary Schlag; Ruoying He; Scott A. Socolofsky; Stephen M. Masutani; Scott D. Peckham

Abetter understanding of oil droplet formation, degradation, and dispersal in deepwaters is needed to enhance prediction of the fate and transport of subsurface oil spills. This research evaluates the influence of initial droplet size and rates of biodegradation on the subsurface transport of oil droplets, specifically those from theDeepwaterHorizon oil spill. A three-dimensional coupledmodel was employedwith components that included analyticalmultiphase plume, hydrodynamic and Lagrangianmodels. Oil droplet biodegradationwas simulated based onfirst order decay rates of alkanes. The initial diameter of droplets (10–300 μm) spanned a range of sizes expected fromdispersant-treated oil. Results indicate thatmodel predictions are sensitive to biodegradation processes, with depth distributions deepening by hundreds ofmeters, horizontal distributions decreasing by hundreds to thousands of kilometers, andmass decreasing by 92–99%when biodegradation is applied compared to simulationswithout biodegradation. In addition, there are twoto four-fold changes in the area of the seafloor contacted by oil droplets among scenarios with different biodegradation rates. The spatial distributions of hydrocarbons predicted by themodel with biodegradation are similar to those observed in the sediment andwater column, although themodel predicts hydrocarbons to the northeast and east of thewell where no observations weremade. This study indicates that improvement in knowledge of droplet sizes and biodegradation processes is important for accurate prediction of subsurface oil spills.


PLOS ONE | 2012

The taxonomic significance of species that have only been observed once: The genus Gymnodinium (dinoflagellata) as an example

Anne E. Thessen; David J. Patterson; Shauna A. Murray

Taxonomists have been tasked with cataloguing and quantifying the Earth’s biodiversity. Their progress is measured in code-compliant species descriptions that include text, images, type material and molecular sequences. It is from this material that other researchers are to identify individuals of the same species in future observations. It has been estimated that 13% to 22% (depending on taxonomic group) of described species have only ever been observed once. Species that have only been observed at the time and place of their original description are referred to as oncers. Oncers are important to our current understanding of biodiversity. They may be validly described species that are members of a rare biosphere, or they may indicate endemism, or that these species are limited to very constrained niches. Alternatively, they may reflect that taxonomic practices are too poor to allow the organism to be re-identified or that the descriptions are unknown to other researchers. If the latter are true, our current tally of species will not be an accurate indication of what we know. In order to investigate this phenomenon and its potential causes, we examined the microbial eukaryote genus Gymnodinium. This genus contains 268 extant species, 103 (38%) of which have not been observed since their original description. We report traits of the original descriptions and interpret them in respect to the status of the species. We conclude that the majority of oncers were poorly described and their identity is ambiguous. As a result, we argue that the genus Gymnodinium contains only 234 identifiable species. Species that have been observed multiple times tend to have longer descriptions, written in English. The styles of individual authors have a major effect, with a few authors describing a disproportionate number of oncers. The information about the taxonomy of Gymnodinium that is available via the internet is incomplete, and reliance on it will not give access to all necessary knowledge. Six new names are presented – Gymnodinium campbelli for the homonymous name Gymnodinium translucens Campbell 1973, Gymnodinium antarcticum for the homonymous name Gymnodinium frigidum Balech 1965, Gymnodinium manchuriensis for the homonymous name Gymnodinium autumnale Skvortzov 1968, Gymnodinium christenum for the homonymous name Gymnodinium irregulare Christen 1959, Gymnodinium conkufferi for the homonymous name Gymnodinium irregulare Conrad & Kufferath 1954 and Gymnodinium chinensis for the homonymous name Gymnodinium frigidum Skvortzov 1968.


Applied Ontology | 2014

Semantic Web and Big Data meets Applied Ontology: The Ontology Summit 2014

Leo Obrst; Michael Gruninger; Kenneth Baclawski; Mike Bennett; Dan Brickley; Gary Berg-Cross; Pascal Hitzler; Krzysztof Janowicz; Christine Kapp; Oliver Kutz; Christoph Lange; Anatoly Levenchuk; Francesca Quattri; Alan L. Rector; Todd Schneider; Simon Spero; Anne E. Thessen; Marcela Vegetti; Amanda Vizedom; Andrea Westerinen; Matthew West; Peter Yim

Leo Obrst a,∗, Michael Gruninger b, Ken Baclawski c, Mike Bennett d, Dan Brickley e, Gary Berg-Cross f, Pascal Hitzler g, Krzysztof Janowicz h, Christine Kapp i, Oliver Kutz j, Christoph Lange k, Anatoly Levenchuk l, Francesca Quattri m, Alan Rector n, Todd Schneider o, Simon Spero p, Anne Thessen q, Marcela Vegetti r, Amanda Vizedom s, Andrea Westerinen t, Matthew West u and Peter Yim v a The MITRE Corporation, McLean, VA, USA b University of Toronto, Toronto, Canada c Northeastern University, Boston, MA, USA d Hypercube Ltd., London, UK e Google, London, UK f Knowledge Strategies, Washington, DC, USA g Wright State University, Dayton, OH, USA h University of California, Santa Barbara, Santa Barbara, CA, USA i JustIntegration, Inc., Kissimmee, FL, USA j Otto von Guericke University Magdeburg, Magdeburg, Germany k University of Bonn, Bonn, Germany; Fraunhofer IAIS, Sankt Augustin, Germany l TechInvestLab.ru, Moscow, Russia m The Hong Kong Polytechnic University, Hong Kong n University of Manchester, Manchester, UK o PDS, Inc., Arvada, CO, USA p University of North Carolina, Chapel Hill, NC, USA q Arizona State University, Phoenix, AZ, USA r INGAR (CONICET/UTN), Santa Fe, Argentina s Criticollab, LLC, Durham, NC, USA t Nine Points Solutions, LLC, Potomac, MD, USA u Information Junction, Fareham, UK v CIM Engineering, Inc., San Mateo, CA, USA


Advances in Bioinformatics | 2012

Applications of Natural Language Processing in Biodiversity Science

Anne E. Thessen; Hong Cui; Dmitry Mozzherin

Centuries of biological knowledge are contained in the massive body of scientific literature, written for human-readability but too big for any one person to consume. Large-scale mining of information from the literature is necessary if biology is to transform into a data-driven science. A computer can handle the volume but cannot make sense of the language. This paper reviews and discusses the use of natural language processing (NLP) and machine-learning algorithms to extract information from systematic literature. NLP algorithms have been used for decades, but require special development for application in the biological realm due to the special nature of the language. Many tools exist for biological information extraction (cellular processes, taxonomic names, and morphological characters), but none have been applied life wide and most still require testing and development. Progress has been made in developing algorithms for automated annotation of taxonomic text, identification of taxonomic names in text, and extraction of morphological character information from taxonomic descriptions. This manuscript will briefly discuss the key steps in applying information extraction tools to enhance biodiversity science.


Standards in Genomic Sciences | 2010

Meeting Report: BioSharing at ISMB 2010

Dawn Field; Susanna Sansone; Edward F. DeLong; Peter Sterk; Iddo Friedberg; Pascale Gaudet; Susanna Lewis; Renzo Kottmann; Lynette Hirschman; George M Garrity; Guy Cochrane; John Wooley; Folker Meyer; Sarah Hunter; Owen White; Brian Bramlett; Susan K. Gregurick; Hilmar Lapp; Sandra Orchard; Philippe Rocca-Serra; Alan Ruttenberg; Nigam H. Shah; Chris F. Taylor; Anne E. Thessen

This report summarizes the proceedings of the one day BioSharing meeting held at the Intelligent Systems for Molecular Biology (ISMB) 2010 conference in Boston, MA, USA This inaugural BioSharing event was hosted by the Genomic Standards Consortium as part of its M3 & BioSharing special interest group (SIG) workshop. The BioSharing event included invited talks from a range of community leaders and a panel discussion at the end of the day. The panel session led to the formal agreement among community leaders to join together to promote cross-community knowledge exchange and collaborations. A key focus of the newly formed Biosharing community will be linking up resources to promote real-world data sharing (virtuous cycle of data) and supporting compliance with data policies through the creation of a one-stop-portal of information. Further information about the newly established BioSharing effort can be found at http://biosharing.org.


Biodiversity Data Journal | 2016

Challenges with using names to link digital biodiversity information

David J. Patterson; Dmitry Mozzherin; David P. Shorthouse; Anne E. Thessen

The need for a names-based cyber-infrastructure for digital biology is based on the argument that scientific names serve as a standardized metadata system that has been used consistently and near universally for 250 years. As we move towards data-centric biology, name-strings can be called on to discover, index, manage, and analyze accessible digital biodiversity information from multiple sources. Known impediments to the use of scientific names as metadata include synonyms, homonyms, mis-spellings, and the use of other strings as identifiers. We here compare the name-strings in GenBank, Catalogue of Life (CoL), and the Dryad Digital Repository (DRYAD) to assess the effectiveness of the current names-management toolkit developed by Global Names to achieve interoperability among distributed data sources. New tools that have been used here include Parser (to break name-strings into component parts and to promote the use of canonical versions of the names), a modified TaxaMatch fuzzy-matcher (to help manage typographical, transliteration, and OCR errors), and Cross-Mapper (to make comparisons among data sets). The data sources include scientific names at multiple ranks; vernacular (common) names; acronyms; strain identifiers and other surrogates including idiosyncratic abbreviations and concatenations. About 40% of the name-strings in GenBank are scientific names representing about 400,000 species or infraspecies and their synonyms.


PLOS ONE | 2014

Knowledge extraction and semantic annotation of text from the encyclopedia of life.

Anne E. Thessen; Cynthia Sims Parr

Numerous digitization and ontological initiatives have focused on translating biological knowledge from narrative text to machine-readable formats. In this paper, we describe two workflows for knowledge extraction and semantic annotation of text data objects featured in an online biodiversity aggregator, the Encyclopedia of Life. One workflow tags text with DBpedia URIs based on keywords. Another workflow finds taxon names in text using GNRD for the purpose of building a species association network. Both workflows work well: the annotation workflow has an F1 Score of 0.941 and the association algorithm has an F1 Score of 0.885. Existing text annotators such as Terminizer and DBpedia Spotlight performed well, but require some optimization to be useful in the ecology and evolution domain. Important future work includes scaling up and improving accuracy through the use of distributional semantics.


PeerJ | 2015

Emerging semantics to link phenotype and environment

Anne E. Thessen; Daniel E. Bunker; Pier Luigi Buttigieg; Laurel Cooper; Wasila M. Dahdul; Sami Domisch; Nico M. Franz; Pankaj Jaiswal; Carolyn J. Lawrence-Dill; Peter E. Midford; Christopher J. Mungall; Martín J. Ramírez; Chelsea D. Specht; Lars Vogt; Rutger A. Vos; Ramona L. Walls; Jeffrey W. White; Guanyang Zhang; Andrew R. Deans; Eva Huala; Suzanna E. Lewis; Paula M. Mabee

Understanding the interplay between environmental conditions and phenotypes is a fundamental goal of biology. Unfortunately, data that include observations on phenotype and environment are highly heterogeneous and thus difficult to find and integrate. One approach that is likely to improve the status quo involves the use of ontologies to standardize and link data about phenotypes and environments. Specifying and linking data through ontologies will allow researchers to increase the scope and flexibility of large-scale analyses aided by modern computing methods. Investments in this area would advance diverse fields such as ecology, phylogenetics, and conservation biology. While several biological ontologies are well-developed, using them to link phenotypes and environments is rare because of gaps in ontological coverage and limits to interoperability among ontologies and disciplines. In this manuscript, we present (1) use cases from diverse disciplines to illustrate questions that could be answered more efficiently using a robust linkage between phenotypes and environments, (2) two proof-of-concept analyses that show the value of linking phenotypes to environments in fishes and amphibians, and (3) two proposed example data models for linking phenotypes and environments using the extensible observation ontology (OBOE) and the Biological Collections Ontology (BCO); these provide a starting point for the development of a data model linking phenotypes and environments.

Collaboration


Dive into the Anne E. Thessen's collaboration.

Top Co-Authors

Avatar

David J. Patterson

Marine Biological Laboratory

View shared research outputs
Top Co-Authors

Avatar

Elizabeth W. North

University of Maryland Center for Environmental Science

View shared research outputs
Top Co-Authors

Avatar

Diane K. Stoecker

University of Maryland Center for Environmental Science

View shared research outputs
Top Co-Authors

Avatar

Eva Huala

Carnegie Institution for Science

View shared research outputs
Top Co-Authors

Avatar

Hong Cui

University of Arizona

View shared research outputs
Top Co-Authors

Avatar

Wasila M. Dahdul

University of South Dakota

View shared research outputs
Top Co-Authors

Avatar

Alex K. Lancaster

Massachusetts Institute of Technology

View shared research outputs
Top Co-Authors

Avatar

Andrew R. Deans

Pennsylvania State University

View shared research outputs
Top Co-Authors

Avatar
Top Co-Authors

Avatar

Dmitry Mozzherin

Marine Biological Laboratory

View shared research outputs
Researchain Logo
Decentralizing Knowledge