Network


Latest external collaboration on country level. Dive into details by clicking on the dots.

Hotspot


Dive into the research topics where Terry Catapano is active.

Publication


Featured researches published by Terry Catapano.


ZooKeys | 2015

Community next steps for making globally unique identifiers work for biocollections data

Robert P. Guralnick; Nico Cellinese; John Deck; Richard L. Pyle; John Kunze; Lyubomir Penev; Ramona L. Walls; Gregor Hagedorn; Donat Agosti; John Wieczorek; Terry Catapano; Roderic D. M. Page

Abstract Biodiversity data is being digitized and made available online at a rapidly increasing rate but current practices typically do not preserve linkages between these data, which impedes interoperation, provenance tracking, and assembly of larger datasets. For data associated with biocollections, the biodiversity community has long recognized that an essential part of establishing and preserving linkages is to apply globally unique identifiers at the point when data are generated in the field and to persist these identifiers downstream, but this is seldom implemented in practice. There has neither been coalescence towards one single identifier solution (as in some other domains), nor even a set of recommended best practices and standards to support multiple identifier schemes sharing consistent responses. In order to further progress towards a broader community consensus, a group of biocollections and informatics experts assembled in Stockholm in October 2014 to discuss community next steps to overcome current roadblocks. The workshop participants divided into four groups focusing on: identifier practice in current field biocollections; identifier application for legacy biocollections; identifiers as applied to biodiversity data records as they are published and made available in semantically marked-up publications; and cross-cutting identifier solutions that bridge across these domains. The main outcome was consensus on key issues, including recognition of differences between legacy and new biocollections processes, the need for identifier metadata profiles that can report information on identifier persistence missions, and the unambiguous indication of the type of object associated with the identifier. Current identifier characteristics are also summarized, and an overview of available schemes and practices is provided.


ZooKeys | 2011

XML schemas and mark-up practices of taxonomic literature

Lyubomir Penev; Christopher H. C. Lyal; Anna L. Weitzman; David R. Morse; Guido Sautter; Teodor Georgiev; Robert A. Morris; Terry Catapano; Donat Agosti

Abstract We review the three most widely used XML schemas used to mark-up taxonomic texts, TaxonX, TaxPub and taXMLit. These are described from the viewpoint of their development history, current status, implementation, and use cases. The concept of “taxon treatment” from the viewpoint of taxonomy mark-up into XML is discussed. TaxonX and taXMLit are primarily designed for legacy literature, the former being more lightweight and with a focus on recovery of taxon treatments, the latter providing a much more detailed set of tags to facilitate data extraction and analysis. TaxPub is an extension of the National Library of Medicine Document Type Definition (NLM DTD) for taxonomy focussed on layout and recovery and, as such, is best suited for mark-up of new publications and their archiving in PubMedCentral. All three schemas have their advantages and shortcomings and can be used for different purposes.


Biodiversity Data Journal | 2015

Integrating and visualizing primary data from prospective and legacy taxonomic literature

Jeremy Miller; Donat Agosti; Lyubomir Penev; Guido Sautter; Teodor Georgiev; Terry Catapano; David J. Patterson; Serrano Pereira; Rutger A. Vos; Soraya Sierra

Abstract Specimen data in taxonomic literature are among the highest quality primary biodiversity data. Innovative cybertaxonomic journals are using workflows that maintain data structure and disseminate electronic content to aggregators and other users; such structure is lost in traditional taxonomic publishing. Legacy taxonomic literature is a vast repository of knowledge about biodiversity. Currently, access to that resource is cumbersome, especially for non-specialist data consumers. Markup is a mechanism that makes this content more accessible, and is especially suited to machine analysis. Fine-grained XML (Extensible Markup Language) markup was applied to all (37) open-access articles published in the journal Zootaxa containing treatments on spiders (Order: Araneae). The markup approach was optimized to extract primary specimen data from legacy publications. These data were combined with data from articles containing treatments on spiders published in Biodiversity Data Journal where XML structure is part of the routine publication process. A series of charts was developed to visualize the content of specimen data in XML-tagged taxonomic treatments, either singly or in aggregate. The data can be filtered by several fields (including journal, taxon, institutional collection, collecting country, collector, author, article and treatment) to query particular aspects of the data. We demonstrate here that XML markup using GoldenGATE can address the challenge presented by unstructured legacy data, can extract structured primary biodiversity data which can be aggregated with and jointly queried with data from other Darwin Core-compatible sources, and show how visualization of these data can communicate key information contained in biodiversity literature. We complement recent studies on aspects of biodiversity knowledge using XML structured data to explore 1) the time lag between species discovry and description, and 2) the prevelence of rarity in species descriptions.


international conference on contemporary computing | 2009

A New Approach towards Bibliographic Reference Identification, Parsing and Inline Citation Matching

Deepank Gupta; Bob Morris; Terry Catapano; Guido Sautter

A number of algorithms and approaches have been proposed towards the problem of scanning and digitizing research papers. We can classify work done in the past into three major approaches: regular expression based heuristics, learning based algorithm and knowledge based systems. Our findings point to the inadequacy of existing open-source solutions such as Paracite for papers with “micro-citations” in various European Languages. This paper describes the work done as part of the Google Summer of Code 2008 using a combination of regular-expression based heuristics and knowledge-based systems to develop a system which matches inline citations to their corresponding bibliographic references and identifies and extracts metadata from references. The description, implementation and results of our approach have been presented here. Our approach enhances the accuracy and provides better recognition rates.


Journal of Biomedical Semantics | 2018

OpenBiodiv-O: ontology of the OpenBiodiv knowledge management system

Viktor Senderov; Kiril Simov; Nico M. Franz; Pavel Stoev; Terry Catapano; Donat Agosti; Guido Sautter; Robert A. Morris; Lyubomir Penev

BackgroundThe biodiversity domain, and in particular biological taxonomy, is moving in the direction of semantization of its research outputs. The present work introduces OpenBiodiv-O, the ontology that serves as the basis of the OpenBiodiv Knowledge Management System. Our intent is to provide an ontology that fills the gaps between ontologies for biodiversity resources, such as DarwinCore-based ontologies, and semantic publishing ontologies, such as the SPAR Ontologies. We bridge this gap by providing an ontology focusing on biological taxonomy.ResultsOpenBiodiv-O introduces classes, properties, and axioms in the domains of scholarly biodiversity publishing and biological taxonomy and aligns them with several important domain ontologies (FaBiO, DoCO, DwC, Darwin-SW, NOMEN, ENVO). By doing so, it bridges the ontological gap across scholarly biodiversity publishing and biological taxonomy and allows for the creation of a Linked Open Dataset (LOD) of biodiversity information (a biodiversity knowledge graph) and enables the creation of the OpenBiodiv Knowledge Management System.A key feature of the ontology is that it is an ontology of the scientific process of biological taxonomy and not of any particular state of knowledge. This feature allows it to express a multiplicity of scientific opinions. The resulting OpenBiodiv knowledge system may gain a high level of trust in the scientific community as it does not force a scientific opinion on its users (e.g. practicing taxonomists, library researchers, etc.), but rather provides the tools for experts to encode different views as science progresses.ConclusionsOpenBiodiv-O provides a conceptual model of the structure of a biodiversity publication and the development of related taxonomic concepts. It also serves as the basis for the OpenBiodiv Knowledge Management System.


language resources and evaluation | 1999

TEI and the Encoding of the Physical Structure of Books

Syd Bauman; Terry Catapano

The TEI Guidelines provide little detail on how to encode a text within the physical structures of the book in which it is contained. This paper describes the physical structures of an early printed book and presents two methods for encoding a text within that structure through use of the TEI elements and .


BMC Biology | 2012

From taxonomic literature to cybertaxonomic content

Jeremy Miller; Torsten Dikow; Donat Agosti; Guido Sautter; Terry Catapano; Lyubomir Penev; Zhi-Qiang Zhang; Dean Pentcheff; Richard L. Pyle; Stan Blum; Cynthia Sims Parr; Chris Freeland; Tom Garnett; Linda Ford; Burgert Muller; Leo Smith; Ginger Strader; Teodor Georgiev; Laurence Bénichou


Bulletin of the Association for Information Science and Technology | 2011

Organizing our knowledge of biodiversity

Hilmar Lapp; Robert A. Morris; Terry Catapano; Donald Hobern; Norman Morrison


Computers and The Humanities | 1999

TEI and the Encoding of the Physical Structure of Books.

Syd Bauman; Terry Catapano


Biodiversity Information Science and Standards | 2018

The Open Biodiversity Knowledge Management (eco-)System: Tools and Services for Extraction, Mobilization, Handling and Re-use of Data from the Published Literature

Lyubomir Penev; Donat Agosti; Teodor Georgiev; Viktor Senderov; Guido Sautter; Terry Catapano; Pavel Stoev

Collaboration


Dive into the Terry Catapano's collaboration.

Top Co-Authors

Avatar

Guido Sautter

Karlsruhe Institute of Technology

View shared research outputs
Top Co-Authors

Avatar

Donat Agosti

American Museum of Natural History

View shared research outputs
Top Co-Authors

Avatar

Lyubomir Penev

American Museum of Natural History

View shared research outputs
Top Co-Authors

Avatar

Teodor Georgiev

Bulgarian Academy of Sciences

View shared research outputs
Top Co-Authors

Avatar

Robert A. Morris

University of Massachusetts Boston

View shared research outputs
Top Co-Authors

Avatar

Donat Agosti

American Museum of Natural History

View shared research outputs
Top Co-Authors

Avatar

Pavel Stoev

National Museum of Natural History

View shared research outputs
Top Co-Authors

Avatar
Top Co-Authors

Avatar

Lyubomir Penev

American Museum of Natural History

View shared research outputs
Top Co-Authors

Avatar
Researchain Logo
Decentralizing Knowledge