Network


Latest external collaboration on country level. Dive into details by clicking on the dots.

Hotspot


Dive into the research topics where Erik Schultes is active.

Publication


Featured researches published by Erik Schultes.


Scientific Data | 2016

The FAIR Guiding Principles for scientific data management and stewardship

Mark D. Wilkinson; Michel Dumontier; IJsbrand Jan Aalbersberg; Gabrielle Appleton; Myles Axton; Arie Baak; Niklas Blomberg; Jan Willem Boiten; Luiz Olavo Bonino da Silva Santos; Philip E. Bourne; Jildau Bouwman; Anthony J. Brookes; Timothy W.I. Clark; Mercè Crosas; Ingrid Dillo; Olivier Dumon; Scott C Edmunds; Chris T. Evelo; Richard Finkers; Alejandra Gonzalez-Beltran; Alasdair J. G. Gray; Paul T. Groth; Carole A. Goble; Jeffrey S. Grethe; Jaap Heringa; Peter A. C. 't Hoen; Rob W. W. Hooft; Tobias Kuhn; Ruben Kok; Joost N. Kok

There is an urgent need to improve the infrastructure supporting the reuse of scholarly data. A diverse set of stakeholders—representing academia, industry, funding agencies, and scholarly publishers—have come together to design and jointly endorse a concise and measureable set of principles that we refer to as the FAIR Data Principles. The intent is that these may act as a guideline for those wishing to enhance the reusability of their data holdings. Distinct from peer initiatives that focus on the human scholar, the FAIR Principles put specific emphasis on enhancing the ability of machines to automatically find and use the data, in addition to supporting its reuse by individuals. This Comment is the first formal publication of the FAIR Principles, and includes the rationale behind them, and some exemplar implementations in the community.


Genome Biology | 2015

Gateways to the FANTOM5 promoter level mammalian expression atlas

Marina Lizio; Jayson Harshbarger; Hisashi Shimoji; Jessica Severin; Takeya Kasukawa; Serkan Sahin; Imad Abugessaisa; Shiro Fukuda; Fumi Hori; Sachi Ishikawa-Kato; Christopher J. Mungall; Erik Arner; J. Kenneth Baillie; Nicolas Bertin; Hidemasa Bono; Michiel Jl de Hoon; Alexander D. Diehl; Emmanuel Dimont; Tom C. Freeman; Kaori Fujieda; Winston Hide; Rajaram Kaliyaperumal; Toshiaki Katayama; Timo Lassmann; Terrence F. Meehan; Koro Nishikata; Hiromasa Ono; Michael Rehli; Albin Sandelin; Erik Schultes

The FANTOM5 project investigates transcription initiation activities in more than 1,000 human and mouse primary cells, cell lines and tissues using CAGE. Based on manual curation of sample information and development of an ontology for sample classification, we assemble the resulting data into a centralized data resource (http://fantom.gsc.riken.jp/5/). This resource contains web-based tools and data-access points for the research community to search and extract data related to samples, genes, promoter activities, transcription factors and enhancers across the FANTOM5 atlas.


Nature Genetics | 2011

The value of data

Barend Mons; Herman H. H. B. M. van Haagen; Christine Chichester; P.A.C. ’t Hoen; Johan T. den Dunnen; Gert-Jan B. van Ommen; Erik M. van Mulligen; Bharat Singh; Rob W. W. Hooft; Marco Roos; Joel K. Hammond; Bruce Kiesel; Belinda Giardine; Jan Velterop; Paul T. Groth; Erik Schultes

Data citation and the derivation of semantic constructs directly from datasets have now both found their place in scientific communication. The social challenge facing us is to maintain the value of traditional narrative publications and their relationship to the datasets they report upon while at the same time developing appropriate metrics for citation of data and data constructs.


Analytical Biochemistry | 2012

Phage display screening without repetitious selection rounds.

Peter A. C. 't Hoen; Silvana M.G. Jirka; Bradley R. ten Broeke; Erik Schultes; B. Aguilera; Kar Him Pang; Hans Heemskerk; Annemieke Aartsma-Rus; Gertjan Jb van Ommen; Johan T. den Dunnen

Phage display screenings are frequently employed to identify high-affinity peptides or antibodies. Although successful, phage display is a laborious technology and is notorious for identification of false positive hits. To accelerate and improve the selection process, we have employed Illumina next generation sequencing to deeply characterize the Ph.D.-7 M13 peptide phage display library before and after several rounds of biopanning on KS483 osteoblast cells. Sequencing of the naive library after one round of amplification in bacteria identifies propagation advantage as an important source of false positive hits. Most important, our data show that deep sequencing of the phage pool after a first round of biopanning is already sufficient to identify positive phages. Whereas traditional sequencing of a limited number of clones after one or two rounds of selection is uninformative, the required additional rounds of biopanning are associated with the risk of losing promising clones propagating slower than nonbinding phages. Confocal and live cell imaging confirms that our screen successfully selected a peptide with very high binding and uptake in osteoblasts. We conclude that next generation sequencing can significantly empower phage display screenings by accelerating the finding of specific binders and restraining the number of false positive hits.


Human Mutation | 2012

Microattribution and nanopublication as means to incentivize the placement of human genome variation data into the public domain

George P. Patrinos; David Neil Cooper; Erik M. van Mulligen; Vassiliki Gkantouna; Giannis Tzimas; Zuotian Tatum; Erik Schultes; Marco Roos; Barend Mons

The advances in bioinformatics required to annotate human genomic variants and to place them in public data repositories have not kept pace with their discovery. Moreover, a law of diminishing returns has begun to operate both in terms of data publication and submission. Although the continued deposition of such data in the public domain is essential to maximize both their scientific and clinical utility, rewards for data sharing are few, representing a serious practical impediment to data submission. To date, two main strategies have been adopted as a means to encourage the submission of human genomic variant data: (1) database journal linkups involving the affiliation of a scientific journal with a publicly available database and (2) microattribution, involving the unambiguous linkage of data to their contributors via a unique identifier. The latter could in principle lead to the establishment of a microcitation‐tracking system that acknowledges individual endeavor and achievement. Both approaches could incentivize potential data contributors, thereby encouraging them to share their data with the scientific community. Here, we summarize and critically evaluate approaches that have been proposed to address current deficiencies in data attribution and discuss ways in which they could become more widely adopted as novel scientific publication modalities. Hum Mutat 33:1503–1512, 2012.


Biotechnology Journal | 2013

An autonomously self‐assembling dendritic DNA nanostructure for target DNA detection

Harish Chandran; Abhijit Rangnekar; Geetha A. Shetty; Erik Schultes; John H. Reif; Thomas H. LaBean

There is a growing need for sensitive and reliable nucleic acid detection methods that are convenient and inexpensive. Responsive and programmable DNA nanostructures have shown great promise as chemical detection systems. Here, we describe a DNA detection system employing the triggered self-assembly of a novel DNA dendritic nanostructure. The detection protocol is executed autonomously without external intervention. Detection begins when a specific, single-stranded target DNA strand (T) triggers a hybridization chain reaction (HCR) between two, distinct DNA hairpins (α and β). Each hairpin opens and hybridizes up to two copies of the other. In the absence of T, α and β are stable and remain in their poised, closed-hairpin form. In the presence of T, α hairpins are opened by toe-hold mediated strand-displacement, each of which then opens and hybridizes two β hairpins. Likewise, each opened β hairpin can open and hybridize two α hairpins. Hence, each layer of the growing dendritic nanostructure can in principle accommodate an exponentially increasing number of cognate molecules, generating a high molecular weight nanostructure. This HCR system has minimal sequence constraints, allowing reconfiguration for the detection of arbitrary target sequences. Here, we demonstrate detection of unique sequence identifiers of HIV and Chlamydia pathogens.


Genes | 2011

Protein Folding Absent Selection

Thomas H. LaBean; Tauseef R. Butt; Stuart A. Kauffman; Erik Schultes

Biological proteins are known to fold into specific 3D conformations. However, the fundamental question has remained: Do they fold because they are biological, and evolution has selected sequences which fold? Or is folding a common trait, widespread throughout sequence space? To address this question arbitrary, unevolved, random-sequence proteins were examined for structural features found in folded, biological proteins. Libraries of long (71 residue), random-sequence polypeptides, with ensemble amino acid composition near the mean for natural globular proteins, were expressed as cleavable fusions with ubiquitin. The structural properties of both the purified pools and individual isolates were then probed using circular dichroism, fluorescence emission, and fluorescence quenching techniques. Despite this necessarily sparse “sampling” of sequence space, structural properties that define globular biological proteins, namely collapsed conformations, secondary structure, and cooperative unfolding, were found to be prevalent among unevolved sequences. Thus, for polypeptides the size of small proteins, natural selection is not necessary to account for the compact and cooperative folded states observed in nature.


PLOS ONE | 2013

Generic information can retrieve known biological associations: implications for biomedical knowledge discovery.

Herman H. H. B. M. van Haagen; Peter A. C. 't Hoen; Barend Mons; Erik Schultes

Motivation Weighted semantic networks built from text-mined literature can be used to retrieve known protein-protein or gene-disease associations, and have been shown to anticipate associations years before they are explicitly stated in the literature. Our text-mining system recognizes over 640,000 biomedical concepts: some are specific (i.e., names of genes or proteins) others generic (e.g., ‘Homo sapiens’). Generic concepts may play important roles in automated information retrieval, extraction, and inference but may also result in concept overload and confound retrieval and reasoning with low-relevance or even spurious links. Here, we attempted to optimize the retrieval performance for protein-protein interactions (PPI) by filtering generic concepts (node filtering) or links to generic concepts (edge filtering) from a weighted semantic network. First, we defined metrics based on network properties that quantify the specificity of concepts. Then using these metrics, we systematically filtered generic information from the network while monitoring retrieval performance of known protein-protein interactions. We also systematically filtered specific information from the network (inverse filtering), and assessed the retrieval performance of networks composed of generic information alone. Results Filtering generic or specific information induced a two-phase response in retrieval performance: initially the effects of filtering were minimal but beyond a critical threshold network performance suddenly drops. Contrary to expectations, networks composed exclusively of generic information demonstrated retrieval performance comparable to unfiltered networks that also contain specific concepts. Furthermore, an analysis using individual generic concepts demonstrated that they can effectively support the retrieval of known protein-protein interactions. For instance the concept “binding” is indicative for PPI retrieval and the concept “mutation abnormality” is indicative for gene-disease associations. Conclusion Generic concepts are important for information retrieval and cannot be removed from semantic networks without negative impact on retrieval performance.


Complexity | 1999

A parameterization of RNA sequence space

Erik Schultes; Peter Hraber; Thomas H. LaBean

RNA polymers are constructed from four distinct nucleotide bases. The sequence of these nucleotide bases determines both the folded conformation and the biological function of RNA. It recently has been established that disparately related functional classes of evolved RNA possess similar base composition biases despite a lack of sequence similarity, folded structure, or metabolic function. We have proposed that intrinsic constraints on RNA structure have imposed this convergent evolution in base composition. Here, we test this hypothesis by first calculating the distribution of the mean thermodynamic stability of random RNA sequences as a function of base composition. Then, using a model describing mutation (as a random walk in sequence space) and selection (which tends to increase thermodynamic stability), we relate the computed underlying distribution of conformational stability to empirically derived, tRNA and 5S rRNA sequence data. We find a close correspondence between predicted and observed distributions of base composition.


PLOS ONE | 2016

The Implicitome: A Resource for Rationalizing Gene-Disease Associations

Kristina M. Hettne; Mark Thompson; Herman H. H. B. M. van Haagen; Eelke van der Horst; Rajaram Kaliyaperumal; Eleni Mina; Zuotian Tatum; Jeroen F. J. Laros; Erik M. van Mulligen; Martijn J. Schuemie; Emmelien Aten; Tong Shu Li; Richard Bruskiewich; Benjamin M. Good; Andrew I. Su; Jan A. Kors; Johan T. den Dunnen; Gert-Jan B. van Ommen; Marco Roos; Peter A. C. 't Hoen; Barend Mons; Erik Schultes

High-throughput experimental methods such as medical sequencing and genome-wide association studies (GWAS) identify increasingly large numbers of potential relations between genetic variants and diseases. Both biological complexity (millions of potential gene-disease associations) and the accelerating rate of data production necessitate computational approaches to prioritize and rationalize potential gene-disease relations. Here, we use concept profile technology to expose from the biomedical literature both explicitly stated gene-disease relations (the explicitome) and a much larger set of implied gene-disease associations (the implicitome). Implicit relations are largely unknown to, or are even unintended by the original authors, but they vastly extend the reach of existing biomedical knowledge for identification and interpretation of gene-disease associations. The implicitome can be used in conjunction with experimental data resources to rationalize both known and novel associations. We demonstrate the usefulness of the implicitome by rationalizing known and novel gene-disease associations, including those from GWAS. To facilitate the re-use of implicit gene-disease associations, we publish our data in compliance with FAIR Data Publishing recommendations [https://www.force11.org/group/fairgroup] using nanopublications. An online tool (http://knowledge.bio) is available to explore established and potential gene-disease associations in the context of other biomedical relations.

Collaboration


Dive into the Erik Schultes's collaboration.

Top Co-Authors

Avatar

Mark Thompson

Leiden University Medical Center

View shared research outputs
Top Co-Authors

Avatar

Marco Roos

Leiden University Medical Center

View shared research outputs
Top Co-Authors

Avatar

Barend Mons

Leiden University Medical Center

View shared research outputs
Top Co-Authors

Avatar

Rajaram Kaliyaperumal

Leiden University Medical Center

View shared research outputs
Top Co-Authors

Avatar
Top Co-Authors

Avatar

Kristina M. Hettne

Leiden University Medical Center

View shared research outputs
Top Co-Authors

Avatar

Eleni Mina

Leiden University Medical Center

View shared research outputs
Top Co-Authors

Avatar

Peter A. C. 't Hoen

Leiden University Medical Center

View shared research outputs
Top Co-Authors

Avatar

Peter Hraber

Los Alamos National Laboratory

View shared research outputs
Top Co-Authors

Avatar

Erik M. van Mulligen

Erasmus University Medical Center

View shared research outputs
Researchain Logo
Decentralizing Knowledge