Network


Latest external collaboration on country level. Dive into details by clicking on the dots.

Hotspot


Dive into the research topics where Razvan Sultana is active.

Publication


Featured researches published by Razvan Sultana.


Bioinformatics | 2003

TIGR gene indices clustering tools (TGICL): a software system for fast clustering of large est datasets

Geo Pertea; Xiaoqiu Huang; Feng Liang; Valentin Antonescu; Razvan Sultana; Svetlana Karamycheva; Yuandan Lee; Joseph White; Foo Cheung; Babak Parvizi; Jennifer Tsai; John Quackenbush

TGICL is a pipeline for analysis of large Expressed Sequence Tags (EST) and mRNA databases in which the sequences are first clustered based on pairwise sequence similarity, and then assembled by individual clusters (optionally with quality values) to produce longer, more complete consensus sequences. The system can run on multi-CPU architectures including SMP and PVM.


Nucleic Acids Research | 2001

The TIGR Gene Indices: analysis of gene transcript sequences in highly sampled eukaryotic species

John Quackenbush; Jennifer Cho; Daniel Lee; Feng Liang; Ingeborg Holt; Svetlana Karamycheva; Babak Parvizi; Geo Pertea; Razvan Sultana; Joseph White

While genome sequencing projects are advancing rapidly, EST sequencing and analysis remains a primary research tool for the identification and categorization of gene sequences in a wide variety of species and an important resource for annotation of genomic sequence. The TIGR Gene Indices (http://www.tigr.org/tdb/tgi. shtml) are a collection of species-specific databases that use a highly refined protocol to analyze EST sequences in an attempt to identify the genes represented by that data and to provide additional information regarding those genes. Gene Indices are constructed by first clustering, then assembling EST and annotated gene sequences from GenBank for the targeted species. This process produces a set of unique, high-fidelity virtual transcripts, or Tentative Consensus (TC) sequences. The TC sequences can be used to provide putative genes with functional annotation, to link the transcripts to mapping and genomic sequence data, to provide links between orthologous and paralogous genes and as a resource for comparative sequence analysis.


Nature Methods | 2005

Independence and reproducibility across microarray platforms

Jennie E. Larkin; Bryan Frank; Haralambos Gavras; Razvan Sultana; John Quackenbush

Microarrays have been widely used for the analysis of gene expression, but the issue of reproducibility across platforms has yet to be fully resolved. To address this apparent problem, we compared gene expression between two microarray platforms: the short oligonucleotide Affymetrix Mouse Genome 430 2.0 GeneChip and a spotted cDNA array using a mouse model of angiontensin II–induced hypertension. RNA extracted from treated mice was analyzed using Affymetrix and cDNA platforms and then by quantitative RT-PCR (qRT-PCR) for validation of specific genes. For the 11,710 genes present on both arrays, we assessed the relative impact of experimental treatment and platform on measured expression and found that biological treatment had a far greater impact on measured expression than did platform for more than 90% of genes, a result validated by qRT-PCR. In the small number of cases in which platforms yielded discrepant results, qRT-PCR generally did not confirm either set of data, suggesting that sequence-specific effects may make expression predictions difficult to make using any technique.


Nucleic Acids Research | 2004

The TIGR Gene Indices: clustering and assembling EST and known genes and integration with eukaryotic genomes.

Yuandan Lee; Jennifer Tsai; Sirisha Sunkara; Svetlana Karamycheva; Geo Pertea; Razvan Sultana; Valentin Antonescu; Agnes P. Chan; Foo Cheung; John Quackenbush

Although the list of completed genome sequencing projects has expanded rapidly, sequencing and analysis of expressed sequence tags (ESTs) remain a primary tool for discovery of novel genes in many eukaryotes and a key element in genome annotation. The TIGR Gene Indices (http://www.tigr.org/tdb/tgi) are a collection of 77 species-specific databases that use a highly refined protocol to analyze gene and EST sequences in an attempt to identify and characterize expressed transcripts and to present them on the Web in a user-friendly, consistent fashion. A Gene Index database is constructed for each selected organism by first clustering, then assembling EST and annotated cDNA and gene sequences from GenBank. This process produces a set of unique, high-fidelity virtual transcripts, or tentative consensus (TC) sequences. The TC sequences can be used to provide putative genes with functional annotation, to link the transcripts to genetic and physical maps, to provide links to orthologous and paralogous genes, and as a resource for comparative and functional genomic analysis.


Plant Physiology | 2005

The Institute for Genomic Research Osa1 Rice Genome Annotation Database

Qiaoping Yuan; Shu Ouyang; Aihui Wang; Wei Zhu; Rama Maiti; Haining Lin; John P. Hamilton; Brian J. Haas; Razvan Sultana; Foo Cheung; Jennifer R. Wortman; C. Robin Buell

We have developed a rice (Oryza sativa) genome annotation database (Osa1) that provides structural and functional annotation for this emerging model species. Using the sequence of O. sativa subsp. japonica cv Nipponbare from the International Rice Genome Sequencing Project, pseudomolecules, or virtual contigs, of the 12 rice chromosomes were constructed. Our most recent release, version 3, represents our third build of the pseudomolecules and is composed of 98% finished sequence. Genes were identified using a series of computational methods developed for Arabidopsis (Arabidopsis thaliana) that were modified for use with the rice genome. In release 3 of our annotation, we identified 57,915 genes, of which 14,196 are related to transposable elements. Of these 43,719 nontransposable element-related genes, 18,545 (42.4%) were annotated with a putative function, 5,777 (13.2%) were annotated as encoding an expressed protein with no known function, and the remaining 19,397 (44.4%) were annotated as encoding a hypothetical protein. Multiple splice forms (5,873) were detected for 2,538 genes, resulting in a total of 61,250 gene models in the rice genome. We incorporated experimental evidence into 18,252 gene models to improve the quality of the structural annotation. A series of functional data types has been annotated for the rice genome that includes alignment with genetic markers, assignment of gene ontologies, identification of flanking sequence tags, alignment with homologs from related species, and syntenic mapping with other cereal species. All structural and functional annotation data are available through interactive search and display windows as well as through download of flat files. To integrate the data with other genome projects, the annotation data are available through a Distributed Annotation System and a Genome Browser. All data can be obtained through the project Web pages at http://rice.tigr.org.


Nature Genetics | 2013

Genomic analysis identifies targets of convergent positive selection in drug-resistant Mycobacterium tuberculosis.

Maha R. Farhat; B. Jesse Shapiro; Karen J. Kieser; Razvan Sultana; Karen R. Jacobson; Thomas C. Victor; Robin M. Warren; Elizabeth M. Streicher; Alistair Calver; Alexander Sloutsky; Devinder Kaur; Jamie E. Posey; Bonnie B. Plikaytis; Marco R. Oggioni; Jennifer L. Gardy; James C. Johnston; Mabel Rodrigues; Patrick Tang; Midori Kato-Maeda; Mark L. Borowsky; Bhavana Muddukrishna; Barry N. Kreiswirth; Natalia Kurepina; James E. Galagan; Sebastien Gagneux; Bruce Birren; Eric J. Rubin; Eric S. Lander; Pardis C. Sabeti; Megan Murray

M. tuberculosis is evolving antibiotic resistance, threatening attempts at tuberculosis epidemic control. Mechanisms of resistance, including genetic changes favored by selection in resistant isolates, are incompletely understood. Using 116 newly sequenced and 7 previously sequenced M. tuberculosis whole genomes, we identified genome-wide signatures of positive selection specific to the 47 drug-resistant strains. By searching for convergent evolution—the independent fixation of mutations in the same nucleotide position or gene—we recovered 100% of a set of known resistance markers. We also found evidence of positive selection in an additional 39 genomic regions in resistant isolates. These regions encode components in cell wall biosynthesis, transcriptional regulation and DNA repair pathways. Mutations in these regions could directly confer resistance or compensate for fitness costs associated with resistance. Functional genetic analysis of mutations in one gene, ponA1, demonstrated an in vitro growth advantage in the presence of the drug rifampicin.


Plant Physiology | 2003

Comparative Analyses of Potato Expressed Sequence Tag Libraries

Catherine M. Ronning; Svetlana Stegalkina; Robert A. Ascenzi; Oleg Bougri; Amy L. Hart; Teresa R. Utterbach; Susan E. Vanaken; Steve B. Riedmuller; Joseph White; Jennifer Cho; Geo Pertea; Yuandan Lee; Svetlana Karamycheva; Razvan Sultana; Jennifer Tsai; John Quackenbush; H. M. Griffiths; Silvia Restrepo; Christine D. Smart; William E. Fry; Rutger Van der Hoeven; Steve Tanksley; Peifen Zhang; Hailing Jin; Miki L. Yamamoto; Barbara Baker; C. Robin Buell

The cultivated potato (Solanum tuberosum) shares similar biology with other members of the Solanaceae, yet has features unique within the family, such as modified stems (stolons) that develop into edible tubers. To better understand potato biology, we have undertaken a survey of the potato transcriptome using expressed sequence tags (ESTs) from diverse tissues. A total of 61,940 ESTs were generated from aerial tissues, below-ground tissues, and tissues challenged with the late-blight pathogen (Phytophthora infestans). Clustering and assembly of these ESTs resulted in a total of 19,892 unique sequences with 8,741 tentative consensus sequences and 11,151 singleton ESTs. We were able to identify a putative function for 43.7% of these sequences. A number of sequences (48) were expressed throughout the libraries sampled, representing constitutively expressed sequences. Other sequences (13,068, 21%) were uniquely expressed and were detected only in a single library. Using hierarchal and k means clustering of the EST sequences, we were able to correlate changes in gene expression with major physiological events in potato biology. Using pair-wise comparisons of tuber-related tissues, we were able to associate genes with tuber initiation, dormancy, and sprouting. We also were able to identify a number of characterized as well as novel sequences that were unique to the incompatible interaction of late-blight pathogen, thereby providing a foundation for further understanding the mechanism of resistance.


Genome Biology | 2001

RESOURCERER: a database for annotating and linking microarray resources within and across species

Jennifer Tsai; Razvan Sultana; Yudan Lee; Geo Pertea; Svetlana Karamycheva; Valentin Antonescu; Jennifer Cho; Babak Parvizi; Foo Cheung; John Quackenbush

Microarray expression analysis is providing unprecedented data on gene expression in humans and mammalian model systems. Although such studies provide a tremendous resource for understanding human disease states, one of the significant challenges is cross-referencing the data derived from different species, across diverse expression analysis platforms, in order to properly derive inferences regarding gene expression and disease state. To address this problem, we have developed RESOURCERER, a microarray-resource annotation and cross-reference database built using the analysis of expressed sequence tags (ESTs) and gene sequences provided by the TIGR Gene Index (TGI) and TIGR Orthologous Gene Alignment (TOGA) databases [now called Eukaryotic Gene Orthologs (EGO)].


Mammalian Genome | 2002

Porcine gene discovery by normalized cDNA-library sequencing and EST cluster assembly.

Scott C. Fahrenkrug; T. P. L. Smith; Brad A. Freking; Jennifer Cho; Joseph White; J. L. Vallet; T. H. Wise; G. A. Rohrer; Geo Pertea; Razvan Sultana; John Quackenbush; J. W. Keele

Genetic and environmental factors affect the efficiency of pork production by influencing gene expression during porcine reproduction, tissue development, and growth. The identification and functional analysis of gene products important to these processes would be greatly enhanced by the development of a database of expressed porcine gene sequence. Two normalized porcine cDNA libraries (MARC 1PIG and MARC 2PIG), derived respectively from embryonic and reproductive tissues, were constructed, sequenced, and analyzed. A total of 66,245 clones from these two libraries were 5?-end sequenced and deposited in GenBank. Cluster analysis revealed that within-library redundancy is low, and comparison of all porcine ESTs with the human database suggests that the sequences from these two libraries represent portions of a significant number of independent pig genes. A Porcine Gene Index (PGI), comprising 15,616 tentative consensus sequences and 31,466 singletons, includes all sequences in public repositories and has been developed to facilitate further comparative map development and characterization of porcine genes (http://www.tigr.org/tdb/ssgi/). The clones and sequences from these libraries provide a catalog of expressed porcine genes and a resource for development of high-density hybridization arrays for transcriptional profiling of porcine tissues. In addition, comparison of porcine ESTs with sequences from other species serves as a valuable resource for comparative map development. Both arrayed cDNA libraries are available for unrestricted public use.


Nucleic Acids Research | 2010

GeneSigDB—a curated database of gene expression signatures

Aedín C. Culhane; Thomas Schwarzl; Razvan Sultana; Kermshlise C. Picard; Shaita C. Picard; Tim H. Lu; Katherine R. Franklin; Simon J. French; Gerald Papenhausen; Mick Correll; John Quackenbush

The primary objective of most gene expression studies is the identification of one or more gene signatures; lists of genes whose transcriptional levels are uniquely associated with a specific biological phenotype. Whilst thousands of experimentally derived gene signatures are published, their potential value to the community is limited by their computational inaccessibility. Gene signatures are embedded in published article figures, tables or in supplementary materials, and are frequently presented using non-standard gene or probeset nomenclature. We present GeneSigDB (http://compbio.dfci.harvard.edu/genesigdb) a manually curated database of gene expression signatures. GeneSigDB release 1.0 focuses on cancer and stem cells gene signatures and was constructed from more than 850 publications from which we manually transcribed 575 gene signatures. Most gene signatures (n = 560) were successfully mapped to the genome to extract standardized lists of EnsEMBL gene identifiers. GeneSigDB provides the original gene signature, the standardized gene list and a fully traceable gene mapping history for each gene from the original transcribed data table through to the standardized list of genes. The GeneSigDB web portal is easy to search, allows users to compare their own gene list to those in the database, and download gene signatures in most common gene identifier formats.

Collaboration


Dive into the Razvan Sultana's collaboration.

Top Co-Authors

Avatar
Top Co-Authors

Avatar

Geo Pertea

Johns Hopkins University

View shared research outputs
Top Co-Authors

Avatar

Foo Cheung

J. Craig Venter Institute

View shared research outputs
Top Co-Authors

Avatar

Jennifer Cho

J. Craig Venter Institute

View shared research outputs
Top Co-Authors

Avatar

Jennifer Tsai

J. Craig Venter Institute

View shared research outputs
Top Co-Authors

Avatar
Top Co-Authors

Avatar
Top Co-Authors

Avatar
Top Co-Authors

Avatar
Top Co-Authors

Avatar
Researchain Logo
Decentralizing Knowledge