Alexander Kozik | Researchain

Archive Network Publication Hotspot Collaboration

Network

Latest external collaboration on country level. Dive into details by clicking on the dots.

Explore More

Hotspot

Dive into the research topics where Alexander Kozik is active.

Explore More

Publication

Featured researches published by Alexander Kozik.

The Plant Cell | 2003

Genome-Wide Analysis of NBS-LRR–Encoding Genes in Arabidopsis

Blake C. Meyers; Alexander Kozik; Alyssa Griego; Hanhui Kuang; Richard W. Michelmore

The Arabidopsis genome contains ∼200 genes that encode proteins with similarity to the nucleotide binding site and other domains characteristic of plant resistance proteins. Through a reiterative process of sequence analysis and reannotation, we identified 149 NBS-LRR–encoding genes in the Arabidopsis (ecotype Columbia) genomic sequence. Fifty-six of these genes were corrected from earlier annotations. At least 12 are predicted to be pseudogenes. As described previously, two distinct groups of sequences were identified: those that encoded an N-terminal domain with Toll/Interleukin-1 Receptor homology (TIR-NBS-LRR, or TNL), and those that encoded an N-terminal coiled-coil motif (CC-NBS-LRR, or CNL). The encoded proteins are distinct from the 58 predicted adapter proteins in the previously described TIR-X, TIR-NBS, and CC-NBS groups. Classification based on protein domains, intron positions, sequence conservation, and genome distribution defined four subgroups of CNL proteins, eight subgroups of TNL proteins, and a pair of divergent NL proteins that lack a defined N-terminal motif. CNL proteins generally were encoded in single exons, although two subclasses were identified that contained introns in unique positions. TNL proteins were encoded in modular exons, with conserved intron positions separating distinct protein domains. Conserved motifs were identified in the LRRs of both CNL and TNL proteins. In contrast to CNL proteins, TNL proteins contained large and variable C-terminal domains. The extant distribution and diversity of the NBS-LRR sequences has been generated by extensive duplication and ectopic rearrangements that involved segmental duplications as well as microscale events. The observed diversity of these NBS-LRR proteins indicates the variety of recognition molecules available in an individual genotype to detect diverse biotic challenges.

Molecular Biology and Evolution | 2008

Multiple Paleopolyploidizations during the Evolution of the Compositae Reveal Parallel Patterns of Duplicate Gene Retention after Millions of Years

Michael S. Barker; Nolan C. Kane; Marta Matvienko; Alexander Kozik; Richard W. Michelmore; Steven J. Knapp; Loren H. Rieseberg

Of the approximately 250,000 species of flowering plants, nearly one in ten are members of the Compositae (Asteraceae), a diverse family found in almost every habitat on all continents except Antarctica. With an origin in the mid Eocene, the Compositae is also a relatively young family with remarkable diversifications during the last 40 My. Previous cytologic and systematic investigations suggested that paleopolyploidy may have occurred in at least one Compositae lineage, but a recent analysis of genomic data was equivocal. We tested for evidence of paleopolyploidy in the evolutionary history of the family using recently available expressed sequence tag (EST) data from the Compositae Genome Project. Combined with data available on GenBank, we analyzed nearly 1 million ESTs from 18 species representing seven genera and four tribes. Our analyses revealed at least three ancient whole-genome duplications in the Compositae-a paleopolyploidization shared by all analyzed taxa and placed near the origin of the family just prior to the rapid radiation of its tribes and independent genome duplications near the base of the tribes Mutisieae and Heliantheae. These results are consistent with previous research implicating paleopolyploidy in the evolution and diversification of the Heliantheae. Further, we observed parallel retention of duplicate genes from the basal Compositae genome duplication across all tribes, despite divergence times of 33-38 My among these lineages. This pattern of retention was also repeated for the paleologs from the Heliantheae duplication. Intriguingly, the categories of genes retained in duplicate were substantially different from those in Arabidopsis. In particular, we found that genes annotated to structural components or cellular organization Gene Ontology categories were significantly enriched among paleologs, whereas genes associated with transcription and other regulatory functions were significantly underrepresented. Our results suggest that paleopolyploidy can yield strikingly consistent signatures of gene retention in plant genomes despite extensive lineage radiations and recurrent genome duplications but that these patterns vary substantially among higher taxonomic categories.

Nature Genetics | 2016

The genome sequences of Arachis duranensis and Arachis ipaensis , the diploid ancestors of cultivated peanut

David J. Bertioli; Steven B. Cannon; Lutz Froenicke; Guodong Huang; Andrew D. Farmer; Ethalinda K. S. Cannon; Xin Liu; Dongying Gao; Josh Clevenger; Sudhansu Dash; Longhui Ren; Márcio C. Moretzsohn; Kenta Shirasawa; Wei Huang; Bruna Vidigal; Brian Abernathy; Ye Chu; Chad E. Niederhuth; Pooja E. Umale; Ana Claudia Guerra Araujo; Alexander Kozik; Kyung Do Kim; Mark D. Burow; Rajeev K. Varshney; Xingjun Wang; Xinyou Zhang; Noelle A. Barkley; Patricia M. Guimarães; Sachiko Isobe; Baozhu Guo

Cultivated peanut (Arachis hypogaea) is an allotetraploid with closely related subgenomes of a total size of ∼2.7 Gb. This makes the assembly of chromosomal pseudomolecules very challenging. As a foundation to understanding the genome of cultivated peanut, we report the genome sequences of its diploid ancestors (Arachis duranensis and Arachis ipaensis). We show that these genomes are similar to cultivated peanuts A and B subgenomes and use them to identify candidate disease resistance genes, to guide tetraploid transcript assemblies and to detect genetic exchange between cultivated peanuts subgenomes. On the basis of remarkably high DNA identity of the A. ipaensis genome and the B subgenome of cultivated peanut and biogeographic evidence, we conclude that A. ipaensis may be a direct descendant of the same population that contributed the B subgenome to cultivated peanut.

BMC Plant Biology | 2007

Global expression analysis of nucleotide binding site-leucine rich repeat-encoding and related genes in Arabidopsis

Xiaoping Tan; Blake C. Meyers; Alexander Kozik; Marilyn A. L. West; Michele Morgante; Dina A. St. Clair; Andrew F. Bent; Richard W. Michelmore

BackgroundNucleotide binding site-leucine rich repeat (NBS-LRR)-encoding genes comprise the largest class of plant disease resistance genes. The 149 NBS-LRR-encoding genes and the 58 related genes that do not encode LRRs represent approximately 0.8% of all ORFs so far annotated in Arabidopsis ecotype Col-0. Despite their prevalence in the genome and functional importance, there was little information regarding expression of these genes.ResultsWe analyzed the expression patterns of ~170 NBS-LRR-encoding and related genes in Arabidopsis Col-0 using multiple analytical approaches: expressed sequenced tag (EST) representation, massively parallel signature sequencing (MPSS), microarray analysis, rapid amplification of cDNA ends (RACE) PCR, and gene trap lines. Most of these genes were expressed at low levels with a variety of tissue specificities. Expression was detected by at least one approach for all but 10 of these genes. The expression of some but not the majority of NBS-LRR-encoding and related genes was affected by salicylic acid (SA) treatment; the response to SA varied among different accessions. An analysis of previously published microarray data indicated that ten NBS-LRR-encoding and related genes exhibited increased expression in wild-type Landsberg erecta (Ler) after flagellin treatment. Several of these ten genes also showed altered expression after SA treatment, consistent with the regulation of R gene expression during defense responses and overlap between the basal defense response and salicylic acid signaling pathways. Enhancer trap analysis indicated that neither jasmonic acid nor benzothiadiazole (BTH), a salicylic acid analog, induced detectable expression of the five NBS-LRR-encoding genes and one TIR-NBS-encoding gene tested; however, BTH did induce detectable expression of the other TIR-NBS-encoding gene analyzed. Evidence for alternative mRNA polyadenylation sites was observed for many of the tested genes. Evidence for alternative splicing was found for at least 12 genes, 11 of which encode TIR-NBS-LRR proteins. There was no obvious correlation between expression pattern, phylogenetic relationship or genomic location of the NBS-LRR-encoding and related genes studied.ConclusionTranscripts of many NBS-LRR-encoding and related genes were defined. Most were present at low levels and exhibited tissue-specific expression patterns. Expression data are consistent with most Arabidopsis NBS-LRR-encoding and related genes functioning in plant defense responses but do not preclude other biological roles.

PLOS ONE | 2011

Next Generation Sequencing Provides Rapid Access to the Genome of Puccinia striiformis f. sp. tritici, the Causal Agent of Wheat Stripe Rust

Dario Cantu; Manjula Govindarajulu; Alexander Kozik; Meinan Wang; Xianming Chen; Kenji K. Kojima; Jerzy Jurka; Richard W. Michelmore; Jorge Dubcovsky

(XLSX)

BMC Genomics | 2012

De novo assembly of the pepper transcriptome (Capsicum annuum): a benchmark for in silico discovery of SNPs, SSRs and candidate genes

Hamid Ashrafi; Theresa Hill; Kevin Stoffel; Alexander Kozik; Jiqiang Yao; Sebastián Reyes Chin-Wo; Allen Van Deynze

BackgroundMolecular breeding of pepper (Capsicum spp.) can be accelerated by developing DNA markers associated with transcriptomes in breeding germplasm. Before the advent of next generation sequencing (NGS) technologies, the majority of sequencing data were generated by the Sanger sequencing method. By leveraging Sanger EST data, we have generated a wealth of genetic information for pepper including thousands of SNPs and Single Position Polymorphic (SPP) markers. To complement and enhance these resources, we applied NGS to three pepper genotypes: Maor, Early Jalapeño and Criollo de Morelos-334 (CM334) to identify SNPs and SSRs in the assembly of these three genotypes.ResultsTwo pepper transcriptome assemblies were developed with different purposes. The first reference sequence, assembled by CAP3 software, comprises 31,196 contigs from >125,000 Sanger-EST sequences that were mainly derived from a Korean F1-hybrid line, Bukang. Overlapping probes were designed for 30,815 unigenes to construct a pepper Affymetrix GeneChip® microarray for whole genome analyses. In addition, custom Python scripts were used to identify 4,236 SNPs in contigs of the assembly. A total of 2,489 simple sequence repeats (SSRs) were identified from the assembly, and primers were designed for the SSRs. Annotation of contigs using Blast2GO software resulted in information for 60% of the unigenes in the assembly. The second transcriptome assembly was constructed from more than 200 million Illumina Genome Analyzer II reads (80–120 nt) using a combination of Velvet, CLC workbench and CAP3 software packages. BWA, SAMtools and in-house Perl scripts were used to identify SNPs among three pepper genotypes. The SNPs were filtered to be at least 50 bp from any intron-exon junctions as well as flanking SNPs. More than 22,000 high-quality putative SNPs were identified. Using the MISA software, 10,398 SSR markers were also identified within the Illumina transcriptome assembly and primers were designed for the identified markers. The assembly was annotated by Blast2GO and 14,740 (12%) of annotated contigs were associated with functional proteins.ConclusionsBefore availability of pepper genome sequence, assembling transcriptomes of this economically important crop was required to generate thousands of high-quality molecular markers that could be used in breeding programs. In order to have a better understanding of the assembled sequences and to identify candidate genes underlying QTLs, we annotated the contigs of Sanger-EST and Illumina transcriptome assemblies. These and other information have been curated in a database that we have dedicated for pepper project.

Molecular Genetics and Genomics | 2008

Genetic diversity and genomic distribution of homologs encoding NBS-LRR disease resistance proteins in sunflower

Osman Radwan; Sonali Gandhi; Adam Heesacker; Brett Whitaker; Christopher A Taylor; Alex Plocik; Rick Kesseli; Alexander Kozik; Richard W. Michelmore; Steven J. Knapp

Three-fourths of the recognition-dependent disease resistance genes (R-genes) identified in plants encode nucleotide binding site (NBS) leucine-rich repeat (LRR) proteins. NBS-LRR homologs have only been isolated on a limited scale from sunflower (Helianthus annuus L.), and most of the previously identified homologs are members of two large NBS-LRR clusters harboring downy mildew R-genes. We mined the sunflower EST database and used comparative genomics approaches to develop a deeper understanding of the diversity and distribution of NBS-LRR homologs in the sunflower genome. Collectively, 630 NBS-LRR homologs were identified, 88 by mining a database of 284,241 sunflower ESTs and 542 by sequencing 1,248 genomic DNA amplicons isolated from common and wild sunflower species. DNA markers were developed from 196 unique NBS-LRR sequences and facilitated genetic mapping of 167 NBS-LRR loci. The latter were distributed throughout the sunflower genome in 44 clusters or singletons. Wild species ESTs were a particularly rich source of novel NBS-LRR homologs, many of which were tightly linked to previously mapped downy mildew, rust, and broomrape R-genes. The DNA sequence and mapping resources described here should facilitate the discovery and isolation of recognition-dependent R-genes guarding sunflower from a broad spectrum of economically important diseases.

American Journal of Botany | 2012

Genomics of Compositae weeds: EST libraries, microarrays, and evidence of introgression.

Zhao Lai; Nolan C. Kane; Alexander Kozik; Kathryn A. Hodgins; Katrina M. Dlugosch; Michael S. Barker; Marta Matvienko; Qian Yu; Kathryn G. Turner; Stephanie A. Pearl; Graeme D.M. Bell; Yi Zou; Chris Grassa; Alessia Guggisberg; Keith L. Adams; James V. Anderson; David P. Horvath; Rick Kesseli; John M. Burke; Richard W. Michelmore; Loren H. Rieseberg

PREMISE OF STUDY Weeds cause considerable environmental and economic damage. However, genomic characterization of weeds has lagged behind that of model plants and crop species. Here we describe the development of genomic tools and resources for 11 weeds from the Compositae family that will serve as a basis for subsequent population and comparative genomic analyses. Because hybridization has been suggested as a stimulus for the evolution of invasiveness, we also analyze these genomic data for evidence of hybridization. METHODS We generated 22 expressed sequence tag (EST) libraries for the 11 targeted weeds using Sanger, 454, and Illumina sequencing, compared the coverage and quality of sequence assemblies, and developed NimbleGen microarrays for expression analyses in five taxa. When possible, we also compared the distributions of Ks values between orthologs of congeneric taxa to detect and quantify hybridization and introgression. RESULTS Gene discovery was enhanced by sequencing from multiple tissues, normalization of cDNA libraries, and especially greater sequencing depth. However, assemblies from short sequence reads sometimes failed to resolve close paralogs. Substantial introgression was detected in Centaurea and Helianthus, but not in Ambrosia and Lactuca. CONCLUSIONS Transcriptome sequencing using next-generation platforms has greatly reduced the cost of genomic studies of nonmodel organisms, and the ESTs and microarrays reported here will accelerate evolutionary and molecular investigations of Compositae weeds. Our study also shows how ortholog comparisons can be used to approximately estimate the genome-wide extent of introgression and to identify genes that have been exchanged between hybridizing taxa.

BMC Genomics | 2007

Diversity in conserved genes in tomato

Allen Van Deynze; Kevin Stoffel; C. Robin Buell; Alexander Kozik; Jia Liu; Esther van der Knaap; David M. Francis

BackgroundTomato has excellent genetic and genomic resources including a broad set of Expressed Sequence Tag (EST) data and high-density genetic maps. In addition, emerging physical maps and bacterial artificial clone sequence data serve as template to investigate genetic variation within the cultivated germplasm pool with the goal to manipulate agriculturally important traits. Unfortunately, the nearly exclusive focus of resource development on interspecific populations for genetic analyses and diversity studies has left a void in our understanding of genotypic variation within tomato breeding programs that focus on intra-specific populations. We describe the results of a study to identify nucleotide variation within tomato breeding germplasm and mapping parents for a set of conserved single-copy ESTs that are orthologous between tomato and Arabidopsis.ResultsUsing a pooled sequencing strategy, 967 tomato transcripts were screened for polymorphism in 12 tomato lines. Although intron position was conserved, intron lengths were 2-fold larger in tomato than in Arabidopsis. A total of 1,487 single nucleotide polymorphisms and 282 insertion/deletions were identified, of which 579 and 206 were polymorphic in breeding germplasm, respectively. Fresh market and processing germplasm were clearly divergent, as were Solanum lycopersicum var. cerasiformae and Solanum pimpinellifolium, tomatos closest relatives. The polymorphisms identified serve as marker resources for tomato. The COS is also applicable to other Solanaceae crops.ConclusionsThe results from this research enabled significant progress towards bridging the gap between genetic and genomic resources developed for populations derived from wide crosses and those applicable to intra-specific crosses for breeding in tomato.

Plant Physiology | 2009

Comparative Large-Scale Analysis of Interactions between Several Crop Species and the Effector Repertoires from Multiple Pathovars of Pseudomonas and Ralstonia

Tadeusz Wroblewski; Katherine S. Caldwell; Urszula Piskurewicz; Keri A. Cavanaugh; Huaqin Xu; Alexander Kozik; Oswaldo Ochoa; Leah K. McHale; Kirsten A. Lahre; Joanna Jelenska; J. Castillo; Daniel Blumenthal; Boris A. Vinatzer; Jean T. Greenberg; Richard W. Michelmore

Bacterial plant pathogens manipulate their hosts by injection of numerous effector proteins into host cells via type III secretion systems. Recognition of these effectors by the host plant leads to the induction of a defense reaction that often culminates in a hypersensitive response manifested as cell death. Genes encoding effector proteins can be exchanged between different strains of bacteria via horizontal transfer, and often individual strains are capable of infecting multiple hosts. Host plant species express diverse repertoires of resistance proteins that mediate direct or indirect recognition of bacterial effectors. As a result, plants and their bacterial pathogens should be considered as two extensive coevolving groups rather than as individual host species coevolving with single pathovars. To dissect the complexity of this coevolution, we cloned 171 effector-encoding genes from several pathovars of Pseudomonas and Ralstonia. We used Agrobacterium tumefaciens-mediated transient assays to test the ability of each effector to induce a necrotic phenotype on 59 plant genotypes belonging to four plant families, including numerous diverse accessions of lettuce (Lactuca sativa) and tomato (Solanum lycopersicum). Known defense-inducing effectors (avirulence factors) and their homologs commonly induced extensive necrosis in many different plant species. Nonhost species reacted to multiple effector proteins from an individual pathovar more frequently and more intensely than host species. Both homologous and sequence-unrelated effectors could elicit necrosis in a similar spectrum of plants, suggesting common effector targets or targeting of the same pathways in the plant cell.

Explore More