Arwa Shahin
Wageningen University and Research Centre
Network
Latest external collaboration on country level. Dive into details by clicking on the dots.
Publication
Featured researches published by Arwa Shahin.
BMC Genomics | 2012
Arwa Shahin; Martijn van Kaauwen; Danny Esselink; Joachim W. Bargsten; Jaap M. van Tuyl; Richard G. F. Visser; Paul Arens
BackgroundBulbous flowers such as lily and tulip (Liliaceae family) are monocot perennial herbs that are economically very important ornamental plants worldwide. However, there are hardly any genetic studies performed and genomic resources are lacking. To build genomic resources and develop tools to speed up the breeding in both crops, next generation sequencing was implemented. We sequenced and assembled transcriptomes of four lily and five tulip genotypes using 454 pyro-sequencing technology.ResultsSuccessfully, we developed the first set of 81,791 contigs with an average length of 514 bp for tulip, and enriched the very limited number of 3,329 available ESTs (Expressed Sequence Tags) for lily with 52,172 contigs with an average length of 555 bp. The contigs together with singletons covered on average 37% of lily and 39% of tulip estimated transcriptome. Mining lily and tulip sequence data for SSRs (Simple Sequence Repeats) showed that di-nucleotide repeats were twice more abundant in UTRs (UnTranslated Regions) compared to coding regions, while tri-nucleotide repeats were equally spread over coding and UTR regions. Two sets of single nucleotide polymorphism (SNP) markers suitable for high throughput genotyping were developed. In the first set, no SNPs flanking the target SNP (50 bp on either side) were allowed. In the second set, one SNP in the flanking regions was allowed, which resulted in a 2 to 3 fold increase in SNP marker numbers compared with the first set. Orthologous groups between the two flower bulbs: lily and tulip (12,017 groups) and among the three monocot species: lily, tulip, and rice (6,900 groups) were determined using OrthoMCL. Orthologous groups were screened for common SNP markers and EST-SSRs to study synteny between lily and tulip, which resulted in 113 common SNP markers and 292 common EST-SSR. Lily and tulip contigs generated were annotated and described according to Gene Ontology terminology.ConclusionsTwo transcriptome sets were built that are valuable resources for marker development, comparative genomic studies and candidate gene approaches. Next generation sequencing of leaf transcriptome is very effective; however, deeper sequencing and using more tissues and stages is advisable for extended comparative studies.
BMC Research Notes | 2012
Arwa Shahin; Thomas van Gurp; Sander A. Peters; Richard G. F. Visser; Jaap M. van Tuyl; Paul Arens
BackgroundSNP (Single Nucleotide Polymorphism) markers are rapidly becoming the markers of choice for applications in breeding because of next generation sequencing technology developments. For SNP development by NGS technologies, correct assembly of the huge amounts of sequence data generated is essential. Little is known about assemblers performance, especially when dealing with highly heterogeneous species that show a high genome complexity and what the possible consequences are of differences in assemblies on SNP retrieval. This study tested two assemblers (CAP3 and CLC) on 454 data from four lily genotypes and compared results with respect to SNP retrieval.ResultsCAP3 assembly resulted in higher numbers of contigs, lower numbers of reads per contig, and shorter average read lengths compared to CLC. Blast comparisons showed that CAP3 contigs were highly redundant. Contrastingly, CLC in rare cases combined paralogs in one contig. Redundant and chimeric contigs may lead to erroneous SNPs. Filtering for redundancy can be done by blasting selected SNP markers to the contigs and discarding all the SNP markers that show more than one blast hit. Results on chimeric contigs showed that only four out of 2,421 SNP markers were selected from chimeric contigs.ConclusionIn practice, CLC performs better in assembling highly heterogeneous genome sequences compared to CAP3, and consequently SNP retrieval is more efficient. Additionally a simple flow scheme is suggested for SNP marker retrieval that can be valid for all non-model species.
Frontiers in Plant Science | 2014
Arwa Shahin; M.J.M. Smulders; Jaap M. van Tuyl; Paul Arens; Freek T. Bakker
Next Generation Sequencing (NGS) may enable estimating relationships among genotypes using allelic variation of multiple nuclear genes simultaneously. We explored the potential and caveats of this strategy in four genetically distant Lilium cultivars to estimate their genetic divergence from transcriptome sequences using three approaches: POFAD (Phylogeny of Organisms from Allelic Data, uses allelic information of sequence data), RAxML (Randomized Accelerated Maximum Likelihood, tree building based on concatenated consensus sequences) and Consensus Network (constructing a network summarizing among gene tree conflicts). Twenty six gene contigs were chosen based on the presence of orthologous sequences in all cultivars, seven of which also had an orthologous sequence in Tulipa, used as out-group. The three approaches generated the same topology. Although the resolution offered by these approaches is high, in this case there was no extra benefit in using allelic information. We conclude that these 26 genes can be widely applied to construct a species tree for the genus Lilium.
Molecular Breeding | 2017
Yiqian Fu; Alex van Silfhout; Arwa Shahin; Ronny Egberts; Martin Beers; Ans van der Velde; Adrie van Houten; Jaap M. van Tuyl; Richard G. F. Visser; Paul Arens
Gerbera hybrida is an economically important cut flower. In the production and transportation of gerbera with unavoidable periods of high relative humidity, grey mould occurs and results in losses in quality and quantity of flowers. Considering the limitations of chemical use in greenhouses and the impossibility to use these chemicals in auction or after sale, breeding for resistant gerbera cultivars is considered as the best practical approach. In this study, we developed two segregating F1 populations (called S and F). Four parental linkage maps were constructed using common and parental specific SNP markers developed from expressed sequence tag sequencing. Parental genetic maps, containing 30, 29, 27 and 28 linkage groups and a consensus map covering 24 of the 25 expected chromosomes, could be constructed. After evaluation of Botrytis disease severity using three different tests, whole inflorescence, bottom (of disc florets) and ray floret, quantitative trait locus (QTL) mapping was performed using the four individual parental maps. A total of 20 QTLs (including one identical QTL for whole inflorescence and bottom tests) were identified in the parental maps of the two populations. The number of QTLs found and the explained variance of most QTLs detected reflect the complex mechanism of Botrytis disease response.
Bioinformatics | 2018
Peter M. Bourke; Geert van Geest; Roeland E. Voorrips; Johannes Jansen; Twan Kranenburg; Arwa Shahin; Richard G. F. Visser; Paul Arens; M.J.M. Smulders; Chris Maliepaard
Motivation: Polyploid species carry more than two copies of each chromosome, a condition found in many of the worlds most important crops. Genetic mapping in polyploids is more complex than in diploid species, resulting in a lack of available software tools. These are needed if we are to realize all the opportunities offered by modern genotyping platforms for genetic research and breeding in polyploid crops. Results: polymapR is an R package for genetic linkage analysis and integrated genetic map construction from bi‐parental populations of outcrossing autopolyploids. It can currently analyse triploid, tetraploid and hexaploid marker datasets and is applicable to various crops including potato, leek, alfalfa, blueberry, chrysanthemum, sweet potato or kiwifruit. It can detect, estimate and correct for preferential chromosome pairing, and has been tested on high‐density marker datasets from potato, rose and chrysanthemum, generating high‐density integrated linkage maps in all of these crops. Availability and implementation: polymapR is freely available under the general public license from the Comprehensive R Archive Network (CRAN) at http://cran.r‐project.org/package=polymapR. Supplementary information: Supplementary data are available at Bioinformatics online.
Molecular Breeding | 2017
Yiqian Fu; Alex van Silfhout; Arwa Shahin; Ronny Egberts; Martin Beers; Ans van der Velde; Adrie van Houten; Jaap M. van Tuyl; Richard G. F. Visser; Paul Arens
Unfortunately, in the original version of this article part of Fig. 1 was deleted in the final proofing step. The missing part of Fig. 1 with linkage groups 18–24.(Figure presented.).
Plant Breeding | 2011
Arwa Shahin; Paul Arens; Adriaan W. van Heusden; Gerard van der Linden; Martijn van Kaauwen; Nadeem Khan; Henk J. Schouten; W. Eric van de Weg; Richard G. F. Visser; Jaap M. van Tuyl
Scientia Horticulturae | 2013
Nan Tang; Arwa Shahin; Paul Bijman; Jianjun Liu; Jaap M. van Tuyl; Paul Arens
Acta Horticulturae | 2012
M.J.M. Smulders; M. Vukosavljev; Arwa Shahin; W.E. van de Weg; Paul Arens
Insect Molecular Biology | 2009
Arwa Shahin; Paul Arens; S. van Heusden; J.M. van Tuyl