Network


Latest external collaboration on country level. Dive into details by clicking on the dots.

Hotspot


Dive into the research topics where Ting-Yi Sung is active.

Publication


Featured researches published by Ting-Yi Sung.


IEEE Transactions on Parallel and Distributed Systems | 2000

Edge congestion and topological properties of crossed cubes

Chien-Ping Chang; Ting-Yi Sung; Lih-Hsing Hsu

An n-dimensional crossed cube, CQ/sub n/, is a variation of hypercubes. In this paper, we give a new shortest path routing algorithm based on a new distance measure defined herein. In comparison with Efes algorithm, which generates one shortest path in O(n/sup 2/) time, our algorithm can generate more shortest paths in O(n) time. Based on a given shortest path routing algorithm, we consider a new performance measure of interconnection networks called edge congestion. Using our shortest path routing algorithm and assuming that message exchange between all pairs of vertices is equally probable, we show that the edge congestion of crossed cubes is the same as that of hypercubes. Using the result of edge congestion, we can show that the bisection width of crossed cubes is 2/sup n-1/. We also prove that wide diameter and fault diameter are [n/2]+2. Furthermore, we study embedding of cycles in cross cubes and construct more types than previous work of cycles of length at least four.


Molecular & Cellular Proteomics | 2010

IDEAL-Q, an Automated Tool for Label-free Quantitation Analysis Using an Efficient Peptide Alignment Approach and Spectral Data Validation

Chih-Chiang Tsou; Chia-Feng Tsai; Ying-Hao Tsui; Putty-Reddy Sudhir; Yi-Ting Wang; Yu-Ju Chen; Jeou-Yuan Chen; Ting-Yi Sung; Wen-Lian Hsu

In this study, we present a fully automated tool, called IDEAL-Q, for label-free quantitation analysis. It accepts raw data in the standard mzXML format as well as search results from major search engines, including Mascot, SEQUEST, and X!Tandem, as input data. To quantify as many identified peptides as possible, IDEAL-Q uses an efficient algorithm to predict the elution time of a peptide unidentified in a specific LC-MS/MS run but identified in other runs. Then, the predicted elution time is used to detect peak clusters of the assigned peptide. Detected peptide peaks are processed by statistical and computational methods and further validated by signal-to-noise ratio, charge state, and isotopic distribution criteria (SCI validation) to filter out noisy data. The performance of IDEAL-Q has been evaluated by several experiments. First, a serially diluted protein mixed with Escherichia coli lysate showed a high correlation with expected ratios and demonstrated good linearity (R2 = 0.996). Second, in a biological replicate experiment on the THP-1 cell lysate, IDEAL-Q quantified 87% (1,672 peptides) of all identified peptides, surpassing the 45.7% (909 peptides) achieved by the conventional identity-based approach, which only quantifies peptides identified in all LC-MS/MS runs. Manual validation on all 11,940 peptide ions in six replicate LC-MS/MS runs revealed that 97.8% of the peptide ions were correctly aligned, and 93.3% were correctly validated by SCI. Thus, the mean of the protein ratio, 1.00 ± 0.05, demonstrates the high accuracy of IDEAL-Q without human intervention. Finally, IDEAL-Q was applied again to the biological replicate experiment but with an additional SDS-PAGE step to show its compatibility for label-free experiments with fractionation. For flexible workflow design, IDEAL-Q supports different fractionation strategies and various normalization schemes, including multiple spiked internal standards. User-friendly interfaces are provided to facilitate convenient inspection, validation, and modification of quantitation results. In summary, IDEAL-Q is an efficient, user-friendly, and robust quantitation tool. It is available for download.


BMC Bioinformatics | 2008

Predicting RNA-binding sites of proteins using support vector machines and evolutionary information

Cheng Wei Cheng; Emily Chia Yu Su; Jenn-Kang Hwang; Ting-Yi Sung; Wen-Lian Hsu

BackgroundRNA-protein interaction plays an essential role in several biological processes, such as protein synthesis, gene expression, posttranscriptional regulation and viral infectivity. Identification of RNA-binding sites in proteins provides valuable insights for biologists. However, experimental determination of RNA-protein interaction remains time-consuming and labor-intensive. Thus, computational approaches for prediction of RNA-binding sites in proteins have become highly desirable. Extensive studies of RNA-binding site prediction have led to the development of several methods. However, they could yield low sensitivities in trade-off for high specificities.ResultsWe propose a method, RNAProB, which incorporates a new smoothed position-specific scoring matrix (PSSM) encoding scheme with a support vector machine model to predict RNA-binding sites in proteins. Besides the incorporation of evolutionary information from standard PSSM profiles, the proposed smoothed PSSM encoding scheme also considers the correlation and dependency from the neighboring residues for each amino acid in a protein. Experimental results show that smoothed PSSM encoding significantly enhances the prediction performance, especially for sensitivity. Using five-fold cross-validation, our method performs better than the state-of-the-art systems by 4.90%~6.83%, 0.88%~5.33%, and 0.10~0.23 in terms of overall accuracy, specificity, and Matthews correlation coefficient, respectively. Most notably, compared to other approaches, RNAProB significantly improves sensitivity by 7.0%~26.9% over the benchmark data sets. To prevent data over fitting, a three-way data split procedure is incorporated to estimate the prediction performance. Moreover, physicochemical properties and amino acid preferences of RNA-binding proteins are examined and analyzed.ConclusionOur results demonstrate that smoothed PSSM encoding scheme significantly enhances the performance of RNA-binding site prediction in proteins. This also supports our assumption that smoothed PSSM encoding can better resolve the ambiguity of discriminating between interacting and non-interacting residues by modelling the dependency from surrounding residues. The proposed method can be used in other research areas, such as DNA-binding site prediction, protein-protein interaction, and prediction of posttranslational modification sites.


Mathematical Programming | 1991

An analytical comparison of different formulations of the travelling sales man problem

Manfred W. Padberg; Ting-Yi Sung

A transformation technique is proposed that permits one to derive the linear description of the imageX of a polyhedronZ under an affine linear transformation from the (given) linear description ofZ. This result is used to analytically compare various formulations of the asymmetric travelling salesman problem to the standard formulation due to Dantzig, Fulkerson and Johnson which are all shown to be “weaker formulations” in a precise setting. We also apply this transformation technique to “symmetrize” formulations and show, in particular, that the symmetrization of the standard asymmetric formulation results into the standard one for the symmetric version of the travelling salesman problem.


BMC Bioinformatics | 2006

Various criteria in the evaluation of biomedical named entity recognition

Richard Tzong-Han Tsai; Shih-Hung Wu; Wen-Chi Chou; Yu-Chun Lin; Ding He; Jieh Hsiang; Ting-Yi Sung; Wen-Lian Hsu

BackgroundText mining in the biomedical domain is receiving increasing attention. A key component of this process is named entity recognition (NER). Generally speaking, two annotated corpora, GENIA and GENETAG, are most frequently used for training and testing biomedical named entity recognition (Bio-NER) systems. JNLPBA and BioCreAtIvE are two major Bio-NER tasks using these corpora. Both tasks take different approaches to corpus annotation and use different matching criteria to evaluate system performance. This paper details these differences and describes alternative criteria. We then examine the impact of different criteria and annotation schemes on system performance by retesting systems participated in the above two tasks.ResultsTo analyze the difference between JNLPBAs and BioCreAtIvEs evaluation, we conduct Experiment 1 to evaluate the top four JNLPBA systems using BioCreAtIvEs classification scheme. We then compare them with the top four BioCreAtIvE systems. Among them, three systems participated in both tasks, and each has an F-score lower on JNLPBA than on BioCreAtIvE. In Experiment 2, we apply hypothesis testing and correlation coefficient to find alternatives to BioCreAtIvEs evaluation scheme. It shows that right-match and left-match criteria have no significant difference with BioCreAtIvE. In Experiment 3, we propose a customized relaxed-match criterion that uses right match and merges JNLPBAs five NE classes into two, which achieves an F-score of 81.5%. In Experiment 4, we evaluate a range of five matching criteria from loose to strict on the top JNLPBA system and examine the percentage of false negatives. Our experiment gives the relative change in precision, recall and F-score as matching criteria are relaxed.ConclusionIn many applications, biomedical NEs could have several acceptable tags, which might just differ in their left or right boundaries. However, most corpora annotate only one of them. In our experiment, we found that right match and left match can be appropriate alternatives to JNLPBA and BioCreAtIvEs matching criteria. In addition, our relaxed-match criterion demonstrates that users can define their own relaxed criteria that correspond more realistically to their application requirements.


Analytical Chemistry | 2014

Sequential phosphoproteomic enrichment through complementary metal-directed immobilized metal ion affinity chromatography.

Chia-Feng Tsai; Chuan-Chih Hsu; Jo-Nan Hung; Yi-Ting Wang; Wai-Kok Choong; Ming-Yao Zeng; Pei-Yi Lin; Ruo-Wei Hong; Ting-Yi Sung; Yu-Ju Chen

Methodologies to enrich heterogeneous types of phosphopeptides are critical for comprehensive mapping of the under-explored phosphoproteome. Taking advantage of the distinct binding affinities of Ga(3+) and Fe(3+) for phosphopeptides, we designed a metal-directed immobilized metal ion affinity chromatography for the sequential enrichment of phosphopeptides. In Raji B cells, the sequential Ga(3+)-Fe(3+)-immobilized metal affinity chromatography (IMAC) strategy displayed a 1.5-3.5-fold superior phosphoproteomic coverage compared to single IMAC (Fe(3+), Ti(4+), Ga(3+), and Al(3+)). In addition, up to 92% of the 6283 phosphopeptides were uniquely enriched in either the first Ga(3+)-IMAC (41%) or second Fe(3+)-IMAC (51%). The complementary properties of Ga(3+) and Fe(3+) were further demonstrated through the exclusive enrichment of almost all of 1214 multiply phosphorylated peptides (99.4%) in the Ga(3+)-IMAC, whereas only 10% of 5069 monophosphorylated phosphopeptides were commonly enriched in both fractions. The application of sequential Ga(3+)-Fe(3+)-IMAC to human lung cancer tissue allowed the identification of 2560 unique phosphopeptides with only 8% overlap. In addition to the above-mentioned mono- and multiply phosphorylated peptides, this fractionation ability was also demonstrated on the basic and acidic phosphopeptides: acidophilic phosphorylation sites were predominately enriched in the first Ga(3+)-IMAC (72%), while Pro-directed (85%) and basophilic (79%) phosphorylation sites were enriched in the second Fe(3+)-IMAC. This strategy provided complementary mapping of different kinase substrates in multiple cellular pathways related to cancer invasion and metastasis of lung cancer. Given the fractionation ability and ease of tip preparation of this Ga(3+)-Fe(3+)-IMAC, we propose that this strategy allows more comprehensive characterization of the phosphoproteome both in vitro and in vivo.


BMC Bioinformatics | 2007

Protein subcellular localization prediction based on compartment-specific features and structure conservation

Emily Chia Yu Su; Hua-Sheng Chiu; Allan Lo; Jenn-Kang Hwang; Ting-Yi Sung; Wen-Lian Hsu

BackgroundProtein subcellular localization is crucial for genome annotation, protein function prediction, and drug discovery. Determination of subcellular localization using experimental approaches is time-consuming; thus, computational approaches become highly desirable. Extensive studies of localization prediction have led to the development of several methods including composition-based and homology-based methods. However, their performance might be significantly degraded if homologous sequences are not detected. Moreover, methods that integrate various features could suffer from the problem of low coverage in high-throughput proteomic analyses due to the lack of information to characterize unknown proteins.ResultsWe propose a hybrid prediction method for Gram-negative bacteria that combines a one-versus-one support vector machines (SVM) model and a structural homology approach. The SVM model comprises a number of binary classifiers, in which biological features derived from Gram-negative bacteria translocation pathways are incorporated. In the structural homology approach, we employ secondary structure alignment for structural similarity comparison and assign the known localization of the top-ranked protein as the predicted localization of a query protein. The hybrid method achieves overall accuracy of 93.7% and 93.2% using ten-fold cross-validation on the benchmark data sets. In the assessment of the evaluation data sets, our method also attains accurate prediction accuracy of 84.0%, especially when testing on sequences with a low level of homology to the training data. A three-way data split procedure is also incorporated to prevent overestimation of the predictive performance. In addition, we show that the prediction accuracy should be approximately 85% for non-redundant data sets of sequence identity less than 30%.ConclusionOur results demonstrate that biological features derived from Gram-negative bacteria translocation pathways yield a significant improvement. The biological features are interpretable and can be applied in advanced analyses and experimental designs. Moreover, the overall accuracy of combining the structural homology approach is further improved, which suggests that structural conservation could be a useful indicator for inferring localization in addition to sequence homology. The proposed method can be used in large-scale analyses of proteomes.


BMC Bioinformatics | 2007

BIOSMILE: A semantic role labeling system for biomedical verbs using a maximum-entropy model with automatically generated template features

Richard Tzong-Han Tsai; Wen-Chi Chou; Ying-Shan Su; Yu-Chun Lin; Cheng-Lung Sung; Hong-Jie Dai; Irene Tzu-Hsuan Yeh; Wei Ku; Ting-Yi Sung; Wen-Lian Hsu

BackgroundBioinformatics tools for automatic processing of biomedical literature are invaluable for both the design and interpretation of large-scale experiments. Many information extraction (IE) systems that incorporate natural language processing (NLP) techniques have thus been developed for use in the biomedical field. A key IE task in this field is the extraction of biomedical relations, such as protein-protein and gene-disease interactions. However, most biomedical relation extraction systems usually ignore adverbial and prepositional phrases and words identifying location, manner, timing, and condition, which are essential for describing biomedical relations. Semantic role labeling (SRL) is a natural language processing technique that identifies the semantic roles of these words or phrases in sentences and expresses them as predicate-argument structures. We construct a biomedical SRL system called BIOSMILE that uses a maximum entropy (ME) machine-learning model to extract biomedical relations. BIOSMILE is trained on BioProp, our semi-automatic, annotated biomedical proposition bank. Currently, we are focusing on 30 biomedical verbs that are frequently used or considered important for describing molecular events.ResultsTo evaluate the performance of BIOSMILE, we conducted two experiments to (1) compare the performance of SRL systems trained on newswire and biomedical corpora; and (2) examine the effects of using biomedical-specific features. The experimental results show that using BioProp improves the F-score of the SRL system by 21.45% over an SRL system that uses a newswire corpus. It is noteworthy that adding automatically generated template features improves the overall F-score by a further 0.52%. Specifically, ArgM-LOC, ArgM-MNR, and Arg2 achieve statistically significant performance improvements of 3.33%, 2.27%, and 1.44%, respectively.ConclusionWe demonstrate the necessity of using a biomedical proposition bank for training SRL systems in the biomedical domain. Besides the different characteristics of biomedical and newswire sentences, factors such as cross-domain framesets and verb usage variations also influence the performance of SRL systems. For argument classification, we find that NE (named entity) features indicating if the target node matches with NEs are not effective, since NEs may match with a node of the parsing tree that does not have semantic role labels in the training set. We therefore incorporate templates composed of specific words, NE types, and POS tags into the SRL system. As a result, the classification accuracy for adjunct arguments, which is especially important for biomedical SRL, is improved significantly.


Journal of Proteome Research | 2010

An informatics-assisted label-free quantitation strategy that depicts phosphoproteomic profiles in lung cancer cell invasion

Yi Ting Wang; Chia Feng Tsai; Tzu Chan Hong; Chih Chiang Tsou; Pei Yi Lin; Szu Hua Pan; Tse-Ming Hong; Pan-Chyr Yang; Ting-Yi Sung; Wen-Lian Hsu; Yu-Ju Chen

Aberrant protein phosphorylation plays important roles in cancer-related cell signaling. With the goal of achieving multiplexed, comprehensive, and fully automated relative quantitation of site-specific phosphorylation, we present a simple label-free strategy combining an automated pH/acid-controlled IMAC procedure and informatics-assisted SEMI (sequence, elution time, mass-to-charge, and internal standard) algorithm. The SEMI strategy effectively increased the number of quantifiable peptides more than 4-fold in replicate experiments (from 262 to 1171, p < 0.05, false discovery rate = 0.46%) by using a fragmental regression algorithm for elution time alignment followed by peptide cross-assignment in all LC-MS/MS runs. In addition, the strategy demonstrated good quantitation accuracy (10-12%) for standard phosphoprotein and variation less than 1.9 fold (within 99% confidence range) in proteome scale and reliable linear quantitation correlation (R(2) = 0.99) with 4000-fold dynamic concentrations, which was attributed to our reproducible experimental procedure and informatics-assisted peptide alignment tool to minimize system variations. In an attempt to explore metastasis-associated phosphoproteomic alterations in lung cancer, this approach was used to delineate differential phosphoproteomic profiles of a lung cancer metastasis model. Without sample fractionation, the SEMI algorithm enabled quantification of 1796 unique phosphopeptides (false discovery rate = 0.56%) corresponding to 854 phosphoproteins from a series of non-small cell lung cancer lines with varying degrees of in vivo invasiveness. Nearly 40% of the phosphopeptides showed >2-fold change in highly invasive cells; validation of phosphoprotein subsets by Western blotting not only demonstrated the consistency of data obtained by our SEMI strategy but also revealed that such dramatic changes in the phosphoproteome result mostly from translational or post-translational regulation. Mapping of these differentially expressed phosphoproteins in multiple cellular pathways related to cancer invasion and metastasis suggests that the site and degree of phosphorylation might have distinct patterns or functions in the complex process of cancer progression.


Bioinformatics | 2009

Predicting helix–helix interactions from residue contacts in membrane proteins

Allan Lo; Yi-Yuan Chiu; Einar Andreas Rødland; Ping-Chiang Lyu; Ting-Yi Sung; Wen-Lian Hsu

Motivation: Helix–helix interactions play a critical role in the structure assembly, stability and function of membrane proteins. On the molecular level, the interactions are mediated by one or more residue contacts. Although previous studies focused on helix-packing patterns and sequence motifs, few of them developed methods specifically for contact prediction. Results: We present a new hierarchical framework for contact prediction, with an application in membrane proteins. The hierarchical scheme consists of two levels: in the first level, contact residues are predicted from the sequence and their pairing relationships are further predicted in the second level. Statistical analyses on contact propensities are combined with other sequence and structural information for training the support vector machine classifiers. Evaluated on 52 protein chains using leave-one-out cross validation (LOOCV) and an independent test set of 14 protein chains, the two-level approach consistently improves the conventional direct approach in prediction accuracy, with 80% reduction of input for prediction. Furthermore, the predicted contacts are then used to infer interactions between pairs of helices. When at least three predicted contacts are required for an inferred interaction, the accuracy, sensitivity and specificity are 56%, 40% and 89%, respectively. Our results demonstrate that a hierarchical framework can be applied to eliminate false positives (FP) while reducing computational complexity in predicting contacts. Together with the estimated contact propensities, this method can be used to gain insights into helix-packing in membrane proteins. Availability: http://bio-cluster.iis.sinica.edu.tw/TMhit/ Contact: [email protected] Supplementary information:Supplementary data are available at Bioinformatics online.

Collaboration


Dive into the Ting-Yi Sung's collaboration.

Top Co-Authors

Avatar
Top Co-Authors

Avatar
Top Co-Authors

Avatar

Lih-Hsing Hsu

National Chiao Tung University

View shared research outputs
Top Co-Authors

Avatar
Top Co-Authors

Avatar
Top Co-Authors

Avatar
Top Co-Authors

Avatar

Ching-Tai Chen

National Chiao Tung University

View shared research outputs
Top Co-Authors

Avatar

Jeng-Jung Wang

National Chiao Tung University

View shared research outputs
Top Co-Authors

Avatar

Chun-Nan Hung

National Chiao Tung University

View shared research outputs
Top Co-Authors

Avatar

Tung-Yang Ho

National Chiao Tung University

View shared research outputs
Researchain Logo
Decentralizing Knowledge