Dangzhi Zhao
University of Alberta
Network
Latest external collaboration on country level. Dive into details by clicking on the dots.
Publication
Featured researches published by Dangzhi Zhao.
Information Processing and Management | 2006
Dangzhi Zhao
The present study examines one of the fundamental aspects of author co-citation analysis (ACA): the way co-citation counts are defined. Co-citation counting provides the data on which all subsequent statistical analyses and mappings are based, and we compare ACA results based on two different types of co-citation counting: on the one hand, the traditional type that only counts the first one among a cited works authors, and on the other hand, a simplified approach to all-author co-citation counting that takes into account the first five authors of a cited work. Results indicate that the picture produced through this simplified all-author co-citation counting contains author groups that are more coherent, and is therefore considerably clearer. However, this picture represents fewer specialties in the research field being studied than that produced through the traditional first-author co-citation counting when the same number of top-ranked authors is selected and analyzed. Reasons for these effects are discussed. Variations of counting more than first authors are compared.
Scientometrics | 2002
Dangzhi Zhao; Elisabeth Logan
With the primary goal of exploring whether citation analysis using scientific papers found on the Web as a data source is a worthwhile means of studying scholarly communication in the new digital environment, the present case study examines the scholarly communication patterns in XML research revealed by citation analysis of ResearchIndex data and SCI data. Results suggest that citation analysis using scientific papers found on the Web as a data source has both advantages and disadvantages when compared with citation analysis of SCI data, but is nonetheless a valid method for evaluating scholarly contributions and for studying the intellectual structure in XML research.
Journal of the Association for Information Science and Technology | 2012
Andreas Strotmann; Dangzhi Zhao
In this article, we explore how strongly author name disambiguation (AND) affects the results of an author-based citation analysis study, and identify conditions under which the traditional simplified approach of using surnames and first initials may suffice in practice. We compare author citation ranking and cocitation mapping results in the stem cell research field from 2004 to 2009 using two AND approaches: the traditional simplified approach of using author surname and first initial and a sophisticated algorithmic approach. We find that the traditional approach leads to extremely distorted rankings and substantially distorted mappings of authors in this field when based on first- or all-author citation counting, whereas last-author-based citation ranking and cocitation mapping both appear relatively immune to the author name ambiguity problem. This is largely because Romanized names of Chinese and Korean authors, who are very active in this field, are extremely ambiguous, but few of these researchers consistently publish as last authors in bylines. We conclude that a more earnest effort is required to deal with the author name ambiguity problem in both citation analysis and information retrieval, especially given the current trend toward globalization. In the stem cell research field, in which laboratory heads are traditionally listed as last authors in bylines, last-author-based citation ranking and cocitation mapping using the traditional approach to author name disambiguation may serve as a simple workaround, but likely at the price of largely filtering out Chinese and Korean contributions to the field as well as important contributions by young researchers.
Scientometrics | 2010
Dangzhi Zhao
This paper reports on a bibliometric study of the characteristics and impact of research in the library and information science (LIS) field which was funded through research grant programs, and compares it with research that received no extra funding. Seven core LIS journals were examined to identify articles published in 1998 that acknowledge research grant funding. The distribution of these articles by various criteria (e.g., topic, affiliation, funding agency) was determined. Their impact as indicated by citation counts during 1998–2008 was evaluated against that of articles without acknowledging extra funding and published in the same journals in the same year using citation data collected from Scopus’ Citation Tracker. The impact of grant-funded research as measured by citation counts was substantially higher than that of other research, both overall and in each journal individually. Scholars from outside LIS core institutions contributed heavily to grant-funded research. The two highest-impact publications by far reported non-grant-based research, and grant-based funding of research reported in core LIS journals was biased towards the information retrieval (IR) area, particularly towards research on IR systems. The percentage of articles reporting grant-funded research was substantially higher in information-oriented journals than in library-focused ones.
Journal of Informetrics | 2008
Dangzhi Zhao; Andreas Strotmann
Although it is generally understood that different citation counting methods can produce quite different author rankings, and although “optimal” author co-citation counting methods have been identified theoretically, studies that compare author co-citation counting methods in author co-citation analysis (ACA) studies are still rare. The present study applies strict all-author-based ACA to the Information Science (IS) field, in that all authors of all cited references in a classic IS dataset are counted, and in that even the diagonal values of the co-citation matrix are computed in their theoretically optimal form. Using Scopus instead of SSCI as the data source, we find that results from a theoretically optimal all-author ACA appear to be excellent in practice, too, although in a field like IS where co-authorship levels are relatively low, its advantages over classic first-author ACA appear considerably smaller than in the more highly collaborative ones targeted before. Nevertheless, we do find some differences between the two approaches, in that first-author ACA appears to favor theorists who presumably tend to work alone, while all-author ACA appears to paint a somewhat more recent picture of the field, and to pick out some collaborative author clusters.
Journal of the Association for Information Science and Technology | 2011
Dangzhi Zhao; Andreas Strotmann
How can citation analysis take into account the highly collaborative nature and unique research and publication culture of biomedical research fields? This study explores this question by introducing last-author citation counting and comparing it with traditional first-author counting and theoretically optimal all-author counting in the stem cell research field for the years 2004–2009. For citation ranking, last-author counting, which is directly supported by Scopus but not by ISI databases, appears to approximate all-author counting quite well in a field where heads of research labs are traditionally listed as last authors; however, first author counting does not. For field mapping, we find that author co-citation analyses based on different counting methods all produce similar overall intellectual structures of a research field, but detailed structures and minor specialties revealed differ to various degrees and thus require great caution to interpret. This is true especially when authors are selected into the analysis based on citedness, because author selection is found to have a greater effect on mapping results than does choice of co-citation counting method. Findings are based on a comprehensive, high-quality dataset extracted in several steps from PubMed and Scopus and subjected to automatic reference and author name disambiguation.
association for information science and technology | 2014
Dangzhi Zhao; Andreas Strotmann
This study continues a long history of author cocitation analysis (and more recently, author bibliographic coupling analysis) of the intellectual structure of information science (IS) into the time period 2006 to 2010 (IS 2006–2010). We find that web technologies continue to drive developments, especially at the research front, although perhaps more indirectly than before. A broadening of perspectives is visible in IS 2006–2010, where network science becomes influential and where full‐text analysis methods complement traditional computer science influences. Research in the areas of the h‐index and mapping of science appears to have been highlights of IS 2006–2011. This study tests and confirms a forecast made previously by comparing knowledge‐base and research‐front findings for IS 2001–2005, which expected both the information retrieval (IR) systems and webometrics specialties to shrink in 2006 to 2010. A corresponding comparison of the knowledge base and research front of IS 2006–2010 suggests a continuing decline of the IR systems specialty in the near future, but also a considerable (re)growth of the webometrics area after a period of decline from 2001 to 2005 and 2006 to 2010, with the latter due perhaps in part to its contribution to an emerging web science.
Journal of Informetrics | 2010
Andreas Strotmann; Dangzhi Zhao
Field delimitation for citation analysis, the process of collecting a set of bibliographic records with cited-reference information of research articles that represent a research field, is the first step in any citation analysis study of a research field. Due to a number of limitations, the commercial citation indexes have long made it difficult to obtain a comprehensive dataset in this step. This paper discusses some of the limitations imposed by these databases, and reports on a method to overcome some of these limitations that was used with great success to delimit an emerging and highly interdisciplinary biomedical research field, stem cell research. The resulting field delimitation and the citation network it induces are both excellent. This multi-database method relies on using PubMed for the actual field delimitation, and on mapping between Scopus and PubMed records for obtaining comprehensive information about cited-references contained in the resulting literature. This method provides high-quality field delimitations for citation studies that can be used as benchmarks for studies of the impact of data collection biases on citation metrics, and may help improve confidence in results of scientometric studies for an increased impact of scientometrics on research policy.
Synthesis Lectures on Information Concepts, Retrieval, and Services | 2015
Dangzhi Zhao; Andreas Strotmann
Policy-based data management enables the creation of community-specific collections. Every collection is created for a purpose. e purpose defines the set of properties that will be associated with the collection.e properties are enforced bymanagement policies that control the execution of procedures that are applied whenever data are ingested or accessed. e procedures generate state information that defines the outcome of enforcing the management policy. e state information can be queried to validate assessment criteria and verify that the required collection properties have been conserved. e integrated Rule-Oriented Data System implements the data management framework required to support policy-based data management. Policies are turned into computer actionable Rules. Procedures are composed from a microservice-oriented architecture. e result is a highly extensible and tunable system that can enforce management policies, automate administrative tasks, and periodically validate assessment criteria. iRODS 4.0+ represents a major effort to analyze, harden, and package iRODS for sustainability, modularization, security, and testability. is has led to a fairly significant refactorization of much of the underlying codebase. iRODS has been modularized whereby existing iRODS 3.x functionality has been replaced and provided by small, interoperable plugins. e core is designed to be as immutable as possible and serve as a bus for handling the internal logic of the business of iRODS. Seven major interfaces have been exposed by the core and allow extensibility and separation of functionality into plugins.
Scientometrics | 2011
Dangzhi Zhao; Andreas Strotmann
This study is an attempt to approach the intellectual structure of the stem cell research field 2004–2009 through a comprehensive author co-citation analysis (ACA), and to contribute to a better understanding of a field that has been brought to the forefront of research, therapy and political and public debates, which, hopefully, will in turn better inform research and policy. Based on a nearly complete and clean dataset of stem cell literature compiled from PubMed and Scopus, and using automatic author disambiguation to further improve results, we perform an exclusive all-author ACA of the 200 top-ranked researchers of the field by fractional citation count. We find that, despite the theoretically highly interdisciplinary nature of the field, stem cell research has been dominated by a few central medical research areas—cancer and regenerative medicine of the brain, the blood, the skin, and the heart—and a core of cell biologists trying to understand the nature and the molecular biology of stem cells along with biotechnology researchers investigating the practical identification, isolation, creation, and culturing of stem cells. It is also remarkably self-contained, drawing only on a few related areas of cell biology. This study also serves as a baseline against which the effectiveness of a range of author-based bibliometric methods and indicators can be tested, especially when based on less comprehensive datasets using less optimal analysis methods.