Edited by: L. J. Muhammad, Federal University Kashere, Nigeria
Reviewed by: Sumayah Al-Mahmood, Al-Iraqia University, Iraq; Kun Fang, First Affiliated Hospital of Harbin Medical University, China
*Correspondence: Yexi Chen,
This article was submitted to Breast Cancer, a section of the journal Frontiers in Oncology
This is an open-access article distributed under the terms of the Creative Commons Attribution License (CC BY). The use, distribution or reproduction in other forums is permitted, provided the original author(s) and the copyright owner(s) are credited and that the original publication in this journal is cited, in accordance with accepted academic practice. No use, distribution or reproduction is permitted which does not comply with these terms.
Recently, ctDNA has become the focus for scientists with respect to personalized treatment, early screening, precise diagnosis, and prognosis of BC. This paper aims to use bibliometric analysis to investigate the research status and future trends in this field.
All the related literature in the field of ctDNA and breast cancer was gathered from the Web of Science Core Collection. Data analyses were performed with the R package bibliometrix, VOSviewer 1.6.18, and online analysis in WoS. IBM SPSS (version 26.0) was used for statistical analysis.
A total of 739 publications, including 472 articles and 267 reviews, were retrieved. The overall number of articles published showed an upward trend. The United States has the largest number of published articles (266 papers) and citations (20,225 times). The most productive journal was Clinical Cancer Research. Cristofanilli M was the most prolific author, while Carlos C was the most cited one. The most frequent keywords excluding the search subject were “liquid biopsy”, “plasma”, “mutations”, “metastatic breast cancer”, “acquired resistance”.
This article explored the application value of ctDNA in breast cancer with bibliometric analysis, offering an overall and intuitive understanding of this topic and revealing the study trends in the past ten years.
Breast cancer (BC) has been proven to be the most common malignancy with the highest mortality rate in women worldwide (
Circulating tumor DNA (ctDNA) is a cancer-derived blood biomarker and also the widely studied circulating biomarker applied to liquid biopsy till now (
Bibliometrics is defined as the quantitative assessment of scientific outputs within a particular field using statistical methods (
Although ctDNA in BC has been widely discussed by researchers, no bibliometric analysis has yet comprehensively investigated the current progress, hotspots, and challenges in this area. This paper therefore uses scientometric analysis to examine the publications, authors, countries, journals, and cited references in the field of ctDNA and BC over the last decade, summarizing the hotspots and novel trends in this domain and providing references for future research directions.
Subject words were searched using the Medical Subject Headings (MeSH) database of PubMed (
Flowchart of data collection and analysis.
With the “analyze results” function in WoS, we performed a preliminary analysis of the retrieved literature, including author, publication year, document type, research direction, country, affiliated institution, and publisher. The “citation report” function was used to obtain the number and citation frequency of each paper.
Then, the information of the literature retrieved by WoS was exported in plain txt format and imported into the tools of biblioshiny (the shiny app for bibliometrix) and VOSviewer for further analysis of annual production, authors, countries, journals, institutions, keywords, and citations.
The bibliometrix package is scientometric analysis software written in R. In this study, we used the biblioshiny web interface with R (version 4.1.0) to analyze the bibliographic information of the retrieved literature. The overview of basic information covered the time span, annual growth rate, average citations per document, document contents, document types, etc. Descriptive analyses and visual presentations of the annual production, average citations per year, contributions of countries, organizations, and journals, and authors’ publications and impact were obtained. Author impact was evaluated by the Hirsch h-index, g-index, and m-index. The h-index indicates the impact of a researcher’s scientific output and is defined as the largest number h such that the researcher has published h articles that have each been cited at least h times (
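The three author-impact indices named above can be sketched in a few lines. This is a minimal illustration and not the bibliometrix implementation: the function names and the sample citation counts are hypothetical, the g-index is capped at the number of papers, and the m-index is assumed to be the h-index divided by the number of years since the author's first publication, counted inclusively.

```python
def h_index(citations):
    """Largest h such that h papers have at least h citations each."""
    ranked = sorted(citations, reverse=True)
    h = 0
    for rank, cites in enumerate(ranked, start=1):
        if cites >= rank:
            h = rank
        else:
            break
    return h


def g_index(citations):
    """Largest g such that the top g papers together have at least g*g citations.

    Capped at the number of papers (no zero-citation padding).
    """
    ranked = sorted(citations, reverse=True)
    cumulative = 0
    g = 0
    for rank, cites in enumerate(ranked, start=1):
        cumulative += cites
        if cumulative >= rank * rank:
            g = rank
    return g


def m_index(h, first_year, current_year):
    """h-index divided by years of activity (inclusive count; an assumption)."""
    years_active = current_year - first_year + 1
    return h / years_active


# Hypothetical citation counts for one author's papers.
sample_citations = [10, 8, 5, 4, 3]
print("h-index:", h_index(sample_citations))
print("g-index:", g_index(sample_citations))
print("m-index:", m_index(15, 2015, 2022))
```

Under this assumed definition, an h-index of 15 with a first paper in 2015, evaluated in 2022, gives an m-index of 1.875, which agrees with the value reported for Turner NC in the author table.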
VOSviewer (version 1.6.18), a computer program for constructing and visualizing bibliometric maps in an intuitive and intelligible view (
IBM SPSS (version 26.0) was used for statistical analysis. We set the data type as numerical variables and used Spearman correlation analysis to analyze the correlations between selected variables. All the tests were two-sided, and p-values < 0.05 were considered statistically significant.
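For readers without SPSS, the Spearman coefficient itself can be reproduced with the standard library: rank both variables (averaging tied ranks) and take the Pearson correlation of the ranks. The sketch below is illustrative only. The pairing of variables (number of publications versus total citations for the ten countries in the country table of this article) is an assumption, since the text does not state which variables were correlated, and the two-sided p-value is not computed here.

```python
from math import sqrt


def average_ranks(values):
    """Rank values from 1..n, giving tied values the mean of their ranks."""
    order = sorted(range(len(values)), key=lambda i: values[i])
    ranks = [0.0] * len(values)
    i = 0
    while i < len(order):
        j = i
        # Extend j over the run of values tied with position i.
        while j + 1 < len(order) and values[order[j + 1]] == values[order[i]]:
            j += 1
        mean_rank = (i + j) / 2 + 1
        for k in range(i, j + 1):
            ranks[order[k]] = mean_rank
        i = j + 1
    return ranks


def pearson(x, y):
    """Pearson correlation coefficient of two equal-length sequences."""
    n = len(x)
    mean_x = sum(x) / n
    mean_y = sum(y) / n
    cov = sum((a - mean_x) * (b - mean_y) for a, b in zip(x, y))
    sd_x = sqrt(sum((a - mean_x) ** 2 for a in x))
    sd_y = sqrt(sum((b - mean_y) ** 2 for b in y))
    return cov / (sd_x * sd_y)


def spearman_rho(x, y):
    """Spearman rank correlation: Pearson correlation of the ranks."""
    return pearson(average_ranks(x), average_ranks(y))


# Values copied from the top-10 country table (illustrative pairing).
production = [266, 73, 86, 31, 59, 57, 148, 20, 7, 15]
total_citations = [20225, 8377, 7913, 7338, 6856, 4448, 3387, 3338, 3052, 2882]
rho = spearman_rho(production, total_citations)
print(f"Spearman rho = {rho:.3f}")
```

In practice a statistics package would also report the p-value needed for the significance threshold described above.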
A total of 739 documents contributing to this research field from 2012 to 2021 were obtained from WoS.
Annual publication and average article citations per year of ctDNA in the field of breast cancer from 2012 to 2021.
Among all the retrieved articles, a total of 37,987 citations were received and the average citation frequency of each article was 51.4 times.
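The summary figures for this period can be checked with simple arithmetic. The sketch below recomputes the mean citations per document and a compound annual growth rate from the counts reported in this article (8 publications in 2012 and 131 in 2021). The function names are hypothetical, and the compound-growth formula is an assumption that may differ from how biblioshiny computes its annual growth rate.

```python
def mean_citations(total_citations, n_documents):
    """Average number of citations per document."""
    return total_citations / n_documents


def compound_annual_growth(first_count, last_count, first_year, last_year):
    """Compound annual growth rate between two yearly publication counts."""
    periods = last_year - first_year
    return (last_count / first_count) ** (1 / periods) - 1


avg = mean_citations(37987, 739)
growth = compound_annual_growth(8, 131, 2012, 2021)
print(f"Mean citations per document: {avg:.1f}")
print(f"Compound annual growth rate: {growth:.1%}")
```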
Country analysis helps to reveal the geographical distribution of relevant papers in the field. A total of 52 countries were involved in the 739 retrieved documents contributing to this research area. The top 10 countries ranked by total citations are listed in
Top 10 cited countries contributing to this research area.
Country | Production | % of papers | Total citations | TC/P |
---|---|---|---|---|
USA | 266 | 35.994 | 20225 | 76.033 |
UK | 73 | 9.878 | 8377 | 114.753 |
Italy | 86 | 11.637 | 7913 | 92.012 |
Australia | 31 | 4.194 | 7338 | 236.710 |
Germany | 59 | 7.984 | 6856 | 116.203 |
France | 57 | 7.713 | 4448 | 78.035 |
China | 148 | 20.027 | 3387 | 22.885 |
South Korea | 20 | 2.706 | 3338 | 166.9 |
Sweden | 7 | 0.947 | 3052 | 436 |
Brazil | 15 | 2.030 | 2882 | 192.133 |
TC, the total number of citations a country has received.
TC/P, the average number of citations per production in a country.
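The two derived columns of the table follow directly from the raw counts: the share of papers is production divided by the 739 retrieved documents, and TC/P is total citations divided by production. The short sketch below recomputes both. The variable and function names are hypothetical, and small differences in the last decimal place from the printed table are possible because of rounding.

```python
TOTAL_DOCUMENTS = 739  # number of retrieved documents reported in this article

# (country, production, total citations) copied from the table above.
rows = [
    ("USA", 266, 20225),
    ("UK", 73, 8377),
    ("Italy", 86, 7913),
    ("Australia", 31, 7338),
    ("Germany", 59, 6856),
    ("France", 57, 4448),
    ("China", 148, 3387),
    ("South Korea", 20, 3338),
    ("Sweden", 7, 3052),
    ("Brazil", 15, 2882),
]


def derive_columns(rows, total_documents):
    """Return {country: (share of papers in %, citations per paper)}."""
    result = {}
    for country, papers, citations in rows:
        share = 100 * papers / total_documents
        per_paper = citations / papers
        result[country] = (share, per_paper)
    return result


derived = derive_columns(rows, TOTAL_DOCUMENTS)
for country, (share, per_paper) in derived.items():
    print(f"{country:12s} {share:7.3f} {per_paper:9.3f}")
```

The shares sum to more than 100% because a paper with authors from several countries is counted once for each of them.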
VOSviewer was then used to conduct a co-authorship analysis of countries, with the threshold set at 3 documents. The size of a node represents the number of publications from that country: the larger the node, the more the country produced. A link between nodes represents cooperation between countries: the thicker the line, the closer the cooperation. The United States was clearly the central country in this field and cooperated closely with China, Italy, the UK, France, and Germany (
Co-authorships analysis of countries in network visualization map.
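The quantities behind such a map can be illustrated with a small counting sketch. It assumes full counting, in which every pair of countries on the same paper gains one unit of link strength, and takes a country's total link strength to be the sum of its link weights. The document list is hypothetical toy data and the function is not VOSviewer's implementation; only the minimum-document threshold of 3 is taken from the text.

```python
from collections import Counter
from itertools import combinations


def coauthorship_network(doc_countries, min_docs=3):
    """Count documents per country, pairwise links, and total link strength.

    doc_countries: one list of country names per document.
    Countries with fewer than min_docs documents are dropped from the network.
    """
    doc_counts = Counter()
    for countries in doc_countries:
        doc_counts.update(set(countries))

    kept = {c for c, n in doc_counts.items() if n >= min_docs}

    links = Counter()
    for countries in doc_countries:
        present = sorted(set(countries) & kept)
        for a, b in combinations(present, 2):
            links[(a, b)] += 1

    strength = Counter()
    for (a, b), weight in links.items():
        strength[a] += weight
        strength[b] += weight
    return doc_counts, links, strength


# Hypothetical toy data: each inner list is the set of countries on one paper.
documents = [
    ["USA", "China"],
    ["USA", "UK"],
    ["USA", "China", "Italy"],
    ["China"],
    ["UK", "Italy"],
    ["USA"],
    ["Italy", "UK"],
    ["Sweden", "USA"],
]

doc_counts, links, strength = coauthorship_network(documents, min_docs=3)
for pair, weight in sorted(links.items()):
    print(pair, weight)
print(dict(strength))
```

In this toy example Sweden appears on only one paper, so it falls below the threshold and is excluded from the network, just as low-output countries are excluded from the map.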
With a threshold of 5 documents per organization in the co-authorship analysis of organizations, 91 of the 1375 organizations met the threshold.
Co-authorships analysis of organizations in network visualization map.
Top 10 organizations based on publications.
Organization | Publication | Total citations | Total link strength | Country |
---|---|---|---|---|
University of Texas MD Anderson Cancer Center | 28 | 3093 | 68 | USA |
Harvard Medical School | 26 | 1382 | 55 | USA |
Institute of Cancer Research | 23 | 2695 | 68 | UK |
Memorial Sloan Kettering Cancer Center | 22 | 6219 | 67 | USA |
Royal Marsden Hospital | 21 | 2608 | 65 | UK |
IRCCS | 18 | 636 | 29 | Italy |
Northwestern University | 18 | 539 | 37 | USA |
Cambridge University | 17 | 4085 | 42 | UK |
Dana-Farber Cancer Institute | 15 | 1710 | 44 | USA |
Vrije Universiteit Brussel | 15 | 957 | 30 | Belgium |
The analysis of journals in related research fields can help researchers choose where to submit. There are 300 journals active in the field. With the minimum number of documents per source set to 5, 35 of them met the threshold, and the visualization of the source analysis was obtained (
Bibliographic coupling analysis of sources in this field.
From the overlay visualization map of sources analysis in VOSviewer (
There are 46 research categories and 70 publishers involved in the retrieved literature. The top 5 subject categories and publishers based on the number of publications are listed in
In total, 5017 authors contributed to the 739 documents, of which 15 were single-authored. The average number of authors per document was 6.78, and the average number of co-authors per document was 9.56. Cristofanilli M from the US was the most prolific author (19 articles), followed by Turner NC from the UK (n=17) and Ma F from the US (n=16) (
Top 10 most productive authors in this field.
Highly published Authors | NP | TC | h-index | g-index | m-index |
---|---|---|---|---|---|
CRISTOFANILLI M | 18 | 999 | 11 | 18 | 1.571 |
TURNER NC | 17 | 2491 | 15 | 17 | 1.875 |
MA F | 16 | 306 | 9 | 16 | 1.286 |
BIDARD FC | 15 | 870 | 10 | 15 | 1 |
GUAN YF | 15 | 282 | 9 | 15 | 1.286 |
PIERGA JY | 15 | 541 | 9 | 15 | 1 |
GARCIA-MURILLAS I | 14 | 1715 | 13 | 14 | 1.625 |
XU BH | 14 | 284 | 8 | 14 | 1.143 |
YI ZB | 14 | 298 | 9 | 14 | 1.286 |
YI X | 13 | 333 | 9 | 13 | 1.286 |
NP: number of publications.
TC: total citations.
Co-authorship analysis between authors was conducted using VOSviewer, and 79 co-authors wrote more than 5 articles. Preserving the largest set of 67 connected items, we obtained the visualization map (
Co-authorship analysis between authors in network visualization.
We used bibliometrix to analyze the specific information of the top 10 documents in this field (
The co-citation analysis of cited references in VOSviewer showed that 215 of the 24894 co-cited references had been cited at least 20 times. In the network visualization, the cited references are divided into four clusters, and a larger node represents a more frequently cited reference (
Co-citation analysis of cited references in this research field.
Setting the occurrence frequency to more than 5 yielded 291 keywords for keyword analysis. Using the clustering function of VOSviewer, all keywords were divided into four clusters shown in different colors (
Words or phrases that frequently appear in the titles of an article’s cited references were extracted and defined as Keywords Plus, which are used to identify research topics more descriptively and objectively (
Early diagnosis, effective recurrence detection, and therapeutic evaluation of cancers remain a global challenge despite the progress made in treatment (
The research scope of this paper is set from 2012 to 2021. In 2012, only 8 articles were published in the research field of ctDNA and BC. Since 2015, the number of articles published in this field has grown rapidly, reaching 103 articles in 2017, exceeding 100 articles every year since then, and peaking at 131 articles in 2021. This shows a burst of interest in and sustained attention to this field. As of the retrieval date, the papers analyzed in this study had been cited 37,987 times, an average of 51.4 times per paper. Since 2012, the number of citations has increased 258-fold, and the literature published in 2014 has the highest number of citations per paper, reflecting the increasing attention to and importance of this field. This also suggests that the articles published in 2014 laid an important foundation for the development of this field. Among the top 10 most cited documents, 90% verified that ctDNA has important clinical significance for the identification of early cancer (
In total, 52 countries contributed to the field. The United States is undoubtedly the dominant country in this field, with the largest total number of publications, total number of citations and total link strength. Further analysis displayed that although the number of articles published in the UK and Italy was less than that in China, the number of citations and the total link strength of their articles were significantly higher than that of China, indicating that the academic influence of a country is not only reflected in the number of articles published, but also needs to be comprehensively evaluated by the number of citations, cooperation with other countries and link strength. China should encourage innovative inventions and research, not just the quantity of published literature.
Among the top 10 of the 300 journals by number of publications, Clinical Cancer Research and Oncotarget were the first choices for many researchers submitting articles five years ago. In the past two years, Cancers and Frontiers in Oncology have become the focus of submissions, with a sharply rising number of publications. Meanwhile, in the past three years, breast cancer-specific journals such as Breast Cancer Research and Treatment, NPJ Breast Cancer, and Breast Cancer Research have also published a substantial volume of work in this field. It can be speculated that the relationship between ctDNA and BC is attracting growing interest from these specialized breast cancer journals.
Cristofanilli M (n=19), Turner NC (n=17), Ma F (n=16), Bidard FC (n=15), Guan YF (n=15) were top five authors by number of publications. Besides, Turner NC had the highest h-index. His first paper on this subject was published in 2015, which demonstrated that targeted sequencing of ctDNA can be used to detect minimal residual disease in BC and thus more accurately predict genetic events of metastatic recurrence (
Keyword analysis can be used to present emerging topics and predict future research directions in this field. From the perspective of keyword analysis, the research field of ctDNA and BC is mainly divided into three parts. At the early stage, the development of ctDNA to detect and monitor tumor burden has been widely discussed and it is indicated that ctDNA is feasible to be used to detect tumor dynamics in some solid cancer patients (
“Cell-free DNA” (cfDNA) is thought to be released through apoptosis and necrosis, and the Kirsten rat sarcoma viral oncogene (KRAS) mutations detected in cfDNA were found to be of tumor origin, giving rise to the term ‘ctDNA’ (
The keywords “ESR1 mutations”, “survival”, and “resistance” are closely related to each other. “ESR1 mutations” are common in patients with ER-positive MBC during aromatase inhibitor (AI) treatment, especially after the development of hormone resistance (
The present study has several limitations. Firstly, we only used the Web of Science database to search for related publications, which may have excluded some influential papers indexed in other databases, such as PubMed, Embase, and Scopus, resulting in selection bias. However, WoS is still the most extensively used tool for bibliometric analysis, with rich information on the distribution of authors, countries, journals, organizations, and citations. Secondly, the search was limited to publications in English, so high-impact articles written in other languages may have been missed. Thirdly, articles published this year were not included in the study, so the results apply only up to the retrieval date of August 28, 2022. Despite these limitations, we believe that our findings can provide valuable guidance for researchers on future development in this field.
In this study, we explored the application value of ctDNA in breast cancer with bibliometric analysis, offering an overall and intuitive understanding of this topic and revealing the study trends in the past ten years. Publications related to this field showed a rapidly upward trend. The keyword analysis indicated that the current focus of this field may be the detection of genetic mutations in ctDNA to predict disease progression and treatment effectiveness in BC. Further clinical trials of ctDNA, together with the standardization and clinical feasibility of liquid biopsy detection, deserve future attention.
The original contributions presented in the study are included in the article/
JC, ZL, and YC participated in the conception and design of the study. QY and HT collected the data. JC and LC provided the guiding suggestions on application usage and data analysis. ZJ, JW, and DZ were involved in data analysis and drafted the manuscript. ZJ was the major contributor to writing the article. LC revised the final version of the manuscript. All authors contributed to the article and approved the submitted version.
This work was supported by the Special Fund Project of Guangdong Science and Technology (210728156901524, 210728156901519), Medical Scientific Research Foundation of Guangdong Province, China (grant number A2021432), Shantou Medical Science and Technology Planning Project (grant number 210521236491457, 210625106490696, 220518116490772, 220518116490933).
The authors thank those individuals who provided software usage guidance, language help, writing assistance, and proofreading of the article.
The authors declare that the research was conducted in the absence of any commercial or financial relationships that could be construed as a potential conflict of interest.
All claims expressed in this article are solely those of the authors and do not necessarily represent those of their affiliated organizations, or those of the publisher, the editors and the reviewers. Any product that may be evaluated in this article, or claim that may be made by its manufacturer, is not guaranteed or endorsed by the publisher.
The Supplementary Material for this article can be found online at:
BC | breast cancer |
ctDNA | circulating tumor DNA |
CGP | cancer genome project |
NGS | next-generation sequencing |
CRC | colorectal cancer |
HCC | hepatocellular carcinoma |
MeSH | Medical Subject Headings |
WoS | Web of Science |
SCI-EXPANDED | Science Citation Index Expanded |
SSCI | Social Sciences Citation Index |
ESCI | Emerging Sources Citation Index |
CCR-EXPANDED | Current Chemical Reactions |
IC | Index Chemicus |
h-index | Hirsch index |
Tam-Seq | tagged-amplicon deep sequencing |
MBC | metastatic breast cancer |
ESR1 | estrogen receptor alpha gene |
cfDNA | cell-free DNA |
KRAS | kirsten rat sarcoma viral oncogene |
AI | aromatase inhibitors |
pCR | pathological complete response |