PubTrends

PubTrends is a powerful tool designed to help researchers navigate the ever-growing flow of scientific publications.
It accelerates trend analysis and the discovery of breakthrough papers, offering a streamlined approach to staying ahead in your field.

With PubTrends, you can:
Datasets

Workflow

The workflow diagram below illustrates how PubTrends operates, offering a seamless path from data collection to meaningful insights.

Example

To demonstrate PubTrends in action, we showcase an analysis for the predefined search query "human aging".

Papers

The Papers section offers a comprehensive overview of the field, featuring the total number of articles, extracted topics, and a word cloud of frequently used terms in titles and abstracts (clickable for accessing related documents). It also includes a yearly summary of published papers and an option to view papers in a list format.

Top Cited Papers

The Top Cited Papers visualization presents an interactive breakdown of the most-cited works, categorized by citation count and article type, with distinct colors representing various types.

Hot Papers

The Hot Papers plot identifies the most impactful papers by highlighting those with the highest citation counts for each year.

Keywords

The Keywords Frequency plot displays the most common terms used in papers and their evolution over time.
You can hover over a keyword to see the exact number of papers in which it appears.

Network

Topics are closely related groups of documents. Aggregated graph and text embeddings are used to find similar papers and detect topics.
Overall structure of topics within a research field can be visualised as a papers similarity graph.


Interactive Visualization

The displayed graph represents the overall structure of the research field, but the dedicated graph explorer offers a more comprehensive experience.
It allows papers to be color-coded by year or topic and provides advanced search and filtering options using available metadata.
In the screenshot, papers are color-coded by topic, only papers with the word "methylation" are shown.


Topics

Topics are identified through hierarchical clustering of paper embeddings.
Users can explore the topic hierarchy using a dendrogram in a dedicated plot.


Topics by Year

The Topics by Year plot displays the distribution of papers across different topics, highlighting the number of papers for each topic annually.
Each topic is summarized with key keywords and its overall share of total papers.

Topic Keywords

For each topic, the application displays a familiar word cloud and an articles plot.
The word cloud is generated using terms specific to the selected topic, highlighting their significance compared to other topics.
The size of each word reflects its importance, based on the proportion of papers mentioning it.

Materials

Code

Source code is available on GitHub at https://github.com/JetBrains-Research/PubTrends.