Usage of OpenAlex for creating meaningful global overlay maps of science on the individual and institutional levels

Read original: arXiv:2404.02732 - Published 4/4/2024 by Robin Haunschild, Lutz Bornmann

Overview

  • Researchers have developed a method to create global overlay maps that visualize scientific performance data from individual researchers, institutions, or countries.
  • The paper proposes a procedure for creating these overlay maps using the OpenAlex database.
  • Six different base maps are provided, and example overlay maps are shown for an individual researcher and their research institution.
  • The paper discusses a method for normalizing the overlay data and compares overlay maps using raw and normalized data.
  • The advantages and limitations of the proposed overlay approach are discussed.

Plain English Explanation

Visualizing scientific data can help researchers and institutions better understand research performance and trends. Global overlay maps of science are one way to do this: they take a base map of science (a fixed layout of research fields or concepts) and overlay specific data on top, such as the field-specific paper output of a particular scientist or institution.

The researchers in this paper propose a procedure for creating these global overlay maps using OpenAlex, an open database of scientific publications and their metadata. They provide six different base maps that can serve as the foundation for the overlays.

As an example, the researchers show overlay maps for an individual researcher (the first author of this paper) and his research institution. These maps display scientific performance data, such as field-specific paper output, on top of one of the base maps. The researchers also propose a method for normalizing the overlay data.

The paper reports that overlay maps using raw, unnormalized data display general concepts more prominently than their counterparts using normalized data. The advantages and limitations of the overlay approach are also discussed.

Overall, this research provides a useful tool for visualizing and analyzing how scientific output is distributed across fields. This could be valuable for researchers, institutions, and policymakers looking to understand research profiles and identify areas for collaboration or investment.

Technical Explanation

The paper presents a procedure for creating global overlay maps of science using the OpenAlex database. Six different global base maps are provided.

To demonstrate the approach, the researchers use one of these base maps to create and analyze example overlay maps for an individual researcher (the first author) and his research institution. Each overlay shows scientific performance data, such as field-specific paper output, for that researcher or institution on top of the shared base map.
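A minimal sketch of how an overlay can be assembled, assuming each paper carries a list of concept tags (as OpenAlex works do) and the base map is a fixed set of concept positions. The base map coordinates, the sample papers, and the function names `count_papers_per_concept` and `build_overlay` are hypothetical and only illustrate the idea; they are not taken from the paper.

```python
from collections import Counter

# Hypothetical base map: concept name -> (x, y) position in the layout.
# In a real workflow these positions would come from a precomputed base map.
BASE_MAP = {
    "Chemistry": (0.1, 0.8),
    "Physics": (0.4, 0.9),
    "Computer science": (0.9, 0.2),
}


def count_papers_per_concept(works):
    """Count how many papers are tagged with each concept."""
    counts = Counter()
    for work in works:
        # A set ensures a concept listed twice on one paper counts once.
        for concept in set(work["concepts"]):
            counts[concept] += 1
    return counts


def build_overlay(base_map, counts):
    """Attach a paper count to every base-map node (0 where there are no papers)."""
    return [
        {"concept": name, "x": x, "y": y, "papers": counts.get(name, 0)}
        for name, (x, y) in base_map.items()
    ]


# Made-up sample papers standing in for one researcher's publication list.
works = [
    {"title": "Paper A", "concepts": ["Chemistry", "Physics"]},
    {"title": "Paper B", "concepts": ["Chemistry"]},
    {"title": "Paper C", "concepts": ["Computer science", "Chemistry"]},
]

overlay = build_overlay(BASE_MAP, count_papers_per_concept(works))
for node in overlay:
    print(node)
```

Because the node positions come from the base map and only the counts change, overlays for different researchers or institutions stay visually comparable.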

A key aspect of the overlay mapping process is data normalization. The researchers propose a method to normalize the overlay data and compare overlay maps created with raw, unnormalized data to those using normalized data.

The comparison shows that overlay maps with raw data display general concepts more prominently than their counterparts with normalized data.
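To make the difference between raw and normalized overlay values concrete, here is a small sketch of one possible normalization: the share of an entity's papers in a concept divided by that concept's share in a reference set. This ratio is a common bibliometric device and is used here as an assumption; it is not necessarily the method proposed in the paper. The function name `normalize_overlay` and all numbers are made up for illustration.

```python
def normalize_overlay(entity_counts, baseline_counts):
    """Return entity share / baseline share for each baseline concept.

    Values above 1 mean the entity is more active in a concept than the
    reference set; values below 1 mean it is less active.
    """
    entity_total = sum(entity_counts.values())
    baseline_total = sum(baseline_counts.values())
    normalized = {}
    for concept, baseline_n in baseline_counts.items():
        if entity_total == 0 or baseline_n == 0:
            # No meaningful ratio can be formed; report 0.0 for the node.
            normalized[concept] = 0.0
            continue
        entity_share = entity_counts.get(concept, 0) / entity_total
        baseline_share = baseline_n / baseline_total
        normalized[concept] = entity_share / baseline_share
    return normalized


# Made-up raw counts for one entity and for a reference set of papers.
entity = {"Chemistry": 30, "Computer science": 10}
baseline = {"Chemistry": 5000, "Computer science": 4000, "Physics": 1000}

normalized = normalize_overlay(entity, baseline)
for concept, value in normalized.items():
    print(f"{concept}: raw={entity.get(concept, 0)}, normalized={value:.3f}")
```

With a ratio like this, a concept that is frequent everywhere no longer dominates the map simply because it is attached to many papers.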

The researchers discuss the advantages of their overlay mapping approach, such as the ability to quickly visualize global research trends. Limitations include potential biases in the underlying data and challenges in interpreting complex overlay maps.

Overall, this research demonstrates a flexible method for creating global overlay maps of science to support the analysis of scientific performance and output. The tool could be valuable for researchers, policymakers, and others seeking to understand how research activity and impact are distributed across fields.

Critical Analysis

The paper presents a solid methodology for creating global overlay maps of scientific performance data using the OpenAlex database. The provision of multiple base map options is a strength, as it allows users to select the visualization that best suits their needs and data.

One potential limitation is the reliance on OpenAlex as the sole data source. While OpenAlex is a comprehensive database, it may not capture all relevant research activity, especially in regions or fields that are underrepresented. Incorporating data from additional sources could help provide a more complete picture.

The proposed normalization approach is a thoughtful attempt to correct for effects that can skew raw overlay data. However, other potential sources of distortion, such as language or coverage biases in publication databases, may deserve further attention. Further refinement of the normalization method may be warranted.

Additionally, the paper acknowledges that interpreting complex overlay maps can be challenging. While the researchers provide examples, more guidance on best practices for visualizing and analyzing these maps could enhance the tool's utility for a broad range of users.

Despite these minor limitations, the overlay mapping approach presented in this paper represents a valuable contribution to the field of science mapping and visualization. As the researchers note, these tools can provide important insights to support research management, policy, and collaboration decisions. Further development and validation of the technique could make it an increasingly powerful resource for the scientific community.

Conclusion

This paper introduces a novel method for creating global overlay maps to visualize scientific performance data using the OpenAlex database. The researchers demonstrate the approach through example maps for an individual researcher and their institution, highlighting both the advantages and potential limitations of the technique.

The ability to quickly generate customizable overlay maps that reveal field-level patterns in research output and other metrics could be a valuable asset for researchers, research managers, policymakers, and others seeking to understand science and innovation trends. As the researchers suggest, these tools may support decision-making around research priorities, funding allocations, and collaborations.

While further refinements to data sources and normalization methods could enhance the approach, this paper lays an important foundation for leveraging visualization technologies to gain deeper insights into the worldwide landscape of scientific activity and impact. As the research community continues to grapple with complex, large-scale data, innovations like global overlay mapping will likely play an increasingly vital role in extracting meaningful, actionable intelligence.



This summary was produced with help from an AI and may contain inaccuracies - check out the links to read the original source documents!

Related Papers

Usage of OpenAlex for creating meaningful global overlay maps of science on the individual and institutional levels

Robin Haunschild, Lutz Bornmann

Global overlay maps of science use base maps that are overlaid by specific data (from single researchers, institutions, or countries) for visualizing scientific performance such as field-specific paper output. A procedure to create global overlay maps using OpenAlex is proposed. Six different global base maps are provided. Using one of these base maps, example overlay maps for one individual (the first author of this paper) and his research institution are shown and analyzed. A method for normalizing the overlay data is proposed. Overlay maps using raw overlay data display general concepts more pronounced than their counterparts using normalized overlay data. Advantages and limitations of the proposed overlay approach are discussed.

4/4/2024

Chronological Outlooks of Globe Illustrated with Web-Based Visualization

Tahmim Hossain, Sai Sarath Movva, Ritika Ritika

Developing visualizations with comprehensive annotations is crucial for research and educational purposes. We've been experimenting with various visualization tools like Plotly, Plotly.js, and D3.js to analyze global trends, focusing on areas such as Global Terrorism, the Global Air Quality Index (AQI), and Global Population dynamics. These visualizations help us gain insights into complex research topics, facilitating better understanding and analysis. We've created a single web homepage that links to three distinct visualization web pages, each exploring specific topics in depth. These webpages have been deployed on free cloud hosting servers such as Vercel and Render.

4/26/2024

The Ontoverse: Democratising Access to Knowledge Graph-based Data Through a Cartographic Interface

Johannes Zimmermann, Dariusz Wiktorek, Thomas Meusburger, Miquel Monge-Dalmau, Antonio Fabregat, Alexander Jarasch, Gunter Schmidt, Jorge S. Reis-Filho, T. Ian Simpson

As the number of scientific publications and preprints is growing exponentially, several attempts have been made to navigate this complex and increasingly detailed landscape. These have almost exclusively taken unsupervised approaches that fail to incorporate domain knowledge and lack the structural organisation required for intuitive interactive human exploration and discovery. Especially in highly interdisciplinary fields, a deep understanding of the connectedness of research works across topics is essential for generating insights. We have developed a unique approach to data navigation that leans on geographical visualisation and uses hierarchically structured domain knowledge to enable end-users to explore knowledge spaces grounded in their desired domains of interest. This can take advantage of existing ontologies, proprietary intelligence schemata, or be directly derived from the underlying data through hierarchical topic modelling. Our approach uses natural language processing techniques to extract named entities from the underlying data and normalise them against relevant domain references and navigational structures. The knowledge is integrated by first calculating similarities between entities based on their shared extracted feature space and then by alignment to the navigational structures. The result is a knowledge graph that allows for full text and semantic graph query and structured topic driven navigation. This allows end-users to identify entities relevant to their needs and access extensive graph analytics. The user interface facilitates graphical interaction with the underlying knowledge graph and mimics a cartographic map to maximise ease of use and widen adoption. We demonstrate an exemplar project using our generalisable and scalable infrastructure for an academic biomedical literature corpus that is grounded against hundreds of different named domain entities.

8/9/2024

AceMap: Knowledge Discovery through Academic Graph

Xinbing Wang, Luoyi Fu, Xiaoying Gan, Ying Wen, Guanjie Zheng, Jiaxin Ding, Liyao Xiang, Nanyang Ye, Meng Jin, Shiyu Liang, Bin Lu, Haiwen Wang, Yi Xu, Cheng Deng, Shao Zhang, Huquan Kang, Xingli Wang, Qi Li, Zhixin Guo, Jiexing Qi, Pan Liu, Yuyang Ren, Lyuwen Wu, Jungang Yang, Jianping Zhou, Chenghu Zhou

The exponential growth of scientific literature requires effective management and extraction of valuable insights. While existing scientific search engines excel at delivering search results based on relational databases, they often neglect the analysis of collaborations between scientific entities and the evolution of ideas, as well as the in-depth analysis of content within scientific publications. The representation of heterogeneous graphs and the effective measurement, analysis, and mining of such graphs pose significant challenges. To address these challenges, we present AceMap, an academic system designed for knowledge discovery through academic graph. We present advanced database construction techniques to build the comprehensive AceMap database with large-scale academic entities that contain rich visual, textual, and numerical information. AceMap also employs innovative visualization, quantification, and analysis methods to explore associations and logical relationships among academic entities. AceMap introduces large-scale academic network visualization techniques centered on nebular graphs, providing a comprehensive view of academic networks from multiple perspectives. In addition, AceMap proposes a unified metric based on structural entropy to quantitatively measure the knowledge content of different academic entities. Moreover, AceMap provides advanced analysis capabilities, including tracing the evolution of academic ideas through citation relationships and concept co-occurrence, and generating concise summaries informed by this evolutionary process. In addition, AceMap uses machine reading methods to generate potential new ideas at the intersection of different fields. Exploring the integration of large language models and knowledge graphs is a promising direction for future research in idea evolution. Please visit https://www.acemap.info for further exploration.

4/16/2024