Planetary computing for data-driven environmental policy-making

Read original: arXiv:2303.04501 - Published 6/4/2024 by Patrick Ferris, Michael Dales, Sadiq Jaffer, Amelia Holcomb, Eleanor Toye Scott, Thomas Swinfield, Alison Eyres, Andrew Balmford, David Coomes, Srinivasan Keshav and 1 other

Overview

  • The paper makes a case for "planetary computing" - infrastructure to handle the ingestion, transformation, analysis, and publication of global data products for advancing environmental science and enabling better-informed policy-making.
  • The authors draw on their experiences as a team of computer scientists working with environmental scientists on forest carbon and biodiversity preservation.
  • They classify existing solutions based on their flexibility in scalably processing geospatial data and how well they support building trust in the results via traceability and reproducibility.
  • The paper identifies research gaps at the intersection of computing and environmental science around handling continuously changing datasets collected over decades and requiring careful access control rather than being fully open access.

Plain English Explanation

The paper discusses the idea of "planetary computing" - developing the infrastructure and systems to effectively manage and analyze huge amounts of environmental data from around the world. This could help advance environmental science research and support better decision-making by policymakers.

The authors are computer scientists who have worked with environmental scientists on projects related to forests and biodiversity. They evaluate existing technological solutions on two counts: how well they scale to large geospatial datasets, and how transparent and reproducible their results are.

A key challenge they identify is that much of the environmental data being collected is constantly changing over time, often gathered over decades. This data also often requires careful control over who can access it, rather than being fully open to the public. The paper suggests there are gaps in research on how to best manage and analyze this type of complex, evolving environmental dataset.

Technical Explanation

The paper proposes the concept of "planetary computing" - a computing infrastructure designed to ingest, transform, analyze, and publish global-scale environmental data products. This could support advancements in environmental science research and better-informed policymaking.

The authors draw on their experiences working as a team of computer scientists collaborating with environmental scientists on projects related to forest carbon and biodiversity preservation. They categorize existing technological solutions based on two key criteria: 1) the flexibility and scalability of the systems in processing large geospatial datasets, and 2) the degree to which the solutions support building trust in the results through traceability and reproducibility.

A key research gap identified in the paper is how to handle environmental datasets that change continuously, are often collected over decades, and require careful access control rather than being fully open access. The authors suggest that this intersection of computing and environmental science needs further investigation.
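To make the traceability and reproducibility criterion concrete, here is a minimal Python sketch of one way a pipeline step could record which versions of its inputs produced a result. It is an illustration under stated assumptions, not the authors' system: the names run_step, content_hash and mean_of_tiles, and the tiny comma-separated "tiles", are all hypothetical.

```python
import hashlib
import json


def content_hash(data: bytes) -> str:
    """Return the SHA-256 hex digest of a byte string."""
    return hashlib.sha256(data).hexdigest()


def run_step(name, func, inputs):
    """Run one pipeline step and return (output bytes, provenance record).

    `inputs` maps a label to the raw bytes of an input dataset.
    The record stores content hashes, so anyone holding the same bytes
    can check that a published result came from exactly these inputs.
    """
    output = func(inputs)
    record = {
        "step": name,
        "inputs": {label: content_hash(blob) for label, blob in sorted(inputs.items())},
        "output": content_hash(output),
    }
    return output, record


def mean_of_tiles(inputs):
    # Hypothetical analysis: average all comma-separated values across tiles.
    values = [float(v) for blob in inputs.values() for v in blob.decode().split(",")]
    return json.dumps({"mean": sum(values) / len(values)}).encode()


# Two runs over identical inputs yield identical provenance records.
tiles_v1 = {"tile_a": b"1.0,2.0", "tile_b": b"3.0,4.0"}
out1, rec1 = run_step("mean", mean_of_tiles, tiles_v1)
out1_again, rec1_again = run_step("mean", mean_of_tiles, tiles_v1)

# A later revision of one tile changes only that tile's hash and the output hash.
tiles_v2 = {"tile_a": b"1.0,2.0", "tile_b": b"3.0,5.0"}
out2, rec2 = run_step("mean", mean_of_tiles, tiles_v2)
```

Hashing content rather than file names means that a dataset revised upstream shows up as a changed input hash, which is one simple way to notice that a result derived from a continuously changing dataset has gone stale. Access control, which the paper also raises, is outside the scope of this sketch.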

Critical Analysis

The paper rightly highlights the critical importance of developing robust computing infrastructure and systems to support the analysis and understanding of global-scale environmental data. As the authors note, this is essential for advancing environmental science research and informing effective policymaking around issues like climate change and biodiversity conservation.

The authors' focus on evaluating existing solutions based on scalability and result traceability/reproducibility is valuable. These are key attributes needed for environmental data platforms to be trusted and widely adopted. However, the paper does not provide much detail on the specific existing systems analyzed or the detailed criteria used for the evaluation.

Additionally, the paper could have gone further in exploring the challenges around managing continuously evolving environmental datasets with access control requirements. This is a significant hurdle that likely requires novel approaches beyond traditional data management techniques. More discussion of potential solutions or directions for future research in this area would have strengthened the paper.

Overall, the paper makes a compelling case for "planetary computing" capabilities to support environmental science and policy. The research gaps it identifies could help guide future work at the intersection of computing and environmental science.

Conclusion

This paper argues for the development of "planetary computing" infrastructure to enable the effective management and analysis of global environmental data. Such a system could drive advancements in environmental science research and support better-informed policymaking around critical issues like climate change and biodiversity.

The authors draw on their experiences working at the intersection of computer science and environmental science, evaluating existing technological solutions based on scalability and result transparency. They identify a key research gap around handling continuously evolving environmental datasets that require carefully controlled access.

Addressing the challenges outlined in this paper could lead to transformative improvements in our ability to understand and protect the planet's ecosystems. The vision of "planetary computing" put forth deserves further exploration and development by the research community.



This summary was produced with help from an AI and may contain inaccuracies - check out the links to read the original source documents!

Related Papers

Planetary computing for data-driven environmental policy-making

Patrick Ferris, Michael Dales, Sadiq Jaffer, Amelia Holcomb, Eleanor Toye Scott, Thomas Swinfield, Alison Eyres, Andrew Balmford, David Coomes, Srinivasan Keshav, Anil Madhavapeddy

We make a case for planetary computing -- infrastructure to handle the ingestion, transformation, analysis and publication of global data products for furthering environmental science and enabling better informed policy-making. We draw on our experiences as a team of computer scientists working with environmental scientists on forest carbon and biodiversity preservation, and classify existing solutions by their flexibility in scalably processing geospatial data, and also how well they support building trust in the results via traceability and reproducibility. We identify research gaps in the intersection of computing and environmental science around how to handle continuously changing datasets that are often collected across decades and require careful access control rather than being fully open access.

6/4/2024

Carbon Connect: An Ecosystem for Sustainable Computing

Benjamin C. Lee, David Brooks, Arthur van Benthem, Udit Gupta, Gage Hills, Vincent Liu, Benjamin Pierce, Christopher Stewart, Emma Strubell, Gu-Yeon Wei, Adam Wierman, Yuan Yao, Minlan Yu

Computing is at a moment of profound opportunity. Emerging applications -- such as capable artificial intelligence, immersive virtual realities, and pervasive sensor systems -- drive unprecedented demand for computer. Despite recent advances toward net zero carbon emissions, the computing industry's gross energy usage continues to rise at an alarming rate, outpacing the growth of new energy installations and renewable energy deployments. A shift towards sustainability is needed to spark a transformation in how computer systems are manufactured, allocated, and consumed. Carbon Connect envisions coordinated research thrusts that produce design and management strategies for sustainable, next-generation computer systems. These strategies must flatten and then reverse growth trajectories for computing power and carbon for society's most rapidly growing applications such as artificial intelligence and virtual spaces. We will require accurate models for carbon accounting in computing technology. For embodied carbon, we must re-think conventional design strategies -- over-provisioned monolithic servers, frequent hardware refresh cycles, custom silicon -- and adopt life-cycle design strategies that more effectively reduce, reuse and recycle hardware at scale. For operational carbon, we must not only embrace renewable energy but also design systems to use that energy more efficiently. Finally, new hardware design and management strategies must be cognizant of economic policy and regulatory landscape, aligning private initiatives with societal goals. Many of these broader goals will require computer scientists to develop deep, enduring collaborations with researchers in economics, law, and industrial ecology to spark change in broader practice.

8/22/2024

Web-based Visualization and Analytics of Petascale data: Equity as a Tide that Lifts All Boats

Aashish Panta, Xuan Huang, Nina McCurdy, David Ellsworth, Amy Gooch, Giorgio Scorzelli, Hector Torres, Patrice Klein, Gustavo Ovando-Montejo, Valerio Pascucci

Scientists generate petabytes of data daily to help uncover environmental trends or behaviors that are hard to predict. For example, understanding climate simulations based on the long-term average of temperature, precipitation, and other environmental variables is essential to predicting and establishing root causes of future undesirable scenarios and assessing possible mitigation strategies. While supercomputer centers provide a powerful infrastructure for generating petabytes of simulation output, accessing and analyzing these datasets interactively remains challenging on multiple fronts. This paper presents an approach to managing, visualizing, and analyzing petabytes of data within a browser on equipment ranging from the top NASA supercomputer to commodity hardware like a laptop. Our novel data fabric abstraction layer allows user-friendly querying of scientific information while hiding the complexities of dealing with file systems or cloud services. We also optimize network utilization while streaming from petascale repositories through state-of-the-art progressive compression algorithms. Based on this abstraction, we provide customizable dashboards that can be accessed from any device with any internet connection, enabling interactive visual analysis of vast amounts of data to a wide range of users - from top scientists with access to leadership-class computing environments to undergraduate students of disadvantaged backgrounds from minority-serving institutions. We focus on NASA's use of petascale climate datasets as an example of particular societal impact and, therefore, a case where achieving equity in science participation is critical. We further validate our approach by deploying the dashboards and simplified training materials in the classroom at a minority-serving institution.

8/23/2024

Reducing the climate impact of data portals: a case study

Noah Gießing, Madhurima Deb, Ankit Satpute, Moritz Schubotz, Olaf Teschke

The carbon footprint share of the information and communication technology (ICT) sector has steadily increased in the past decade and is predicted to make up as much as 23 % of global emissions in 2030. This shows a pressing need for developers, including the information retrieval community, to make their code more energy-efficient. In this project proposal, we discuss techniques to reduce the energy footprint of the MaRDI (Mathematical Research Data Initiative) Portal, a MediaWiki-based knowledge base. In future work, we plan to implement these changes and provide concrete measurements on the gain in energy efficiency. Researchers developing similar knowledge bases can adapt our measures to reduce their environmental footprint. In this way, we are working on mitigating the climate impact of Information Retrieval research.

6/7/2024