Pragmatist Intelligence: Where the Principle of Usefulness Can Take ANNs

Read original: arXiv:2405.04386 - Published 5/8/2024 by Antonio Bikić, Sayan Mukherjee

Overview

Artificial neural networks (ANNs) excel at classification and prediction tasks such as speech processing and image classification.
The paper examines the connection between how ANNs select model parameters and the epistemological theory of neopragmatism, focusing on utility and anti-representationalist aspects.
The paper suggests using neopragmatist theories to understand the consequences of ANN model parameter selection, as neopragmatism's notion of optimization is also based on utility considerations.
The paper explores the inherent connections between optimization in machine learning (ML), numerical methods during the learning phase, and optimization in the ethical theory of consequentialism.

Plain English Explanation

Artificial neural networks (ANNs) are a type of machine learning model that can perform exceptionally well on tasks like speech processing and image classification. These models are able to freely select all the necessary internal parameters to deliver the desired functionality.

The paper examines the relationship between how these ANN models select their parameters and the philosophical theory of neopragmatism. Neopragmatism is a perspective that focuses on the practical utility and usefulness of knowledge, rather than on accurately representing reality. The paper suggests that using neopragmatist ideas can help us understand the consequences of how ANNs choose their parameters, as neopragmatism also emphasizes optimization based on utility considerations.

The paper further argues that the way ANNs optimize their parameters during learning is closely connected to the ethical theory of consequentialism, which likewise evaluates actions by their outcomes and utility. The authors propose that these connections arise from how relevance is calculated in ML systems, which could reveal tendencies toward specific actions in ANN-based systems.

Technical Explanation

The paper explores the connection between how artificial neural networks (ANNs) select their internal model parameters and the epistemological theory of neopragmatism. ANNs are a type of machine learning model that can excel at a variety of tasks, including classification and prediction.
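One way to make the idea of freely selected parameters concrete is the sketch below. It is an illustration added to this summary, not code from the paper: the tiny network `mlp` and all parameter values are hypothetical. It shows three different parameter settings that compute exactly the same function, so the individual numbers matter only through the behavior they produce.

```python
import math

def mlp(x, W1, b1, w2, b2):
    """Tiny one-input network with two tanh hidden units (hypothetical example)."""
    hidden = [math.tanh(w * x + b) for w, b in zip(W1, b1)]
    return sum(v * h for v, h in zip(w2, hidden)) + b2

# Baseline parameters (arbitrary illustrative values).
params_a = dict(W1=[0.5, -1.2], b1=[0.1, 0.3], w2=[2.0, -0.7], b2=0.05)

# Same network with the two hidden units swapped.
params_b = dict(W1=[-1.2, 0.5], b1=[0.3, 0.1], w2=[-0.7, 2.0], b2=0.05)

# Same network with the first hidden unit sign-flipped:
# tanh is odd, so negating its input weights and its output weight cancels out.
params_c = dict(W1=[-0.5, -1.2], b1=[-0.1, 0.3], w2=[-2.0, -0.7], b2=0.05)

xs = [-1.0, 0.0, 0.7]
out_a = [mlp(x, **params_a) for x in xs]
out_b = [mlp(x, **params_b) for x in xs]
out_c = [mlp(x, **params_c) for x in xs]

# Largest disagreement between the three parameterizations on the test inputs.
max_diff = max(
    max(abs(a - b), abs(a - c)) for a, b, c in zip(out_a, out_b, out_c)
)
print("largest output difference:", max_diff)
```

The three settings differ in almost every number, yet the outputs agree up to floating-point rounding. This is only a loose analogy for the anti-representationalist reading summarized here: what is evaluated is what the parameters do, not what any single parameter stands for.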

The authors suggest that using neopragmatist theories, which focus on the utility and anti-representationalist aspects of knowledge, can help reveal the consequences of how ANNs choose their model parameters. Neopragmatism's notion of optimization is also based on utility considerations, similar to the optimization that occurs in machine learning during the training phase.

The paper explores the inherent connections between this optimization in ML, using numerical methods during the learning phase, and the optimization in the ethical theory of consequentialism, where it is a central principle. The authors propose that these connections arise from how relevance is calculated in ML systems, and this could ultimately uncover tendencies for specific actions in ANN-based systems.
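The numerical optimization referred to above can be sketched with plain gradient descent on a mean squared error loss. This is a minimal illustration under stated assumptions, not the paper's method: the one-weight linear model, the toy data, the learning rate `lr`, and the step count are all hypothetical choices made for this example. The point is that each update is judged solely by its effect on the loss, which plays the role of a (negative) utility.

```python
def loss(w, b, data):
    """Mean squared error of the linear model y = w * x + b on the data."""
    return sum((w * x + b - y) ** 2 for x, y in data) / len(data)

def train(data, lr=0.05, steps=2000):
    """Plain gradient descent; lr and steps are illustrative assumptions."""
    w, b = 0.0, 0.0
    n = len(data)
    for _ in range(steps):
        # Gradients of the mean squared error with respect to w and b.
        grad_w = sum(2 * (w * x + b - y) * x for x, y in data) / n
        grad_b = sum(2 * (w * x + b - y) for x, y in data) / n
        # Move the parameters in the direction that lowers the loss.
        w -= lr * grad_w
        b -= lr * grad_b
    return w, b

# Toy data generated from y = 2x + 1 (hypothetical example).
data = [(0.0, 1.0), (1.0, 3.0), (2.0, 5.0), (3.0, 7.0)]

initial_loss = loss(0.0, 0.0, data)
w, b = train(data)
final_loss = loss(w, b, data)
print("w =", w, "b =", b, "loss:", initial_loss, "->", final_loss)
```

Nothing in the loop asks whether `w` or `b` describes the world correctly; the parameters are kept or changed only according to how much they reduce the loss. That outcome-driven selection is the structural feature the paper compares with consequentialist optimization.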

Critical Analysis

The paper provides an interesting perspective on the connections between machine learning, specifically ANNs, and philosophical theories like neopragmatism and consequentialism. By exploring these connections, the authors aim to shed light on the consequences of how ANNs select their model parameters.

One potential limitation of the research is that it does not delve deeply into empirical validation of the proposed connections. The paper remains largely conceptual and theoretical, relying on the existing literature on neopragmatism and consequentialism to make its arguments. Further research could explore more concrete examples or case studies to substantiate the claims made in the paper.

Additionally, the paper does not address potential issues or ethical concerns that may arise from the inherent connections between ML optimization and consequentialist ethical principles. As AI-powered systems become more prevalent, it will be crucial to carefully examine the implications of these connections and ensure that the actions taken by ANN-based systems align with ethical frameworks that go beyond just optimization based on utility.

Conclusion

This paper presents a thought-provoking exploration of the connections between the internal parameter selection of artificial neural networks and the epistemological theory of neopragmatism, as well as the ethical theory of consequentialism. By highlighting these connections, the authors suggest that using neopragmatist ideas can provide valuable insights into the consequences of how ANNs choose their model parameters.

The paper's main contribution is in drawing these conceptual links and encouraging further research to empirically validate and explore the implications of these connections. As AI models become more ubiquitous, understanding the ethical and philosophical underpinnings of their decision-making processes will be crucial for ensuring these systems are aligned with societal values and beneficial to humanity.

This summary was produced with help from an AI and may contain inaccuracies - check out the links to read the original source documents!

Related Papers

Pragmatist Intelligence: Where the Principle of Usefulness Can Take ANNs

Antonio Bikić, Sayan Mukherjee

Artificial neural networks (ANNs) perform extraordinarily on numerous tasks including classification or prediction, e.g., speech processing and image classification. These new functions are based on a computational model that is enabled to select freely all necessary internal model parameters as long as it eventually delivers the functionality it is supposed to exhibit. Here, we review the connection between the model parameter selection in machine learning (ML) algorithms running on ANNs and the epistemological theory of neopragmatism focusing on the theory's utility and anti-representationalist aspects. To understand the consequences of the model parameter selection of an ANN, we suggest using neopragmatist theories whose implications are well studied. Incidentally, neopragmatism's notion of optimization is also based on utility considerations. This means that applying this approach elegantly reveals the inherent connections between optimization in ML, using a numerical method during the learning phase, and optimization in the ethical theory of consequentialism, where it occurs as a maxim of action. We suggest that these connections originate from the way relevance is calculated in ML systems. This could ultimately reveal a tendency for specific actions in ML systems.

5/8/2024

Statistical Mechanics and Artificial Neural Networks: Principles, Models, and Applications

Lucas Bottcher, Gregory Wheeler

The field of neuroscience and the development of artificial neural networks (ANNs) have mutually influenced each other, drawing from and contributing to many concepts initially developed in statistical mechanics. Notably, Hopfield networks and Boltzmann machines are versions of the Ising model, a model extensively studied in statistical mechanics for over a century. In the first part of this chapter, we provide an overview of the principles, models, and applications of ANNs, highlighting their connections to statistical mechanics and statistical learning theory. Artificial neural networks can be seen as high-dimensional mathematical functions, and understanding the geometric properties of their loss landscapes (i.e., the high-dimensional space on which one wishes to find extrema or saddles) can provide valuable insights into their optimization behavior, generalization abilities, and overall performance. Visualizing these functions can help us design better optimization methods and improve their generalization abilities. Thus, the second part of this chapter focuses on quantifying geometric properties and visualizing loss functions associated with deep ANNs.

5/21/2024

AI without networks

Partha P Mitra, Clément Sire

Contemporary Artificial Intelligence (AI) stands on two legs: large training data corpora and many-parameter artificial neural networks (ANNs). The data corpora are needed to represent the complexity and heterogeneity of the world. The role of the networks is less transparent due to the obscure dependence of the network parameters and outputs on the training data and inputs. This raises problems, ranging from technical-scientific to legal-ethical. We hypothesize that a transparent approach to machine learning is possible without using networks at all. By generalizing a parameter-free, statistically consistent data interpolation method, which we analyze theoretically in detail, we develop a network-free framework for AI incorporating generative modeling. We demonstrate this framework with examples from three different disciplines - ethology, control theory, and mathematics. Our generative Hilbert framework applied to the trajectories of small groups of swimming fish outperformed state-of-the-art traditional mathematical behavioral models and current ANN-based models. We demonstrate pure data interpolation based control by stabilizing an inverted pendulum and a driven logistic map around unstable fixed points. Finally, we present a mathematical application by predicting zeros of the Riemann Zeta function, achieving comparable performance as a transformer network. We do not suggest that the proposed framework will always outperform networks as over-parameterized networks can interpolate. However, our framework is theoretically sound, transparent, deterministic, and parameter free: remarkably, it does not require any compute-expensive training, does not involve optimization, has no model selection, and is easily reproduced and ported. We also propose an easily computed method of credit assignment based on this framework, to help address ethical-legal challenges raised by generative AI.

6/7/2024

Eight challenges in developing theory of intelligence

Haiping Huang

A good theory of mathematical beauty is more practical than any current observation, as new predictions of physical reality can be verified self-consistently. This belief applies to the current status of understanding deep neural networks including large language models and even the biological intelligence. Toy models provide a metaphor of physical reality, allowing mathematically formulating that reality (i.e., the so-called theory), which can be updated as more conjectures are justified or refuted. One does not need to pack all details into a model, but rather, more abstract models are constructed, as complex systems like brains or deep networks have many sloppy dimensions but much less stiff dimensions that strongly impact macroscopic observables. This kind of bottom-up mechanistic modeling is still promising in the modern era of understanding the natural or artificial intelligence. Here, we shed light on eight challenges in developing theory of intelligence following this theoretical paradigm. These challenges are representation learning, generalization, adversarial robustness, continual learning, causal learning, internal model of the brain, next-token prediction, and finally the mechanics of subjective experience.

6/24/2024