Publications

You can also find my articles on my Google Scholar profile.

Journal Articles


Strong Novelty Regained: High-Impact Outcomes of Machine Learning for Science

Published in Synthese, forthcoming, 2025

A general class of presupposition arguments holds that the background knowledge and theory required to design, develop, and interpret a machine learning (ML) system imply a strong upper limit to ML’s impact on science. I consider two proposals for how to assess the scientific impact of ML predictions, and I argue that while these accounts prioritize conceptual change, the presuppositions they take to be disqualifying for strong novelty are too restrictive. I characterize a general form of their arguments I call the Concept-free Design Argument: that strong novelty is curtailed by utilizing prior conceptualizations of target phenomena in model design. However, I argue that if ML design choices (such as ground-truth labels for supervised ML and inductive biases) are based on prior conceptualizations of phenomena, it need not impede conceptual change. Furthermore, while their accounts focus narrowly on conceptual change, a variety of learning outcomes also contribute to strong scientific change. Thus, I present a variety of types of strong novelty from philosophy of creativity, epistemology, and philosophy of science that paint a more varied picture of how ML advances science. One of these is a form of local theory-independent learning from data that signals an aim to substantially revise existing theory, but it is not easily undermined by prior assumptions about target phenomena. Furthermore, generating surprise, reducing utility blindness, and eliminating deep ignorance also indicate high impact to scientific knowledge or research direction. I illustrate these types of strong novelty with several cases of scientific discovery with algorithms. My taxonomy clarifies several desiderata for machine-based exploration and should inform choices in designing for scientific change.

Download Paper

On Values in Fairness Optimization with Machine Learning

Published in Philosophy of Science, forthcoming, 2024

Statistical criteria of fairness bring attention to the multiobjective nature of many predictive modelling problems. In this paper, I consider how epistemic and non-epistemic values impact the design of machine learning algorithms that optimize for more than one normative goal. I focus on a major design choice between biased search strategies that directly incorporate priorities for various objectives into an optimization procedure, and unbiased search strategies that do not. I argue that both reliably generate Pareto optimal solutions such that various other values are relevant to making a rational choice between them.

Download Paper

Some Dilemmas for an Account of Neural Representation: A Reply to Poldrack

Published in Synthese, 2022

“The physics of representation” (Poldrack, 2020) aims to (1) define the word “representation” as used in the neurosciences, (2) argue that such representations as described in neuroscience are related to and usefully illuminated by the representations generated by modern neural networks, and (3) establish that these entities are “representations in good standing”. We suggest that Poldrack succeeds in (1), exposes some tensions between the broad use of the term in neuroscience and the narrower class of entities that he identifies in the end, and between the meaning of “representation” in neuroscience and in psychology in (2), and fails in (3). This results in some hard choices: give up on the broad scope of the term in neuroscience (and thereby potentially opening a gap between psychology and neuroscience) or continue to embrace the broad, psychologically inflected sense of the term, and deny the entities generated by neural nets (and the brain) are representations in the relevant sense.

Download Paper

PARETO: A novel evolutionary optimization approach to multiobjective IMRT planning

Published in Medical Physics, 2011

In radiation therapy treatment planning, the clinical objectives of uniform high dose to the planning target volume (PTV) and low dose to the organs-at-risk (OARs) are invariably in conflict, often requiring compromises to be made between them when selecting the best treatment plan for a particular patient. In this work, the authors introduce Pareto-Aware Radiotherapy Evolutionary Treatment Optimization (pareto), a multiobjective optimization tool to solve for beam angles and fluence patterns in intensity-modulated radiation therapy (IMRT) treatment planning. PARETO is built around a powerful multiobjective genetic algorithm (GA), which allows us to treat the problem of IMRT treatment plan optimization as a combined monolithic problem, where all beam fluence and angle parameters are treated equally during the optimization. We have employed a simple parameterized beam fluence representation with a realistic dose calculation approach, incorporating patient scatter effects, to demonstrate feasibility of the proposed approach on two phantoms. The first phantom is a simple cylindrical phantom containing a target surrounded by three OARs, while the second phantom is more complex and represents a paraspinal patient.

Recommended citation: Fiege, J., McCurdy, B., Potrebko, P., Champion, H., & Cull, A. (2011). PARETO: A Novel Evolutionary Optimization Approach to Multiobjective IMRT planning. Medical Physics, 38(9), 5217-5229
Download Paper