The Most Important Tool in Astronomy Isn’t a Telescope

On any given night, thousands of astronomers around the world point their telescopes at the same sky. They see the same stars, the same galaxies, the same cosmic microwave background. But until recently, they could not share the same code. A researcher at the University of Hawaii might write a Python script to calibrate images from the Subaru Telescope. A postdoc in Heidelberg would spend three months writing essentially the same script for data from the Very Large Telescope. They were solving the same problems, separately, for decades.
That is the problem the Astropy Project set out to kill.
In 2022, a team of 145 authors led by Adrian M. Price-Whelan published a paper in The Astrophysical Journal documenting the project’s fifth major release (Price-Whelan et al., 2022). The paper has since been cited over 4,400 times. But those citations do not measure what matters. What matters is this: the Astropy software library is now downloaded more than a million times per month. It has become the invisible infrastructure beneath nearly every major discovery in modern astronomy.
The revolution is not in the code. It is in how the code was built.
How Do You Get 145 People to Agree on Anything?

The Astropy Project began in 2011, when a small group of astronomers decided they were tired of reinventing wheels. They wanted a single, community-developed Python library that could handle the basic tasks of astronomy: reading FITS files, converting coordinates, modeling spectra, calculating cosmological distances. The idea was simple. The execution was anything but.
The 2022 paper lists 145 co-authors. That is not a typo. One hundred and forty five people from institutions across the globe contributed to the code, the documentation, the testing, and the governance of the project. This is not how science usually works. Typically, a small team writes a paper, and the paper is the product. Here, the product is a living piece of software that thousands of people use every day.
The authors describe a governance structure that is remarkably flat. There is a coordination committee, but decisions are made by consensus among active contributors. Anyone can propose a change. Anyone can review a pull request. The project has a code of conduct, a formalized process for adding new features, and a system for handling disagreements. It is, in effect, a small democracy of scientists who have agreed to share their tools.
"This is not a top-down project," the authors write. "It is a community of practice." The result is software that is not just functional but trusted. When you use Astropy to convert a celestial coordinate, you are using code that has been reviewed by experts in astrometry, tested against real telescope data, and validated by thousands of users.
What the Code Actually Does

Astropy is not one thing. It is a library of about 40 submodules, each handling a different piece of the astronomer’s workflow. The core package, described in the 2022 paper, includes tools for:
- ▸Reading and writing astronomical data formats (FITS, VOTable, ASCII tables)
- ▸Performing unit conversions and physical constants
- ▸Handling celestial coordinates and time systems
- ▸Modeling spectra and fitting models to data
- ▸Computing cosmological distances and times
- ▸Accessing online databases like SIMBAD and NASA/IPAC
The new release, version 5.0, added significant improvements to performance and memory usage. It also introduced a new "astropy.units" system that makes it nearly impossible to accidentally mix up meters and parsecs. That might sound trivial. It is not. In 1999, NASA lost the Mars Climate Orbiter because one team used imperial units and another used metric. Astronomers deal with units that span 30 orders of magnitude. A single unit conversion error can destroy a career.
The authors note that the code is now used in the data pipelines of major observatories including the Hubble Space Telescope, the James Webb Space Telescope, and the Vera C. Rubin Observatory. When the Rubin Observatory begins its ten-year survey of the sky in 2024, it will produce 20 terabytes of data every night. Astropy will be part of the software that processes that data.
The Hidden Architecture of Open Science
The 2022 paper is not really about code. It is about a social structure.
The authors describe how the project is funded, how decisions are made, and how newcomers are welcomed. There is a formal mentorship program. There are regular "sprints" where contributors meet in person or online to work on specific problems. There is a system of "package maintainers" who take responsibility for different parts of the library.
This matters because science is changing. The old model, where a single PI writes grant proposals and directs a team of graduate students, is giving way to something more distributed. Large collaborations like the LIGO gravitational wave observatory and the Event Horizon Telescope have hundreds of co-authors. The data they produce is too complex for any one person to analyze. They need shared tools.
Astropy is a proof that this model works. The paper shows that the project has grown steadily since 2011, with more contributors, more users, and more features. The release cycle is predictable: a new major version every year, with minor releases in between. The code is tested automatically on multiple platforms. The documentation is thorough and updated continuously.
But the authors are honest about the challenges. Maintaining a large open-source project is exhausting. Many contributors are early-career researchers who do not get academic credit for their work. The project has struggled with "bus factor" the risk that a key contributor will leave and take crucial knowledge with them. The solution, they argue, is to keep the community broad and the governance transparent.
What This Means for the Future of Astronomy
The Astropy Project is not just a tool. It is a template.
The authors argue that the same model could work for other areas of science. Biology, chemistry, and physics all have their own specialized software needs. They all suffer from the same inefficiencies: researchers writing the same code over and over, at different institutions, with no way to share it. A community-driven library like Astropy could save thousands of person-years of effort.
There is a deeper point here. Science is becoming more computational. The days when a researcher could do all their analysis in a spreadsheet are over. Modern data sets are too large, too complex, and too noisy. The tools we use to analyze them shape the questions we can ask. If those tools are proprietary or closed-source, they introduce a hidden bias into the scientific process. Open-source software like Astropy ensures that the methods are transparent and reproducible.
The 2022 paper is a document of this transition. It is not a typical scientific paper. It has no hypotheses, no experiments, no results in the traditional sense. It is a description of a community and its output. The fact that it has been cited over 4,400 times is a sign that the community values it. But the real measure of its success is invisible: every time an astronomer uses Astropy without thinking about it, the project has done its job.
What the Paper Does Not Prove
The Astropy model is not a universal solution.
The authors acknowledge that the project has benefited from a relatively homogenous community. Most contributors are professional astronomers at major institutions. The project has struggled to attract contributors from underrepresented groups, both in terms of demographics and geography. The code is written in English, and the documentation assumes a certain level of technical expertise.
There is also the question of sustainability. The project is funded by a mix of grants and institutional support, but it is not clear how long that will last. Many open-source projects in science have collapsed when the key contributors moved on to other jobs. The authors argue that the community structure is resilient, but they do not have data to prove it.
Finally, the paper does not address the tension between standardization and innovation. Astropy makes it easy to do common tasks, but it also makes it easy to do them the same way every time. That is good for reproducibility. It might be bad for creativity. Some of the most important discoveries in astronomy have come from researchers who wrote their own custom code and saw things that nobody else had seen. The authors do not explore this tradeoff.
What This Actually Means
- ▸If you are a graduate student in astronomy, install Astropy on your first day. It will save you months of work. Learn how to contribute to the project. It is one of the best ways to build a professional network and learn how real science software is built.
- ▸If you are a principal investigator writing a grant, budget for software maintenance. The authors note that most funding goes to building new tools, not maintaining existing ones. That is a mistake. Stable, well-documented software is as important as a new telescope.
- ▸If you are a department chair or a tenure committee member, count software contributions as real academic output. The 145 co-authors on this paper did not all write code. Some wrote documentation. Some reviewed pull requests. Some mentored new contributors. That work matters.
- ▸If you are a scientist in a different field, study the Astropy governance model. It is one of the most successful examples of community-driven software in science. The same principles could work for genomics, climate modeling, or materials science.
- ▸If you are a taxpayer, know that your money is being spent on more than just telescopes. The Astropy Project is funded by the National Science Foundation and other agencies. It is a public good, like a highway or a library. It makes science faster, cheaper, and more transparent. That is a good investment.
References
- [1]Adrian M. Price-Whelan, LIM, Pey Lian, A. Zonca, STARKMAN, Nathaniel (2022). The Astropy Project: Sustaining and Growing a Community-oriented Open-source Project and the Latest Major Release (v5.0) of the Core Package. Research Portal (Queen's University Belfast)DOI· 4,465 citations
