2026-07-17 –, Memorial Hall
Computational narratives like Jupyter, MyST Markdown, R-Markdown, and Quarto are amazing for doing science. You can combine narrative, code, data and images, conducting your analysis while also creating information to share. However, the workflow has been that notebooks are where you do the work, but you need to publish a pdf article to advertise the work, and this is the research output that most people see. That process not only creates extra work, but we're losing key information, amazing graphics, interactive visualizations, and a connection to the code and data.
Flattening science into a published pdf sacrifices reproducibility and valuable context for others to build on the research. We’re continuing to share our science in 19th century ways, as if we need to send printed, physical copies of our work to people in the mail. This is both a boring and ineffective way to communicate science and also reduces the visibility and value to the modular components of research. The data, images, and code all have individual value, especially as we think about new ways for humans and machines to build on existing science for new impact.
The Open Exchange Infrastructure (OXA, https://oxa.dev) is a community standard for scientific publishing built for modular and computational science. Initial contributors include Stencila, eLife, Posit, PLOS, openRxiv, Curvenote, NeuroLibre, and Creative Commons — representing a new document format that brings together the best of Jupyter Notebooks, Quarto, MyST Markdown, and publishing/archiving standards to enable new scientific publishing experiences and workflows. OXA additionally allows many tools and existing formats to connect with each other and into traditional publishing workflows, like Journal Article Tag Suite (JATS XML) and Manuscript Exchange Common Approach (MECA). This means that what you share is interactive and engaging and your research products, like large scale microscopy images (e.g. OME-Zarr), are first-class citizens where image datasets, notebooks, and other research products are highlighted not hidden.
In this talk we’re sharing more on the technical architecture of the format and a pilot between openRxiv (the non-profit organization behind the largest biomedical preprint servers: bioRxiv and medRxiv) and Curvenote (a scientific content management system that also hosts the SciPy Proceedings) to migrate 500k preprints (8.1TB) to OXA and show real-world examples of interactive scientific content, modular attribution, and what’s possible when the pieces are connected and scientific research can be open, engaging and match what’s possible with our current technology - to change the way we share and do science. This isn’t a future vision, this is what is already happening today.
This talk is for:
* People who are scientists creating and sharing research, especially using computational notebooks (e.g. Jupyter Notebooks, Jupyter Book, Quarto, MyST Markdown)
* People developing tools related to scientific communications, that could more easily be connected with each other through OXA
* People working on formats and standards for computational notebooks and scientific publication
Some relevant previous speaking experience includes:
- Talk at SciPy 2023 on "Scientific and technical publishing with Python and Quarto"
- Talk at PyData Seattle 2023 on "It's not just code: managing an open source project"
- Talk at posit::conf 2022 on "These are a few of my favorite things (about Quarto presentations)"
Dr. Tracy K. Teal has been the Open Source Program Director at RStudio/Posit and Nixtla, Executive Director of Dryad, and a co-founder and Executive Director of The Carpentries and is now the CEO at openRxiv. She developed open source bioinformatics software as an assistant professor at Michigan State University and holds a PhD in Computation and Neural Systems from California Institute of Technology. Tracy is involved in the open source software and reproducible research communities, including serving on advisory committees for NumFOCUS, pyOpenSci, R Consortium, and CarbonPlan, and has been working with open source communities, developing curriculum, and teaching people how to work with data and code as a developer, instructor and project leader throughout her career.