BOSC 2025, the 26th Bioinformatics Open Source Conference

Nomi L. Harris; Karsten Hokamp; Jessica Maia; Hervé Ménager; Monia C Munoz-Torres; Tazro Ohta; Deepak Unni; Jason Williams

doi:10.12688/f1000research.169977.1

Home Browse BOSC 2025, the 26th Bioinformatics Open Source Conference

ALL Metrics

-

Views

-

Downloads

Get PDF

Get XML

Export

▬

✚

Editorial

BOSC 2025, the 26th Bioinformatics Open Source Conference

[version 1; peer review: not peer reviewed]

Nomi L. Harris ¹, Karsten Hokamp², Jessica Maia³, [...] Hervé Ménager⁴, Monia C Munoz-Torres⁵, Tazro Ohta⁶, Deepak Unni⁷, Jason Williams⁸

Nomi L. Harris ¹, Karsten Hokamp², [...] Jessica Maia³, Hervé Ménager⁴, Monia C Munoz-Torres⁵, Tazro Ohta⁶, Deepak Unni⁷, Jason Williams⁸

PUBLISHED 09 Sep 2025

Author details Author details

¹ E O Lawrence Berkeley National Laboratory, Berkeley, California, 94720, USA
² Smurfit Institute of Genetics Trinity College, Dublin, Leinster, D02 PN40, Ireland
³ BD, Research Triangle Park, NC, 27709, USA
⁴ Bioinformatics and Biostatistics Hub, Institut Pasteur, Paris, 75015, France
⁵ University of Colorado Anschutz Medical Campus, Aurora, Colorado, 80045, USA
⁶ Chiba University, Chiba, 263-8522, Japan
⁷ Swiss Institute of Bioinformatics, Basel, 4051, Switzerland
⁸ Cold Spring Harbor Laboratory, Cold Spring Harbor, New York, 11724, USA

Nomi L. Harris
Roles: Conceptualization, Funding Acquisition, Project Administration, Supervision, Writing – Original Draft Preparation, Writing – Review & Editing

Karsten Hokamp
Roles: Conceptualization, Project Administration, Software, Writing – Review & Editing

Jessica Maia
Roles: Conceptualization, Project Administration

Hervé Ménager
Roles: Conceptualization, Project Administration, Supervision, Writing – Review & Editing

Monia C Munoz-Torres
Roles: Conceptualization, Project Administration, Writing – Review & Editing

Tazro Ohta
Roles: Conceptualization, Project Administration

Deepak Unni
Roles: Conceptualization, Funding Acquisition, Project Administration, Writing – Review & Editing

Jason Williams
Roles: Conceptualization, Project Administration

OPEN PEER REVIEW

NOT PEER REVIEWED

This article is included in the Bioinformatics Open Source Conference (BOSC) collection.

Abstract

The 26th annual Bioinformatics Open Source Conference (BOSC 2025, open-bio.org/events/bosc-2025) brought its community-driven focus on open-source bioinformatics and open science to the 2025 conference on Intelligent Systems for Molecular Biology and the European Conference on Computational Biology (ISMB/ECCB 2025). Since its launch in 2000, BOSC has been the premier annual meeting covering open-source bioinformatics and open science.

Framed by two keynote addresses and a thought-provoking panel discussion, the two-day conference included sessions dedicated to open data, analytic tools and pipelines, workflow platforms, knowledge representation, and the application of AI/ML.

The first keynote talk was delivered by Christine Orengo: “Working together to develop, promote and protect our data resources: Lessons learnt developing CATH and TED.” A joint session with the Bio-Ontologies and Knowledge Representation (BOKR) track the second day of BOSC started with a keynote talk by Chris Mungall entitled “Open Knowledge Bases in the Age of Generative AI”. A closing panel on Data Sustainability, moderated by Mónica Muñoz Torres, featured panelists Scott Edmunds, Varsha Khodiyar, Tony Burdett, Nicky Mulder, and Chris Mungall.

This year, the CollaborationFest collaborative work event that typically precedes or follows ISMB was incorporated as part of the main conference and organized by BOSC with help from the Function and 3D-SIG tracks.

Keywords

bioinformatics; open source; open science; open data; data sustainability

Corresponding author: Nomi L. Harris

Competing interests: No competing interests were disclosed.

Grant information: The author(s) declare that no grants were directly involved in supporting BOSC; however, the participation of some of the organizers was funded in part by grants, including the following: NH was supported in part by the Director, Office of Science, Office of Basic Energy Sciences, of the U.S. Department of Energy under Contract No. DE-AC02-05CH11231; JW received support from NSF grants DBI-0735191, DBI-1265383, and DBI-1743442.
The funders had no role in study design, data collection and analysis, decision to publish, or preparation of the manuscript.

Copyright: © 2025 Harris NL et al. This is an open access article distributed under the terms of the Creative Commons Attribution License, which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly cited.

How to cite: Harris NL, Hokamp K, Maia J et al. BOSC 2025, the 26th Bioinformatics Open Source Conference [version 1; peer review: not peer reviewed]. F1000Research 2025, 14:887 (https://doi.org/10.12688/f1000research.169977.1) First published: 09 Sep 2025, 14:887 (https://doi.org/10.12688/f1000research.169977.1) Latest published: 09 Sep 2025, 14:887 (https://doi.org/10.12688/f1000research.169977.1)

Introduction

The 26th annual Bioinformatics Open Source Conference (BOSC 2025, open-bio.org/events/bosc-2025) took place in Liverpool, UK, as part of the 2025 conference on Intelligent Systems for Molecular Biology and the European Conference on Computational Biology (ISMB/ECCB 2025). Total in-person attendance at ISMB/ECCB 2025 was a record-breaking 2283, with an additional 312 online participants. Attendance at BOSC peaked at 283 in-person attendees (Figure 1), with additional online participants.

Figure 1. Members of the audience at BOSC 2025 in Liverpool, UK.

BOSC 2025 included two full days of talks and posters covering a wide range of topics in open science and open source bioinformatics, with two keynote talks and a panel discussion. The second day of BOSC was a joint session with the Bio-Ontologies and Knowledge Representation (BOKR) track.

Every year since 2000, the Bioinformatics Open Source Conference (BOSC) has been the key annual meeting for people involved in some aspect of open source or open science in bioinformatics. Last year’s conference, BOSC 2024, was the 25th BOSC, but this year marked 25 years since the first BOSC in 2000, so we took advantage of the off-by-one confusion (all too familiar to bioinformaticists) to celebrate our 25th birthday again at BOSC 2025.

BOSC is entirely volunteer run. Nomi Harris, a program manager at the Lawrence Berkeley National Laboratory, has led BOSC since 2012. The organizing committee (Figure 2) and review committee ensured a rigorous abstract selection process, with each submission receiving three independent reviews. BOSC’s commitment to transparency extends to its review process and rubric, documented and publicly available since 2020.

Figure 2. 7/8 of the BOSC 2025 organizing committee.

From left: Jason Williams, Jessica Maia, Nomi Harris, Deepak Unni, Mónica Muñoz Torres, Karsten Hokamp, Hervé Ménager (missing: Tazro Ohta).

Conference Program

Opening session

The conference began with a welcome from chair Nomi Harris, followed by an overview of BOSC’s parent organization, the Open Bioinformatics Foundation, by OBF President Peter Cock (Figure 3). The opening session closed with a tribute to Peter Amstutz, a very active member of the bioinformatics open-source community who passed away recently. Peter contributed his energy and vision to many BOSCs and CollaborationFests, and co-founded the widely used Common Workflow Language (CWL), which was conceived at the BOSC 2014 CoFest. The tribute was presented by Michael Crusoe, Peter’s collaborator and co-founder of CWL (Figure 4).

Figure 3. Peter Cock, president of the Open Bioinformatics Foundation, presented an OBF update.

Figure 4. Michael Crusoe presented a tribute to CWL co-founder Peter Amstutz, who sadly passed away shortly before the conference.

Keynotes

The first BOSC 2025 keynote talk was delivered by Dr. Christine Orengo: “Working together to develop, promote and protect our data resources: Lessons learnt developing CATH and TED” (Figure 5). Dr. Orengo is the Vice President of the ISCB and was previously the ISCB’s first female President. Dr. Orengo’s talk covered the history of CATH (which is celebrating its 30th anniversary) and the insights CATH and TED provided about protein structure and function, while also providing a model of how to bring the community together to sustain a resource. Interestingly, this involved working together with competitors to improve each other’s approaches.

Figure 5. BOSC 2025 keynote speaker Christine Orengo.

The joint BOSC/BOKR session was kicked off by keynote speaker Chris Mungall (Figure 6), who wove together his work on agentic AI, ontologies and knowledge bases to describe a “future of bioinformatics” that is in fact happening right now. While we have heard a great deal on the limits of LLMs, some of Dr. Mungall’s latest work examines the potential of agentic AI as an important aid for biocuration. As the number of tools grows, an under-recognized challenge will be evaluating and benchmarking the utility and reproducibility of these tools. One strategy for examining AI agent work benefits from the long history of provenance around curator maintenance of ontologies. Dating back to pre-GitHub times—circa 2002—this evidence trail allows evaluation work to directly compare how AI decisions are compared with the work of human curators. The talk ended with Dr. Mungall’s reflections on AI ethics, particularly related to circumstances where an AI might be tasked with conflicting policy directives.

Figure 6. Joint BOSC/BOKR keynote speaker Chris Mungall.

Talks and posters

In addition to the two keynotes, BOSC 2025 included 35 talks chosen from submitted abstracts (Figure 7) and 47 posters presented in person or online (Figure 8). This year’s session topics included open science, open data, analytic tools and pipelines, workflow platforms, knowledge representation, and AI/ML. The complete lists of talks and posters can be found at https://www.open-bio.org/events/bosc-2025/bosc-2025-schedule/.

Figure 7. Speakers at BOSC 2025 included Susheel Varma, Favour James, and Guy Cochrane.

Figure 8. Poster presenters Maggie Fu, Dorian Portillo, and Carole Goble share their work with other participants.

The first talk session covered topics related to open science, including code sharing practices, a case study on how open science is advancing bioinformatics in a low-resource region (Nigeria), and guidelines for FAIR research. The next two sessions focused on some of the core areas in open-source bioinformatics development, including tools and pipelines for data analysis, workflow systems, and developer toolkits. Day 1 closed with a session on open data which highlighted the need for AI-readiness and interoperability.

Day 2, which was combined with BOKR, included a session on AI/ML, with an emphasis on open approaches, sharing of models, and reusability. The next session, which focused on ontologies and knowledge representation, again included a lot of AI. The session was bookended by talks about two of the most popular biomedical ontology resources, BioPortal and AberOWL. Next, a short talk on the global biodata infrastructure by Guy Cochrane led directly into the closing panel on Data Sustainability.

Beyond the archive: Forging a sustainable future for scientific data

Our panel on Data Sustainability (Figure 9) tackled a critical question facing modern science: How do we ensure that the vast digital knowledge we generate today remains accessible, reliable, and valuable for generations to come? We defined data sustainability as a proactive approach to keeping bioinformatics data FAIR (Findable, Accessible, Interoperable, Reusable), ethically managed, and secure through robust infrastructure, funding, expertise, and governance.

Figure 9. The panel on data sustainability included (from left to right) moderator Mónica Muñoz Torres and panelists Scott Edmunds, Chris Mungall, Varsha Khodiyar, Tony Burdett, and Nicky Mulder.

The discussion quickly moved beyond principles to practical realities. Panelists agreed that while foundational guidelines like FAIR and CARE are essential, they are not a complete solution. They can introduce costs, and don’t inherently solve the complex challenges of governance and long-term funding. True sustainability cannot be achieved with a simple checklist; it requires building community-driven standards and frameworks, such as those developed by the Global Alliance for Genomics and Health (GA4GH), to ensure responsible data sharing.

A powerful theme emerged from the discussion: sustainability is fundamentally about people, not just data. The conversation repeatedly highlighted the urgent need to formally recognize, train, and establish career paths for the data stewards, biocurators, and bioethicists who are the guardians of our collective knowledge. As one panelist noted, we should invest less in creating perfectly static archives and more in cultivating the networks of people who can care for datasets, allowing them to evolve and grow over time.

Ultimately, the panel concluded that ensuring the future of our data requires a cultural shift. It means moving beyond simply asking for more funding and toward building resilient, adaptable ecosystems. It demands that we value the human expertise that underpins our data infrastructure and work together to face a landscape where data is not only at risk of being lost, but also of being devalued or attacked. Our collective digital legacy depends on it.

CollaborationFest

BOSC has held CollaborationFests (aka CoFests) every year before or after ISMB since 2010. CoFests are a two-day collaborative work event at which participants work together to contribute code, documentation, training materials, and challenging analysis problems and use cases. This year, we decided to hold the CollaborationFest as part of ISMB/ECCB and open it to all registered ISMB/ECCB participants. The co-organizers were Hervé Ménager (BOSC), Iddo Friedberg (Function COSI), and Ben Busby (Figure 10).

Figure 10. ISMB CoFest organizers Ben Busby, Hervé Ménager, and Iddo Friedberg.

Over 60 people participated in CoFest, with about 50 in person and 10 online (Figure 11 and Figure 12). CoFest participants worked together on nine main projects. A wide range of projects were undertaken, addressing topics such as genome annotation, HiC data analysis, protein function annotation, proteomics annotation, workflows, data processing, databases, and AI/ML approaches. A summary of the CoFest projects is provided in Table 1.

Figure 11. CoFest 2025, which was part of the large ISMB/ECCB conference, attracted many participants.

Figure 12. Some of the CoFest participants working together on their projects.

Table 1. Projects worked on at CoFest 2025.

Project title	Keywords	Links
Improving how we describe and discover Bioinformatics tools	Software, metadata, LLM	https://github.com/edamontology/edammcp
Collaborative development of gRINN for interaction energy based analysis of biomolecular simulation trajectories	Residue interaction, energy-based analysis of protein MD simulation trajectories	https://www.github.com/osercinoglu/grinn-ismb-2025, https://www.github.com/osercinoglu/grinn
Python 3.14 testing	CWL, Python	https://github.com/common-workflow-language/schema_salad/releases/tag/8.9.20250723145140 https://github.com/common-workflow-language/cwl-utils/releases/tag/v0.39 https://github.com/galaxyproject/bioblend/pull/510
README Generation	Software description, LLM	https://github.com/slugb0t/llm-readme-comparisons
Use LinkML in the Comprehensive Antibiotic Resistance Database (CARD)	Antibiotic Resistance Database, LinkML	https://linkml.io/linkml/developers/schemaview.html
schemas.science genAI-assisted training	JSON-LD, metadata, LLMs	https://github.com/PhilReedData/ismb2025cofest
Biopython, Python & GPUs	GPU, Python	https://github.com/rapidsai/cuxfilter https://biopython.org/
iCn3d & Jalview - MSA/Structure	Javascript, 3D, MSA	https://github.com/ncbi/icn3d
Proteomics metadata	Proteomics, metadata, text-mining	https://github.com/SparkyDaBear/Intelligent_MetaData_ISMB_collaboration_fest_2025/
How much funding per protein?	Python, R, Protein, data analysis	https://github.com/anphan0828/funding-per-protein-hackathon
Enhancing protein knowledge curation using GO annotations and LLMs for community literature submissions	Gene Ontology, LLMs	https://github.com/anphan0828/ISMB_CollaborationFest_EBI/
Identifying genomic safe harbor sites in Zebrafish at different development stages, incorporating 3D chromosome data using an integrative genomics approach	Integrative genomics, HiC data	https://github.com/parnaljoshi/gsh-zebrafish-hackathon

Fostering diversity, equity, and inclusion at BOSC

The BOSC organizing committee, which consists entirely of volunteers, devotes substantial effort and resources to making BOSC more diverse and accessible (see our DEI page). We do all we can to spread the word about BOSC to as wide an audience as possible, and we offer financial support to help lower the barriers to participation.

Any author who submits an abstract to BOSC can privately request financial assistance for conference registration fees. This initiative, funded by generous sponsors (see https://www.open-bio.org/events/bosc/sponsors/ and the Acknowledgements below), has enabled us to offer free ISMB registration to dozens of BOSC participants, primarily from underrepresented groups in the field. In 2025, thanks to our sponsors, we provided free registration to 12 presenters, 10 of whom belong to underrepresented groups in our community. BOSC sponsorships also fund the Open Bioinformatics Foundation’s Event Awards, which are aimed at increasing diverse participation at events promoting open-source bioinformatics and open science, including (but not limited to) BOSC.

Our BOSC keynote speaker selection process has been designed to support inclusivity and transparency. In the first phase of the yearly process, we invite the community to submit nominations for potential speakers. Shortlisted candidates are then evaluated using our established Invited Speaker Rubric, followed by a call for community feedback regarding any potential concerns. After reviewing the feedback, we make the final selection, ensuring that our keynote speakers represent a diverse spectrum of perspectives and backgrounds.

Making BOSC both successful and inclusive is a lot of work. Our greatest reward is hearing from participants that BOSC provides a truly welcoming and inspiring home for collaboration. First-time attendee Daniel Korn described his experience as the “greatest highlight of my international journey,” adding that “at every step… I felt included in a community of programmers and academics focused on tackling big problems of biomedical software.” This feeling was echoed by Sebastian Ayala Ruano, who shared that “attending BOSC was an incredible experience. I discovered inspiring research projects and open-source tools, while connecting with a welcoming community.” These stories of connection and inclusion are the reason we do this work; they are the inspiration that ensures we will be back next year.

Figures

All photos in this report are shared under a CC-BY-SA license. All identifiable subjects in the photos were contacted, and they consented to have their photos used in this report.

Acknowledgments

We are grateful to all who helped make BOSC 2025 a success: the organizers, reviewers, volunteers, presenters, attendees, and the generous sponsors who enabled us to offset some meeting expenses and provide free registration to 15 participants. We thank Platinum sponsor Chan-Zuckerberg Initiative, Gold sponsor Seqera, and Silver sponsor GigaScience for their support. Special thanks to Seqera for providing socks!

We thank ISCB staff, particularly Seth Mulholland and Bel Hanson, for helping ISMB/ECCB 2025 run smoothly, even with a record number of attendees.

Hervé Ménager, Iddo Friedberg, and Ben Busby were the main organizers of CoFest. Peter Cock, Núria Queralt Rosinach, Tazro Ohta and Ravi Abrol also participated in the preparation of the CoFest. Thank you to Iddo for providing swag, and to the people who developed the CoFest challenges: Wout Bittremieux, Ian Sitarik, Kunal Suresh, Parnal Joshi, An Phan, and Yvonne Lussi.

Thank you to Peter Cock for reading and commenting on this manuscript.

Comments on this article Comments (0)

Version 1

VERSION 1 PUBLISHED 09 Sep 2025

Author details Author details

¹ E O Lawrence Berkeley National Laboratory, Berkeley, California, 94720, USA
² Smurfit Institute of Genetics Trinity College, Dublin, Leinster, D02 PN40, Ireland
³ BD, Research Triangle Park, NC, 27709, USA
⁴ Bioinformatics and Biostatistics Hub, Institut Pasteur, Paris, 75015, France
⁵ University of Colorado Anschutz Medical Campus, Aurora, Colorado, 80045, USA
⁶ Chiba University, Chiba, 263-8522, Japan
⁷ Swiss Institute of Bioinformatics, Basel, 4051, Switzerland
⁸ Cold Spring Harbor Laboratory, Cold Spring Harbor, New York, 11724, USA

Nomi L. Harris
Roles: Conceptualization, Funding Acquisition, Project Administration, Supervision, Writing – Original Draft Preparation, Writing – Review & Editing

Karsten Hokamp
Roles: Conceptualization, Project Administration, Software, Writing – Review & Editing

Jessica Maia
Roles: Conceptualization, Project Administration

Hervé Ménager
Roles: Conceptualization, Project Administration, Supervision, Writing – Review & Editing

Monia C Munoz-Torres
Roles: Conceptualization, Project Administration, Writing – Review & Editing

Tazro Ohta
Roles: Conceptualization, Project Administration

Deepak Unni
Roles: Conceptualization, Funding Acquisition, Project Administration, Writing – Review & Editing

Jason Williams
Roles: Conceptualization, Project Administration

Competing interests

No competing interests were disclosed.

Grant information

The author(s) declare that no grants were directly involved in supporting BOSC; however, the participation of some of the organizers was funded in part by grants, including the following: NH was supported in part by the Director, Office of Science, Office of Basic Energy Sciences, of the U.S. Department of Energy under Contract No. DE-AC02-05CH11231; JW received support from NSF grants DBI-0735191, DBI-1265383, and DBI-1743442.
The funders had no role in study design, data collection and analysis, decision to publish, or preparation of the manuscript.

Article Versions (1)

version 1

Published: 09 Sep 2025, 14:887

https://doi.org/10.12688/f1000research.169977.1

Copyright

© 2025 Harris NL et al. This is an open access article distributed under the terms of the Creative Commons Attribution License, which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly cited.

Download

Export To

metrics

	Views	Downloads
F1000Research	-	-
PubMed Central Data from PMC are received and updated monthly.	-	-

Citations

0

SEE MORE DETAILS

CITE

how to cite this article

Harris NL, Hokamp K, Maia J et al. BOSC 2025, the 26th Bioinformatics Open Source Conference [version 1; peer review: not peer reviewed]. F1000Research 2025, 14:887 (https://doi.org/10.12688/f1000research.169977.1)

NOTE: If applicable, it is important to ensure the information in square brackets after the title is included in all citations of this article.

track

receive updates on this article

Track an article to receive email alerts on any updates to this article.

Open Peer Review

Comments on this article Comments (0)

Version 1

VERSION 1 PUBLISHED 09 Sep 2025

Not Peer Reviewed

This article is an Editorial and has not been subject to external peer review.

Comments on this article

All Comments(0)

Add a comment

Sign up for content alerts

BOSC 2025, the 26th Bioinformatics Open Source Conference

Abstract

Keywords

Introduction

Figure 1. Members of the audience at BOSC 2025 in Liverpool, UK.

Figure 2. 7/8 of the BOSC 2025 organizing committee.

Conference Program

Opening session

Figure 3. Peter Cock, president of the Open Bioinformatics Foundation, presented an OBF update.

Figure 4. Michael Crusoe presented a tribute to CWL co-founder Peter Amstutz, who sadly passed away shortly before the conference.

Keynotes

Figure 5. BOSC 2025 keynote speaker Christine Orengo.

Figure 6. Joint BOSC/BOKR keynote speaker Chris Mungall.

Talks and posters

Figure 7. Speakers at BOSC 2025 included Susheel Varma, Favour James, and Guy Cochrane.

Figure 8. Poster presenters Maggie Fu, Dorian Portillo, and Carole Goble share their work with other participants.

Beyond the archive: Forging a sustainable future for scientific data

Figure 9. The panel on data sustainability included (from left to right) moderator Mónica Muñoz Torres and panelists Scott Edmunds, Chris Mungall, Varsha Khodiyar, Tony Burdett, and Nicky Mulder.

CollaborationFest

Figure 10. ISMB CoFest organizers Ben Busby, Hervé Ménager, and Iddo Friedberg.

Figure 11. CoFest 2025, which was part of the large ISMB/ECCB conference, attracted many participants.

Figure 12. Some of the CoFest participants working together on their projects.

Table 1. Projects worked on at CoFest 2025.

Fostering diversity, equity, and inclusion at BOSC

Figures

Acknowledgments

Comments on this article Comments (0)

Open Peer Review

Comments on this article Comments (0)

Not Peer Reviewed

Comments on this article

Browse by related subjects

Competing Interests Policy

Stay Updated