ALL Metrics
-
Views
-
Downloads
Get PDF
Get XML
Cite
Export
Track
Editorial

BOSC 2025, the 26th Bioinformatics Open Source Conference

[version 1; peer review: not peer reviewed]
PUBLISHED 09 Sep 2025
Author details Author details
OPEN PEER REVIEW
NOT PEER REVIEWED

This article is included in the Bioinformatics Open Source Conference (BOSC) collection.

Abstract

The 26th annual Bioinformatics Open Source Conference (BOSC 2025, open-bio.org/events/bosc-2025) brought its community-driven focus on open-source bioinformatics and open science to the 2025 conference on Intelligent Systems for Molecular Biology and the European Conference on Computational Biology (ISMB/ECCB 2025). Since its launch in 2000, BOSC has been the premier annual meeting covering open-source bioinformatics and open science.

Framed by two keynote addresses and a thought-provoking panel discussion, the two-day conference included sessions dedicated to open data, analytic tools and pipelines, workflow platforms, knowledge representation, and the application of AI/ML.

The first keynote talk was delivered by Christine Orengo: “Working together to develop, promote and protect our data resources: Lessons learnt developing CATH and TED.” A joint session with the Bio-Ontologies and Knowledge Representation (BOKR) track the second day of BOSC started with a keynote talk by Chris Mungall entitled “Open Knowledge Bases in the Age of Generative AI”. A closing panel on Data Sustainability, moderated by Mónica Muñoz Torres, featured panelists Scott Edmunds, Varsha Khodiyar, Tony Burdett, Nicky Mulder, and Chris Mungall.

This year, the CollaborationFest collaborative work event that typically precedes or follows ISMB was incorporated as part of the main conference and organized by BOSC with help from the Function and 3D-SIG tracks.

Keywords

bioinformatics; open source; open science; open data; data sustainability

Introduction

The 26th annual Bioinformatics Open Source Conference (BOSC 2025, open-bio.org/events/bosc-2025) took place in Liverpool, UK, as part of the 2025 conference on Intelligent Systems for Molecular Biology and the European Conference on Computational Biology (ISMB/ECCB 2025). Total in-person attendance at ISMB/ECCB 2025 was a record-breaking 2283, with an additional 312 online participants. Attendance at BOSC peaked at 283 in-person attendees (Figure 1), with additional online participants.

401a464d-60bf-4d5a-9adb-6870becd513f_figure1.gif

Figure 1. Members of the audience at BOSC 2025 in Liverpool, UK.

BOSC 2025 included two full days of talks and posters covering a wide range of topics in open science and open source bioinformatics, with two keynote talks and a panel discussion. The second day of BOSC was a joint session with the Bio-Ontologies and Knowledge Representation (BOKR) track.

Every year since 2000, the Bioinformatics Open Source Conference (BOSC) has been the key annual meeting for people involved in some aspect of open source or open science in bioinformatics. Last year’s conference, BOSC 2024, was the 25th BOSC, but this year marked 25 years since the first BOSC in 2000, so we took advantage of the off-by-one confusion (all too familiar to bioinformaticists) to celebrate our 25th birthday again at BOSC 2025.

BOSC is entirely volunteer run. Nomi Harris, a program manager at the Lawrence Berkeley National Laboratory, has led BOSC since 2012. The organizing committee (Figure 2) and review committee ensured a rigorous abstract selection process, with each submission receiving three independent reviews. BOSC’s commitment to transparency extends to its review process and rubric, documented and publicly available since 2020.

401a464d-60bf-4d5a-9adb-6870becd513f_figure2.gif

Figure 2. 7/8 of the BOSC 2025 organizing committee.

From left: Jason Williams, Jessica Maia, Nomi Harris, Deepak Unni, Mónica Muñoz Torres, Karsten Hokamp, Hervé Ménager (missing: Tazro Ohta).

Conference Program

Opening session

The conference began with a welcome from chair Nomi Harris, followed by an overview of BOSC’s parent organization, the Open Bioinformatics Foundation, by OBF President Peter Cock (Figure 3). The opening session closed with a tribute to Peter Amstutz, a very active member of the bioinformatics open-source community who passed away recently. Peter contributed his energy and vision to many BOSCs and CollaborationFests, and co-founded the widely used Common Workflow Language (CWL), which was conceived at the BOSC 2014 CoFest. The tribute was presented by Michael Crusoe, Peter’s collaborator and co-founder of CWL (Figure 4).

401a464d-60bf-4d5a-9adb-6870becd513f_figure3.gif

Figure 3. Peter Cock, president of the Open Bioinformatics Foundation, presented an OBF update.

401a464d-60bf-4d5a-9adb-6870becd513f_figure4.gif

Figure 4. Michael Crusoe presented a tribute to CWL co-founder Peter Amstutz, who sadly passed away shortly before the conference.

Keynotes

The first BOSC 2025 keynote talk was delivered by Dr. Christine Orengo: “Working together to develop, promote and protect our data resources: Lessons learnt developing CATH and TED” (Figure 5). Dr. Orengo is the Vice President of the ISCB and was previously the ISCB’s first female President. Dr. Orengo’s talk covered the history of CATH (which is celebrating its 30th anniversary) and the insights CATH and TED provided about protein structure and function, while also providing a model of how to bring the community together to sustain a resource. Interestingly, this involved working together with competitors to improve each other’s approaches.

401a464d-60bf-4d5a-9adb-6870becd513f_figure5.gif

Figure 5. BOSC 2025 keynote speaker Christine Orengo.

The joint BOSC/BOKR session was kicked off by keynote speaker Chris Mungall (Figure 6), who wove together his work on agentic AI, ontologies and knowledge bases to describe a “future of bioinformatics” that is in fact happening right now. While we have heard a great deal on the limits of LLMs, some of Dr. Mungall’s latest work examines the potential of agentic AI as an important aid for biocuration. As the number of tools grows, an under-recognized challenge will be evaluating and benchmarking the utility and reproducibility of these tools. One strategy for examining AI agent work benefits from the long history of provenance around curator maintenance of ontologies. Dating back to pre-GitHub times—circa 2002—this evidence trail allows evaluation work to directly compare how AI decisions are compared with the work of human curators. The talk ended with Dr. Mungall’s reflections on AI ethics, particularly related to circumstances where an AI might be tasked with conflicting policy directives.

401a464d-60bf-4d5a-9adb-6870becd513f_figure6.gif

Figure 6. Joint BOSC/BOKR keynote speaker Chris Mungall.

Talks and posters

In addition to the two keynotes, BOSC 2025 included 35 talks chosen from submitted abstracts (Figure 7) and 47 posters presented in person or online (Figure 8). This year’s session topics included open science, open data, analytic tools and pipelines, workflow platforms, knowledge representation, and AI/ML. The complete lists of talks and posters can be found at https://www.open-bio.org/events/bosc-2025/bosc-2025-schedule/.

401a464d-60bf-4d5a-9adb-6870becd513f_figure7.gif

Figure 7. Speakers at BOSC 2025 included Susheel Varma, Favour James, and Guy Cochrane.

401a464d-60bf-4d5a-9adb-6870becd513f_figure8.gif

Figure 8. Poster presenters Maggie Fu, Dorian Portillo, and Carole Goble share their work with other participants.

The first talk session covered topics related to open science, including code sharing practices, a case study on how open science is advancing bioinformatics in a low-resource region (Nigeria), and guidelines for FAIR research. The next two sessions focused on some of the core areas in open-source bioinformatics development, including tools and pipelines for data analysis, workflow systems, and developer toolkits. Day 1 closed with a session on open data which highlighted the need for AI-readiness and interoperability.

Day 2, which was combined with BOKR, included a session on AI/ML, with an emphasis on open approaches, sharing of models, and reusability. The next session, which focused on ontologies and knowledge representation, again included a lot of AI. The session was bookended by talks about two of the most popular biomedical ontology resources, BioPortal and AberOWL. Next, a short talk on the global biodata infrastructure by Guy Cochrane led directly into the closing panel on Data Sustainability.

Beyond the archive: Forging a sustainable future for scientific data

Our panel on Data Sustainability (Figure 9) tackled a critical question facing modern science: How do we ensure that the vast digital knowledge we generate today remains accessible, reliable, and valuable for generations to come? We defined data sustainability as a proactive approach to keeping bioinformatics data FAIR (Findable, Accessible, Interoperable, Reusable), ethically managed, and secure through robust infrastructure, funding, expertise, and governance.

401a464d-60bf-4d5a-9adb-6870becd513f_figure9.gif

Figure 9. The panel on data sustainability included (from left to right) moderator Mónica Muñoz Torres and panelists Scott Edmunds, Chris Mungall, Varsha Khodiyar, Tony Burdett, and Nicky Mulder.

The discussion quickly moved beyond principles to practical realities. Panelists agreed that while foundational guidelines like FAIR and CARE are essential, they are not a complete solution. They can introduce costs, and don’t inherently solve the complex challenges of governance and long-term funding. True sustainability cannot be achieved with a simple checklist; it requires building community-driven standards and frameworks, such as those developed by the Global Alliance for Genomics and Health (GA4GH), to ensure responsible data sharing.

A powerful theme emerged from the discussion: sustainability is fundamentally about people, not just data. The conversation repeatedly highlighted the urgent need to formally recognize, train, and establish career paths for the data stewards, biocurators, and bioethicists who are the guardians of our collective knowledge. As one panelist noted, we should invest less in creating perfectly static archives and more in cultivating the networks of people who can care for datasets, allowing them to evolve and grow over time.

Ultimately, the panel concluded that ensuring the future of our data requires a cultural shift. It means moving beyond simply asking for more funding and toward building resilient, adaptable ecosystems. It demands that we value the human expertise that underpins our data infrastructure and work together to face a landscape where data is not only at risk of being lost, but also of being devalued or attacked. Our collective digital legacy depends on it.

CollaborationFest

BOSC has held CollaborationFests (aka CoFests) every year before or after ISMB since 2010. CoFests are a two-day collaborative work event at which participants work together to contribute code, documentation, training materials, and challenging analysis problems and use cases. This year, we decided to hold the CollaborationFest as part of ISMB/ECCB and open it to all registered ISMB/ECCB participants. The co-organizers were Hervé Ménager (BOSC), Iddo Friedberg (Function COSI), and Ben Busby (Figure 10).

401a464d-60bf-4d5a-9adb-6870becd513f_figure10.gif

Figure 10. ISMB CoFest organizers Ben Busby, Hervé Ménager, and Iddo Friedberg.

Over 60 people participated in CoFest, with about 50 in person and 10 online (Figure 11 and Figure 12). CoFest participants worked together on nine main projects. A wide range of projects were undertaken, addressing topics such as genome annotation, HiC data analysis, protein function annotation, proteomics annotation, workflows, data processing, databases, and AI/ML approaches. A summary of the CoFest projects is provided in Table 1.

401a464d-60bf-4d5a-9adb-6870becd513f_figure11.gif

Figure 11. CoFest 2025, which was part of the large ISMB/ECCB conference, attracted many participants.

401a464d-60bf-4d5a-9adb-6870becd513f_figure12.gif

Figure 12. Some of the CoFest participants working together on their projects.

Table 1. Projects worked on at CoFest 2025.

Project titleKeywordsLinks
Improving how we describe and discover Bioinformatics toolsSoftware, metadata, LLMhttps://github.com/edamontology/edammcp
Collaborative development of gRINN for interaction energy based analysis of biomolecular simulation trajectoriesResidue interaction, energy-based analysis of protein MD simulation trajectorieshttps://www.github.com/osercinoglu/grinn-ismb-2025, https://www.github.com/osercinoglu/grinn
Python 3.14 testingCWL, Pythonhttps://github.com/common-workflow-language/schema_salad/releases/tag/8.9.20250723145140 https://github.com/common-workflow-language/cwl-utils/releases/tag/v0.39 https://github.com/galaxyproject/bioblend/pull/510
README GenerationSoftware description, LLMhttps://github.com/slugb0t/llm-readme-comparisons
Use LinkML in the Comprehensive Antibiotic Resistance Database (CARD)Antibiotic Resistance Database, LinkMLhttps://linkml.io/linkml/developers/schemaview.html
schemas.science genAI-assisted trainingJSON-LD, metadata, LLMshttps://github.com/PhilReedData/ismb2025cofest
Biopython, Python & GPUsGPU, Pythonhttps://github.com/rapidsai/cuxfilter
https://biopython.org/
iCn3d & Jalview - MSA/StructureJavascript, 3D, MSAhttps://github.com/ncbi/icn3d
Proteomics metadataProteomics, metadata, text-mining https://github.com/SparkyDaBear/Intelligent_MetaData_ISMB_collaboration_fest_2025/
How much funding per protein?Python, R, Protein, data analysishttps://github.com/anphan0828/funding-per-protein-hackathon
Enhancing protein knowledge curation using GO annotations and LLMs for community literature submissionsGene Ontology, LLMshttps://github.com/anphan0828/ISMB_CollaborationFest_EBI/
Identifying genomic safe harbor sites in Zebrafish at different development stages, incorporating 3D chromosome data using an integrative genomics approachIntegrative genomics, HiC datahttps://github.com/parnaljoshi/gsh-zebrafish-hackathon

Fostering diversity, equity, and inclusion at BOSC

The BOSC organizing committee, which consists entirely of volunteers, devotes substantial effort and resources to making BOSC more diverse and accessible (see our DEI page). We do all we can to spread the word about BOSC to as wide an audience as possible, and we offer financial support to help lower the barriers to participation.

Any author who submits an abstract to BOSC can privately request financial assistance for conference registration fees. This initiative, funded by generous sponsors (see https://www.open-bio.org/events/bosc/sponsors/ and the Acknowledgements below), has enabled us to offer free ISMB registration to dozens of BOSC participants, primarily from underrepresented groups in the field. In 2025, thanks to our sponsors, we provided free registration to 12 presenters, 10 of whom belong to underrepresented groups in our community. BOSC sponsorships also fund the Open Bioinformatics Foundation’s Event Awards, which are aimed at increasing diverse participation at events promoting open-source bioinformatics and open science, including (but not limited to) BOSC.

Our BOSC keynote speaker selection process has been designed to support inclusivity and transparency. In the first phase of the yearly process, we invite the community to submit nominations for potential speakers. Shortlisted candidates are then evaluated using our established Invited Speaker Rubric, followed by a call for community feedback regarding any potential concerns. After reviewing the feedback, we make the final selection, ensuring that our keynote speakers represent a diverse spectrum of perspectives and backgrounds.

Making BOSC both successful and inclusive is a lot of work. Our greatest reward is hearing from participants that BOSC provides a truly welcoming and inspiring home for collaboration. First-time attendee Daniel Korn described his experience as the “greatest highlight of my international journey,” adding that “at every step… I felt included in a community of programmers and academics focused on tackling big problems of biomedical software.” This feeling was echoed by Sebastian Ayala Ruano, who shared that “attending BOSC was an incredible experience. I discovered inspiring research projects and open-source tools, while connecting with a welcoming community.” These stories of connection and inclusion are the reason we do this work; they are the inspiration that ensures we will be back next year.

Figures

All photos in this report are shared under a CC-BY-SA license. All identifiable subjects in the photos were contacted, and they consented to have their photos used in this report.

Comments on this article Comments (0)

Version 1
VERSION 1 PUBLISHED 09 Sep 2025
Comment
Author details Author details
Competing interests
Grant information
Copyright
Download
 
Export To
metrics
Views Downloads
F1000Research - -
PubMed Central
Data from PMC are received and updated monthly.
- -
Citations
CITE
how to cite this article
Harris NL, Hokamp K, Maia J et al. BOSC 2025, the 26th Bioinformatics Open Source Conference [version 1; peer review: not peer reviewed]. F1000Research 2025, 14:887 (https://doi.org/10.12688/f1000research.169977.1)
NOTE: If applicable, it is important to ensure the information in square brackets after the title is included in all citations of this article.
track
receive updates on this article
Track an article to receive email alerts on any updates to this article.

Open Peer Review

Current Reviewer Status:
NOT PEER REVIEWED
NOT PEER REVIEWED

Comments on this article Comments (0)

Version 1
VERSION 1 PUBLISHED 09 Sep 2025
Comment
Alongside their report, reviewers assign a status to the article:
Approved - the paper is scientifically sound in its current form and only minor, if any, improvements are suggested
Approved with reservations - A number of small changes, sometimes more significant revisions are required to address specific details and improve the papers academic merit.
Not approved - fundamental flaws in the paper seriously undermine the findings and conclusions
Sign In
If you've forgotten your password, please enter your email address below and we'll send you instructions on how to reset your password.

The email address should be the one you originally registered with F1000.

Email address not valid, please try again

You registered with F1000 via Google, so we cannot reset your password.

To sign in, please click here.

If you still need help with your Google account password, please click here.

You registered with F1000 via Facebook, so we cannot reset your password.

To sign in, please click here.

If you still need help with your Facebook account password, please click here.

Code not correct, please try again
Email us for further assistance.
Server error, please try again.