Q-SPARC: An Interactive Chatbot for Exploring SPARC SCKAN Connectivity with Flatmap Visualization

Huayan Zeng; Dan Zhang; Matthew French; Fangqiang Xu; Yun Gu

doi:10.12688/f1000research.178101.1

Home Browse Q-SPARC: An Interactive Chatbot for Exploring SPARC SCKAN Connectivity...

ALL Metrics

-

Views

-

Downloads

Get PDF

Get XML

Export

▬

✚

Software Tool Article

Q-SPARC: An Interactive Chatbot for Exploring SPARC SCKAN Connectivity with Flatmap Visualization

[version 1; peer review: awaiting peer review]

Huayan Zeng¹, Dan Zhang¹, Matthew French¹, Fangqiang Xu¹, Yun Gu ¹

Huayan Zeng¹, Dan Zhang¹, [...] Matthew French¹, Fangqiang Xu¹, Yun Gu ¹

PUBLISHED 18 Jun 2026

Author details Author details

¹ The University of Auckland Auckland Bioengineering Institute, Auckland, 1142, New Zealand

Huayan Zeng
Roles: Data Curation, Formal Analysis, Writing – Original Draft Preparation

Dan Zhang
Roles: Methodology, Writing – Review & Editing

Matthew French
Roles: Conceptualization, Project Administration, Visualization

Fangqiang Xu
Roles: Conceptualization, Project Administration, Software, Validation

Yun Gu
Roles: Investigation, Supervision, Writing – Review & Editing

OPEN PEER REVIEW

REVIEWER STATUS AWAITING PEER REVIEW

This article is included in the Software and Hardware Engineering gateway.

Abstract

Background

The SPARC program (SPARC Portal, RRID:SCR_017041; https://sparc.science) aggregates anatomy and connectivity knowledge across species. The SCKAN database (RRID:SCR_026088) provides structured connectivity relationships and an associated Natural Language Interface (NLI). However, the NLI currently supports only single-turn querying, lacks conversational memory, and does not integrate Flatmap visualization.

Methods

We developed Q-SPARC—a Python-based conversational system that integrates local or cloud-hosted LLMs (default: Qwen2.5-72B with optional GPT-4 support) with semantic retrieval, reranking, and Flatmap visualization.

Results

Users can submit queries such as “What are the input sources of the heart?” and receive a narrative summary, structured tables, and Flatmap anatomical diagrams. The system supports multi-turn conversational memory, allowing follow-up refinement and context- dependent queries.

Conclusions

Q-SPARC extends the SPARC ecosystem by enabling conversational exploration of SCKAN connectivity, integrating visualization, and improving usability and FAIRness.

Keywords

SCKAN; SPARC; Chatbot; Flatmap; Large language model; FAIR; Anatomical visualization

Corresponding authors: Fangqiang Xu, Yun Gu

Competing interests: No competing interests were disclosed.

Grant information: This article was supported by NIH Common Fund’s 2025 SPARC FAIR Codeathon.
The funders had no role in study design, data collection and analysis, decision to publish, or preparation of the manuscript.

Copyright: © 2026 Zeng H et al. This is an open access article distributed under the terms of the Creative Commons Attribution License, which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly cited.

How to cite: Zeng H, Zhang D, French M et al. Q-SPARC: An Interactive Chatbot for Exploring SPARC SCKAN Connectivity with Flatmap Visualization [version 1; peer review: awaiting peer review]. F1000Research 2026, 15:977 (https://doi.org/10.12688/f1000research.178101.1) First published: 18 Jun 2026, 15:977 (https://doi.org/10.12688/f1000research.178101.1) Latest published: 18 Jun 2026, 15:977 (https://doi.org/10.12688/f1000research.178101.1)

Introduction

The SPARC initiative consolidates connectivity and anatomical data across species to accelerate neuromodulation research and related applications,¹ and is made accessible through the online SPARC Portal (SPARC, RRID:SCR_017041; https://sparc.science). Within this ecosystem, the SPARC Knowledge Graph includes the SCKAN database and its Natural Language Interface (NLI), which together allow users to query connectivity relationships between organs, nerves, and ganglia.² SCKAN itself is registered as SCKAN (RRID:SCR_026088) and exposes curated connectivity relationships that can be reused across tools in the SPARC ecosystem. Q-SPARC is further listed as a resource on the SPARC Tools and Resources page (https://sparc.science/tools-and-resources/4A4tJH8PCbsrINgIlcH4ef), providing an official entry point for users to discover the tool.

Despite these strengths, the current SCKAN NLI exhibits several limitations that hinder its usability for researchers and educators.² First, lack of multi-turn interaction: the platform currently supports only single-turn queries, preventing the accumulation of conversational context across interactions. This restriction reduces the depth and continuity of exploratory analysis, making it difficult for users to build upon prior results or maintain a coherent line of inquiry over time. Second, high latency in sequential queries: response delays disrupt the flow of sequential queries, undermining the efficiency of iterative workflows. Such latency is particularly problematic when researchers require rapid and adaptive questioning to refine or validate emerging hypotheses. Third, absence of spatial visualization: the lack of integrated Flatmap-based anatomical visualization limits the intuitive interpretation of spatial relationships in connectivity data. Without visual support, users face greater challenges in contextualizing anatomical insights within broader structural or functional frameworks.¹ Fourth, restricted output formats: results are returned only as unstructured text, with no accompanying tabular or machine-readable formats such as CSV or JSON. This limitation constrains downstream computational processing, automated analysis, and integration with external analytical pipelines. Finally, insufficient FAIR alignment: the absence of persistent conversation history and weak integration with FAIR principles (Findable, Accessible, Interoperable, Reusable)³ reduces the platform’s capacity for reproducible, shareable, and interoperable research. These gaps hinder collaborative work and diminish the long-term reusability of outputs.

These limitations highlight the need for a more interactive, context-aware, and visualization-enabled interface for SCKAN connectivity exploration. Q-SPARC, a Python-based LLM-powered interface that layers retrieval-augmented generation and Flatmap visualization on top of SCKAN, addresses these gaps by enabling multi-turn conversational access, structured output generation, and integration with Flatmap anatomical visualization, while maintaining compatibility with the FAIR principles that underpin SPARC resources and the broader SPARC Portal ecosystem.

Methods

Implementation

1. Overview of our solution

Q-SPARC integrates an LLM-powered conversational interface with a semantic indexing and retrieval pipeline,⁴^,⁵^,⁶ enabling users to submit natural-language queries and receive both narrative and structured outputs. The system supports multi-turn dialogue, maintaining conversational memory for context-aware reasoning and allowing users to build on prior queries. It also incorporates Flatmap visualization for anatomical context.

To clarify the model used in the implementation, Q-SPARC supports both local and cloud-hosted LLMs. In the hackathon prototype, we used Qwen2.5-72B as the default LLM, while GPT-4 and lighter-weight open-source models were also compatible in testing. This flexibility ensures adaptability across computational environments.

The tool is built on a modular architecture that separates query understanding, data retrieval, and visualization. This separation facilitates maintenance, scalability, and integration with other SPARC resources. To improve responsiveness, token and document flows are separated, asynchronous processing is applied, and local embedding caching minimizes repeated inference—together accelerating sequential queries without compromising reproducibility.

2. System architecture

Q-SPARC is implemented as a modular pipeline composed of multiple interconnected components, shown in Figure 1. The workflow begins when the user enters a natural language query into the input box. The query is processed by the Query Understanding LLM, followed by two-stage retrieval (embedding-based and reranking) from a local database. Relevant chunks are passed to the Reader LLM, which generates answers in both text and structured formats (JSON, CSV, TTL). The results can be displayed as text, tables, and Flatmap-based anatomical diagrams. Each module plays a specific role in transforming a natural language query into structured answers and visualizations.

• Interface: The process begins when the user enters a prompt into the input box on the web interface and clicks the submit button. The interface is designed to display three possible outputs: (1) a natural language text response, (2) a structured table, and (3) an optional Flatmap-based anatomical diagram.
• Query Understanding (LLM): The submitted query is processed by a local or server-hosted Large Language Model (LLM) responsible for interpreting the question and generating an internal search representation.
• First Retrieval (Embedding): The interpreted query is vectorized and matched against a local database of SCKAN knowledge using semantic embeddings. This first retrieval stage selects an initial set of candidate knowledge chunks.
• Second Retrieval (Reranking): The candidate chunks are reranked based on relevance, using additional scoring methods to ensure that the most relevant items are prioritized for the next stage.
• Reader (LLM): The top-ranked chunks are passed to a second LLM (Reader) which synthesizes the final answer, combining retrieved knowledge with reasoning capabilities. The Reader can produce both free-text explanations and structured outputs.
• Structured Output Formats: The system supports JSON, CSV, and TTL formats, ensuring that responses are interoperable with other tools and data pipelines.
• Visualization Adapter (Flatmap): When applicable, anatomical context is provided via Flatmap visualization, allowing users to see spatial relationships between structures described in the answer.
• Local Server and Data Flow: All processing can be run locally. Token flow and document flow, as shown in Figure 1, are separated to optimize efficiency and maintain modularity.

Figure 1. Q-SPARC system architecture.

Overview of the Q-SPARC interactive chatbot framework integrating SPARC SCKAN connectivity with flatmap-based visualization. The system combines natural language input, knowledge graph querying and anatomical flatmap rendering to enable interactive exploration of neural connectivity.

Operation

General use:

Q-SPARC can be run locally or deployed on a server. The system requires Python 3.x and the dependencies listed in the accompanying requirements file. A containerized configuration is provided for reproducibility.

Using Q-SPARC:

A typical workflow involves:

1. Starting the backend service to handle data retrieval and processing.
2. Launching the frontend interface in a browser.
3. Entering a natural-language query, for example, “What are the input sources of the heart?”.
4. Viewing the outputs, which may include:
- • a narrative text summary;
- • a structured results table;
- • an optional Flatmap anatomical visualization.

Tutorials:

The software is accompanied by a complete tutorial set that guides the user from installation through to advanced use. The tutorials cover:

• Installing dependencies and setting up the Python environment.
• Starting the backend and frontend components.
• Understanding the two-stage retrieval process.
• Generating and interpreting Flatmap visualizations.

Reproducibility:

All source code, documentation, and example data are distributed under an open-source license. The modular design allows adaptation for integration with other SPARC tools and datasets.

Author contributions

HZ: Data curation, Formal analysis, Writing – original draft.

DZ: Methodology, Writing – review & editing.

FX: Conceptualization, Software development, Validation, Project administration– review & editing.

MF: Conceptualization, Visualization, Project administration– review & editing.

YG: Supervision, Investigation, Writing – review & editing.

Data availability

The Q-SPARC software tool is publicly available at:

• Source code available from: https://github.com/greeyun/Q-SPARC
• Archived software available from: https://doi.org/10.5281/zenodo.18690270
• SPARC Tools and Resources listing: https://sparc.science/tools-and-resources/4A4tJH8PCbsrINgIlcH4ef
• License: Apache-2.0

The software is platform-independent and tested on Linux, macOS, and Windows. A container configuration is provided for reproducible deployment.

All data, examples, and documentation are released under the Apache-2.0 license.

Acknowledgements

This research was supported by the NIH Common Fund’s 2025 SPARC FAIR Codeathon, https://sparc.science/news-and-events/events/2025-sparc-fair-codeathon . We thank the SPARC FAIR Codeathon organizers and the SPARC community for their support. We also acknowledge contributors and maintainers of related SPARC ecosystem tools and Flatmap resources.

References

1. Osanlouy M, Bandrowski A, de Bono B , et al.: The sparc drc: building a resource for the autonomic nervous system community. Front. Physiol. 2021; 12: 693735. PubMed Abstract | Publisher Full Text | Free Full Text
2. Imam FT, Gillespie TH, Ziogas I, et al.: Developing a multiscale neural connectivity knowledgebase of the autonomic nervous system. Front. Neuroinform. 2025; 19: 1541184. PubMed Abstract | Publisher Full Text | Free Full Text
3. Wilkinson MD, Dumontier M, Aalbersberg IJJ, et al.: The fair guiding principles for scientific data management and stewardship. Sci. Data. 2016; 3(1): 1–9.
4. Lewis P, Perez E, Piktus A, et al.: Retrieval-augmented generation for knowledge-intensive nlp tasks. Adv. Neural Inf. Proces. Syst. 2020; 33: 9459–9474.
5. Yang A, Li A, Yang B, et al.: Qwen3 technical report. arXiv preprint arXiv:2505.09388. 2025.
6. Achiam J, Adler S, Agarwal S, et al.: Gpt-4 technical report. arXiv preprint arXiv:2303.08774. 2023.

Comments on this article Comments (0)

Version 1

VERSION 1 PUBLISHED 18 Jun 2026

Author details Author details

¹ The University of Auckland Auckland Bioengineering Institute, Auckland, 1142, New Zealand

Huayan Zeng
Roles: Data Curation, Formal Analysis, Writing – Original Draft Preparation

Dan Zhang
Roles: Methodology, Writing – Review & Editing

Matthew French
Roles: Conceptualization, Project Administration, Visualization

Fangqiang Xu
Roles: Conceptualization, Project Administration, Software, Validation

Yun Gu
Roles: Investigation, Supervision, Writing – Review & Editing

Competing interests

No competing interests were disclosed.

Grant information

This article was supported by NIH Common Fund’s 2025 SPARC FAIR Codeathon.
The funders had no role in study design, data collection and analysis, decision to publish, or preparation of the manuscript.

Article Versions (1)

version 1

Published: 18 Jun 2026, 15:977

https://doi.org/10.12688/f1000research.178101.1

Copyright

© 2026 Zeng H et al. This is an open access article distributed under the terms of the Creative Commons Attribution License, which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly cited.

Download

Export To

metrics

	Views	Downloads
F1000Research	-	-
PubMed Central Data from PMC are received and updated monthly.	-	-

Citations

0

SEE MORE DETAILS

CITE

how to cite this article

Zeng H, Zhang D, French M et al. Q-SPARC: An Interactive Chatbot for Exploring SPARC SCKAN Connectivity with Flatmap Visualization [version 1; peer review: awaiting peer review]. F1000Research 2026, 15:977 (https://doi.org/10.12688/f1000research.178101.1)

NOTE: If applicable, it is important to ensure the information in square brackets after the title is included in all citations of this article.

track

receive updates on this article

Track an article to receive email alerts on any updates to this article.

Open Peer Review

Comments on this article Comments (0)

Version 1

VERSION 1 PUBLISHED 18 Jun 2026

Open Peer Review

Reviewer Status

AWAITING PEER REVIEW

Comments on this article

All Comments(0)

Add a comment

Sign up for content alerts

Browse by related subjects

[1] 1. Osanlouy M, Bandrowski A, de Bono B , et al.: The sparc drc: building a resource for the autonomic nervous system community. Front. Physiol. 2021; 12: 693735. PubMed Abstract | Publisher Full Text | Free Full Text

[2] 2. Imam FT, Gillespie TH, Ziogas I, et al.: Developing a multiscale neural connectivity knowledgebase of the autonomic nervous system. Front. Neuroinform. 2025; 19: 1541184. PubMed Abstract | Publisher Full Text | Free Full Text

[3] 3. Wilkinson MD, Dumontier M, Aalbersberg IJJ, et al.: The fair guiding principles for scientific data management and stewardship. Sci. Data. 2016; 3(1): 1–9.

[4] 4. Lewis P, Perez E, Piktus A, et al.: Retrieval-augmented generation for knowledge-intensive nlp tasks. Adv. Neural Inf. Proces. Syst. 2020; 33: 9459–9474.

[5] 5. Yang A, Li A, Yang B, et al.: Qwen3 technical report. arXiv preprint arXiv:2505.09388. 2025.

[6] 6. Achiam J, Adler S, Agarwal S, et al.: Gpt-4 technical report. arXiv preprint arXiv:2303.08774. 2023.

Q-SPARC: An Interactive Chatbot for Exploring SPARC SCKAN Connectivity with Flatmap Visualization

Abstract

Background

Methods

Results

Conclusions

Keywords

Introduction

Methods

Implementation

Figure 1. Q-SPARC system architecture.

Operation

Author contributions

Data availability

Acknowledgements

References

Comments on this article Comments (0)

Open Peer Review

Comments on this article Comments (0)

Open Peer Review

Reviewer Status

Comments on this article

Browse by related subjects

Competing Interests Policy

Stay Updated