ALL Metrics
-
Views
-
Downloads
Get PDF
Get XML
Cite
Export
Track
Research Article

Campus air quality dataset

[version 1; peer review: 1 approved with reservations, 1 not approved]
PUBLISHED 03 Apr 2025
Author details Author details
OPEN PEER REVIEW
REVIEWER STATUS

This article is included in the Data: Use and Reuse collection.

Abstract

Background

Elevated levels of carbon dioxide (CO2) within academic settings can adversely affect the health and academic efficacy of both students and faculty. High concentrations of CO2 are correlated with reduced cognitive functioning, compromised decision-making abilities, diminished academic performance, and various health-related issues. The escalating apprehensions regarding the detrimental health consequences of air pollution have precipitated an increase in research focused on air quality assessment and amelioration.

Method

The investigation utilized Internet of Things (IoT) devices that were outfitted with sensors to gather data on various environmental parameters, such as temperature, humidity, CO2 concentrations, and light intensity. This data underwent analysis through the application of summary statistics to delineate the dataset and to visualize the distribution of variables via scatter matrix plots.

Result

The dataset obtained, which encompasses essential air quality and environmental parameters, is now accessible to the public through the Mendeley repository. The analytical findings illuminated significant characteristics of the data concerning CO2 levels and their prospective ramifications on the academic milieu.

Conclusion

The amalgamation of IoT technology with summary statistical analysis presents a promising methodology for the real-time surveillance of air quality. This approach yields critical insights into the health and academic ramifications of heightened CO2 levels within educational environments, underscoring the necessity for ongoing air quality monitoring to enhance campus conditions.

Keywords

Air Quality Monitoring; Artificial Intelligence of Things; Carbon dioxide Level Management; Environmental Monitoring

Value of the data

  • To the research community dataset holds considerable importance for scholars owing to its proximity to various pollution sources, potential ramifications for public health, implications for urban development, function as a venue for experimental interventions, ramifications for educational and policy frameworks, and the promotion of interdisciplinary cooperation. The dataset promotes collaborative efforts across disciplines among experts, thereby contributing to a comprehensive approach to addressing air quality challenges and sustainability within the campus environment.

  • The dataset benefits various groups and individuals, Campus facilities managers, Environmental researchers, Health and safety officers, Urban planners and architects, Students and educators, and public awareness and advocacy groups

  • The dataset can be employed for advanced insights and comprehensive analysis through methodologies such as trend analysis, spatial mapping, source apportionment, health impact assessment, correlation analysis, intervention evaluation, scenario modeling, and comparative studies. These methodologies empower researchers to discern trends, spatial distributions, sources of pollution, health ramifications, correlations with external variables, as well as the efficacy of interventions. Through the implementation of these analytical techniques, scholars can derive significant insights, strengthen evidence-based policymaking, and foster initiatives aimed at enhancing air quality within campus settings.

Background

Research indicates that elevated concentrations of Carbon dioxide (CO2) within academic environments can exert detrimental effects on the health and academic performance of campus inhabitants. High levels of CO2 can result in several consequences, including Diminished Cognitive Function, Impaired Decision- Making, Diminished Academic Performance, and Health Concerns.1,2,3 Moreover, insufficient ventilation, which results in increased CO2 levels in educational settings, has been documented to obstruct students' performance in cognitive tasks, elevate absenteeism, and diminish test scores.4 Therefore the campus air quality dataset aims to provide data on environmental air quality parameters such as CO2 levels, temperature, humidity, and other relevant environmental factors within the campus facilities. This comprehensive information equips decision-makers at the University to execute targeted strategies for air quality enhancement, thereby ensuring a healthier and more sustainable living milieu for the students. Also, the dataset aids in informed decision-making processes aimed at improving the overall quality of life within the university's accommodation facilities.510

Data description

The AIoT dataset incorporates a heterogeneous assortment of environmental variables, including temperature, humidity, CO2 concentrations, and light intensity, which were amassed within the university campus setting through the deployment of IoT sensors. This dataset is in a table format and, meticulously structured to facilitate the thorough monitoring and examination of environmental air quality within the hostels, particularly concentrating on the comprehension and regulation of CO2 concentrations, and parameters captured are:

  • Temperature: The dataset comprises temperature measurements expressed in degrees Celsius, thereby providing valuable insights into the thermal conditions prevalent in various locations throughout the campus.

  • Humidity: The humidity readings, articulated as a percentage, yield essential information regarding the moisture content present in the indoor atmosphere.

  • CO2 Levels: The concentration of CO2, quantified in parts per million (ppm), is encapsulated within the dataset, thereby offering crucial data for the assessment of indoor air quality. Light Intensity: The dataset encompasses measurements of light intensity articulated in lux, thereby elucidating the illumination levels throughout the campus.

  • Light Intensity: Light intensity is a measure of the amount of light energy per unit area. It quantifies the brightness or luminance of a light source reaching a specific surface or point. Light intensity was measured in lux as the measuring unit.

  • Data Granularity and Size: The dataset displays a high degree of granularity, as it records data at short intervals, resulting in a considerable volume of data points. The dataset comprises a substantial number of records, which reflects the varied environmental conditions and activities occurring within the university campus.

Methods

Sensiron SCD30 sensor integrated with a digital light intensity sensor was used to capture all four parameters for air quality as presented in the dataset. The SCD30 and digital light sensors were connected to the grove base using the female-to-female jumper wires, the ESP module received the readings from the sensors, processed the raw data, and stored it in the microSD card for later retrieval.

Analysis and visualization of Dataset: The hostel dataset was analyzed using summary statistics that describe the key characteristics of the data and help in gaining a quick understanding of its distribution, mean and standard deviation were used. Also, the Variable distribution and scatter matrix plots provide a visual representation of the relationships between the different variables in the dataset. Figure 1 shows the schematic diagram showing the connections of sensors.

9e13a6b8-df94-4438-8d1e-69444901712a_figure1.gif

Figure 1. Schematic diagram showing connections of sensors (Author’s Own Construct).

Result and discussion

5.1. Boys hostel

The male hostel dataset was analyzed using summary statistics that describe the key characteristics of the data and help in gaining a quick understanding of its distribution, central tendency, and variability. The mean and standard deviation were used in this case. Also, the Variable distribution and scatter matrix plots provide a visual representation of the relationships between the different variables in the dataset.

Summary statistics observation

  • The mean temperature is approximately 29.59°C with a standard deviation of 12.81°C.

  • The mean humidity is around 0.00% with a standard deviation of 123.10%.

  • The mean CO2 concentration is approximately 0.00 ppm with a standard deviation of 62.09 ppm.

  • The mean light intensity is around 3764.83 units with a standard deviation of 4508.98 units.

Distribution of variables

  • Temperature Distribution: The temperature ranged from 28.99°C to 29.05°C, this indicated that the temperatures within the hostel was consistent and showed little variation. This narrow range suggests a relatively stable environment regarding temperature.

  • Humidity Distribution: As shown on Figures 2 to 5, Humidity levels ranges. Humidity levels ranged from 29.59% to 80.79%. This wide range indicates significant variability in humidity, reflecting different environmental conditions in the hostel. A lower humidity level (29.59%) imply drier conditions, while higher readings (up to 80.79%) indicate more humid and potentially uncomfortable environments.

  • CO2 Concentration : CO2 levels vary from 0.00 ppm to 671.08 ppm. The presence of a zero value denote instances of low ventilation or open spaces, while the maximum concentration suggests potential crowding or poor air quality situations. Elevated CO2 levels impact the comfort and health of the occupants when the value becomes too high.

Light Intensity: Light intensity ranges between 0 to 12751 units, representing a wide range of lighting conditions within the hostel. The presence of a zero reading indicates very dark or unlit areas, while the maximum level shows that there are areas with bright light. This variation influence the comfort and activity levels of the residents, as different lighting conditions affect mood and productivity.

9e13a6b8-df94-4438-8d1e-69444901712a_figure2.gif

Figure 2. Boys’ Hostel Variable Distribution Plot (Author’s Own Construct).

9e13a6b8-df94-4438-8d1e-69444901712a_figure3.gif

Figure 3. Scattered plot for male hostel (Author’s Own Construct).

9e13a6b8-df94-4438-8d1e-69444901712a_figure4.gif

Figure 4. Girls’ Hostel variable distribution plot (Author’s Own Construct).

9e13a6b8-df94-4438-8d1e-69444901712a_figure5.gif

Figure 5. Scattered plot for female hostel (Author’s Own Construct).

Scatter plots

  • Temperature vs Humidity shows a negative correlation between temperature and humidity. As the temperature increases, the humidity tends to decrease, which aligns with the correlation coefficient of -0.975

  • Temperature vs CO2 shows a weak negative correlation between temperature and CO2 levels. There is no clear trend indicating that temperature and CO2 levels are not strongly related.

  • Temperature vs Light Intensity shows a moderate positive correlation between temperature and light intensity. Higher temperatures are associated with higher light intensity.

  • Humidity vs CO2 shows a weak positive correlation between humidity and CO2 levels. There is no clear trend, indicating that humidity and CO2 levels are not strongly related

  • Humidity vs Light Intensity shows a moderate negative correlation between humidity and light intensity. Higher humidity is associated with lower light intensity

  • CO2 vs Light Intensity shows a weak negative correlation between CO2 levels and light intensity. There is no clear trend, indicating that CO2 levels and light intensity are not strongly related.

Female hostel

The female hostel dataset was analyzed using summary statistics that describe the key characteristics of the data and help in gaining a quick understanding of its distribution, central tendency, and variability. The mean and standard deviation were used in this case. Also, the Variable distribution and scatter matrix plots provide a visual representation of the relationships between the different variables in the dataset.

Summary statistics observation

  • Mean Temperature: The mean temperature of approximately 33.70°C with a standard deviation of 3.21°C indicates that, on average, the temperatures are relatively high. The standard deviation showed that temperatures fluctuate moderately around the mean. This range indicates warm conditions, which may lead to discomfort without proper cooling systems.

  • Mean Humidity: The mean humidity level of around 63.53% with a standard deviation of 9.05% indicates a moderately humid environment. The humidity levels showed feeling of dampness, which may lead to uncomfortable living condition.

  • Mean CO2 Concentration: A mean CO2 concentration of approximately 508.79 ppm with a standard deviation of 62.09 ppm showed that the air quality varies.

  • Mean Light Intensity: The mean light intensity of around 6826.76 units with a standard deviation of 4540.86 units suggested that there is significant variability in lighting conditions within the space.

Distribution of variables

  • The temperature distribution shows a range from 28.36°C to 38.75°C

  • The humidity distribution ranges from 50.13% to 78.29%.

  • The CO2 concentration ranges from 473.38 ppm to 7257.28 ppm, with a notable outlier at the maximum value.

  • The light intensity ranges from 0 to 14337 units, indicating a wide range of light conditions.

Scatter plots

  • Temperature vs. Humidity: The was a negative correlation between temperature and humidity which suggested an inverse relationship. As temperature increases, humidity decreases.

  • Temperature vs. CO2: The lack of a clear correlation between temperature and CO2 levels implies that changes in temperature do not regularly affect CO2 concentrations.

  • Temperature vs. Light Intensity: No clear correlation between temperature and light intensity shows that variations in one do not affect the other.

  • Humidity vs. CO2: As displayed in Table 1, the table specification was displayed as the specific area, subject are, data type and location of the data source can be located. No correlation between humidity and CO2 suggests that changes in moisture levels do not systematically affect carbon dioxide concentrations.

  • Humidity vs. Light Intensity: No clear correlation between humidity and light intensity indicates that variations in moisture do not appear to influence light levels.

  • CO2 vs. Light Intensity: No clear correlation between CO2 levels and light intensity suggests that changes in one do not impact the other.

Table 1. Specifications Table.

Subject Data Science.
Specific subject area Data Mining and Statistics.
Type of data Table, Raw
Data collection The dataset was collected over 14 days consists of 38,369 columns for the boys’ hostel and 13,469 for the Girls’ Hostel alongside corresponding four attributes i.e. Temperature, Humidity, Carbon Dioxide CO2, and light intensity for each data collected. Sensiron SCD30 sensor integrated with a digital light intensity sensor was used to capture all four parameters for air quality as presented in the dataset.
Data source location Institution: Bowen University, Boys and Girls Hostels City/Town/Region: Iwo/Osun
Country: Nigeria
Data accessibility Repository name: Mendeley Data
Data identification number: DOI: https://doi.org/10.17632/r95srm9m8m.1
Direct URL to data: https://data.mendeley.com/datasets/r95srm9m8m/1

Conclusion

This dataset proffers valuable insights into the dynamic characteristics of environmental conditions within a campus environment. Researchers and practitioners can leverage this data to analyze trends, formulate predictions, and implement strategies aimed at optimizing indoor air quality, energy consumption, and overall campus sustainability. The extensive dataset possesses considerable significance for researchers and practitioners who are engaged in environmental monitoring, indoor air quality management, and sustainable campus initiatives. The dataset can function as a foundational resource for the formulation, evaluation, and enhancement of data-driven models aimed at CO2 level forecasting and regulation within university campuses. Moreover, the dataset presents avenues for undertaking comprehensive analyses to extract insights concerning the interplay between environmental parameters and the quality of the indoor atmosphere.

Limitations

Not Applicable.

Ethics and consent

The authors have read and follow the ethical requirements for publication in Data in Brief and confirming that the current work does not involve human subjects, animal experiments, or any data collected from social media platforms.

'Ethical approval and consent were not required.'

Credit author statement

Chinecherem Umezuruike, and Halleluyah Aworinde, took the lead in data collection, Goodness Amodu, Abidemi Adeniyi, Michael Rudolph, Oluwasegun Aroba participated in the preprocessing of the dataset collected, writing and Michael Rudolph proofreading of the manuscript.

All of the authors contributed to the manuscript and gave their approval to the final version after offering constructive criticism and helping to develop the research, analysis, and manuscript.

Comments on this article Comments (0)

Version 1
VERSION 1 PUBLISHED 03 Apr 2025
Comment
Author details Author details
Competing interests
Grant information
Copyright
Download
 
Export To
metrics
Views Downloads
F1000Research - -
PubMed Central
Data from PMC are received and updated monthly.
- -
Citations
CITE
how to cite this article
Umezuruike C, Aworinde H, Amodu G et al. Campus air quality dataset [version 1; peer review: 1 approved with reservations, 1 not approved]. F1000Research 2025, 14:388 (https://doi.org/10.12688/f1000research.162509.1)
NOTE: If applicable, it is important to ensure the information in square brackets after the title is included in all citations of this article.
track
receive updates on this article
Track an article to receive email alerts on any updates to this article.

Open Peer Review

Current Reviewer Status: ?
Key to Reviewer Statuses VIEW
ApprovedThe paper is scientifically sound in its current form and only minor, if any, improvements are suggested
Approved with reservations A number of small changes, sometimes more significant revisions are required to address specific details and improve the papers academic merit.
Not approvedFundamental flaws in the paper seriously undermine the findings and conclusions
Version 1
VERSION 1
PUBLISHED 03 Apr 2025
Views
2
Cite
Reviewer Report 21 Aug 2025
Domenico Suriano, ENEA—Italian National Agency for New Technologies, Brindisi, Italy 
Not Approved
VIEWS 2
This article is focused on a database made with records reporting the concentration of CO2 and other environmental parameters measured in two hostels belonging to a university campus. The text of the article is clear and understandable, the document is ... Continue reading
CITE
CITE
HOW TO CITE THIS REPORT
Suriano D. Reviewer Report For: Campus air quality dataset [version 1; peer review: 1 approved with reservations, 1 not approved]. F1000Research 2025, 14:388 (https://doi.org/10.5256/f1000research.178719.r393312)
NOTE: it is important to ensure the information in square brackets after the title is included in all citations of this article.
Views
13
Cite
Reviewer Report 28 Apr 2025
Tymoteusz Miller, University of Szczecin, Szczecin, Poland 
Approved with Reservations
VIEWS 13
The article presents an environmental dataset collected via IoT sensors (SCD30 and digital light sensors) deployed in male and female hostels at Bowen University, Nigeria. The dataset records four key variables: temperature, humidity, carbon dioxide (CO₂) concentration, and light intensity. ... Continue reading
CITE
CITE
HOW TO CITE THIS REPORT
Miller T. Reviewer Report For: Campus air quality dataset [version 1; peer review: 1 approved with reservations, 1 not approved]. F1000Research 2025, 14:388 (https://doi.org/10.5256/f1000research.178719.r377521)
NOTE: it is important to ensure the information in square brackets after the title is included in all citations of this article.
  • Author Response 04 May 2025
    Oluwasegun Aroba, Centre for Ecological Intelligence, University of Johannesburg, Auckland Park, South Africa
    04 May 2025
    Author Response
    F1000Research Response
    Response 1
    We thank the reviewer for this observation. We clarified that, while existing studies aim generally at indoor conditions, there exists little publicly accessible data specifically tracking ... Continue reading
COMMENTS ON THIS REPORT
  • Author Response 04 May 2025
    Oluwasegun Aroba, Centre for Ecological Intelligence, University of Johannesburg, Auckland Park, South Africa
    04 May 2025
    Author Response
    F1000Research Response
    Response 1
    We thank the reviewer for this observation. We clarified that, while existing studies aim generally at indoor conditions, there exists little publicly accessible data specifically tracking ... Continue reading

Comments on this article Comments (0)

Version 1
VERSION 1 PUBLISHED 03 Apr 2025
Comment
Alongside their report, reviewers assign a status to the article:
Approved - the paper is scientifically sound in its current form and only minor, if any, improvements are suggested
Approved with reservations - A number of small changes, sometimes more significant revisions are required to address specific details and improve the papers academic merit.
Not approved - fundamental flaws in the paper seriously undermine the findings and conclusions
Sign In
If you've forgotten your password, please enter your email address below and we'll send you instructions on how to reset your password.

The email address should be the one you originally registered with F1000.

Email address not valid, please try again

You registered with F1000 via Google, so we cannot reset your password.

To sign in, please click here.

If you still need help with your Google account password, please click here.

You registered with F1000 via Facebook, so we cannot reset your password.

To sign in, please click here.

If you still need help with your Facebook account password, please click here.

Code not correct, please try again
Email us for further assistance.
Server error, please try again.