Keywords
Social media; Adolescents; TikTok; Instagram; Digital Well-being; Mental Health; Spain; Dataset.
The rapid proliferation of social media platforms among adolescents has generated significant concern about its implications for mental health and digital well-being. In Spain, where social network use is near-universal among minors, this intersection is especially acute given the country’s high rates of psychological disorders among young people. This article describes a dataset generated within the National R&D&i Project “Adolescent Recipients and Creators of Mental Health Content on Social Media” (SMARS, ref. PID2022-141454OB-I00). The dataset provides a comprehensive, multidimensional quantitative record of the digital lives of 1,043 Spanish adolescents aged 12 to 18, collected through a structured online survey administered between April 24 and May 6, 2024. It captures detailed information on platform usage patterns (TikTok and Instagram), content consumption preferences across 20 categories, social perceptions of digital technology impact, and validated digital well-being self-assessments. The instrument incorporates an adapted Social Perception Scale (α = 0.89) and a three-factor Digital Well-being on TikTok Scale (α = 0.79–0.83), validated through Exploratory Factor Analysis. This dataset offers a reusable, platform-specific resource for interdisciplinary research in communication, psychology, sociology, and digital health, enabling comparative studies, gender-differentiated analyses, and evidence-based digital literacy program development.
Social media; Adolescents; TikTok; Instagram; Digital Well-being; Mental Health; Spain; Dataset.
Adolescence constitutes a singular developmental period characterized by identity formation, peer belonging, and the internalization of social norms.1 In the contemporary era, these fundamental developmental tasks are increasingly mediated by digital technologies, giving rise to the concept of ‘digital socialization’—the acquisition of social experience in online contexts subsequently reproduced in a hybrid online-offline reality.1,2 The digital footprint of an adolescent is thus not merely an entertainment record but a central component of their real-world identity and social standing.1,3
In Spain, the penetration of social media among minors aged 12 to 18 is particularly high, with platforms such as TikTok and Instagram functioning as primary spaces for peer interaction, self-expression, and information consumption.1 This intensive engagement unfolds against the backdrop of a serious mental health crisis: Spain reports some of the highest rates of anxiety, depression, and eating disorders among adolescents in the European Union, conditions disproportionately affecting adolescent girls.4 The COVID-19 pandemic and associated lockdowns acted as catalysts, markedly intensifying social media consumption among minors while simultaneously increasing their psychological vulnerability.4
While the scientific community continues to debate the causal nature of the relationship between social media use and mental health outcomes, a growing empirical consensus suggests that impact is strongly contingent on the specific architectural features of the platform and the type of content consumed, rather than screen time alone.1,3 This distinction is critical: TikTok’s algorithm-driven individualized feed model—facilitating a primarily intrapersonal mode of interaction—differs structurally and psychologically from Instagram’s socially-oriented architecture, which functions primarily to represent and intensify real-world social relations.3
The SMARS project was designed to address this empirical gap by generating platform-specific, nationally representative data on Spanish adolescent digital behavior and well-being. The resulting dataset, described in this Data Note, provides a comprehensive, high-quality resource for investigating the multifaceted relationship between platform-specific social media use, content consumption, and psychosocial well-being across a diverse adolescent population. Preliminary analyses embedded in the dataset documentation reveal a complex narrative. Adolescents generally perceive the impact of social media on their lives as neither fully positive nor negative, with a sense of belonging and social connection partially offsetting negative impacts on psychological well-being.
However, marked gender disparities are evident: girls who use these platforms more intensively report more negative perceptions of the impact on their mental health, consistent with hypotheses about heightened pressures related to physical appearance and the pursuit of external validation.3 Conversely, platform users perceive the social impact of digital technologies more positively than non-users, suggesting that active participation confers a form of social capital highly valued within adolescent peer groups. The data further document an ‘Agency Paradox’: while adolescents report social and organizational benefits from platform use, their self-perceived ability to set limits on their own usage decreases significantly as engagement deepens—a pattern particularly pronounced among heavy TikTok users. This dataset serves as a reusable, well-documented resource that provides not only descriptive statistics on platform engagement but also validated psychometric evaluations of digital well-being across three dimensions: emotional resilience, agency, and social connection. By offering granular data on how 20 distinct content categories correlate with emotional outcomes and social perceptions, this resource enables the development of nuanced models of digital impact that transcend the limitations of aggregate screen time metrics.
Data were collected through a structured online survey administered to a representative sample of 1,043 adolescents residing in Spain. The sampling strategy employed a stratified random design to ensure generalizability to the national population of youth aged 12 to 18 years. The stratification process was conducted in multiple stages. Initially, the sample was segmented proportionally across the seventeen Autonomous Communities of Spain according to population distributions published by the Spanish Statistics Office (Instituto Nacional de Estadística, INE) for 2023. Within each region, municipalities served as primary sampling units, ensuring representation of both metropolitan and rural environments across five population size strata. Final participant selection applied a transversal classification by gender, age, and municipality size. The achieved sample comprised 527 females (50.5%) and 516 males (49.5%), with a mean age of 13.96 years (SD = 1.43). The data collection company ODEC was commissioned to manage fieldwork, which was conducted between April 24 and May 6, 2024. For specific analyses concerning TikTok, a sub-sample of 737 platform users was identified from the full sample (the demographic and structural breakdown of the sample is presented in Table 1).
The study was conducted in strict adherence to ethical standards and in full accordance with the principles stated in the Declaration of Helsinki. The research design received a formal favourable report and ethical approval from the Research Ethics Committee of the Universitat Oberta de Catalunya (UOC), under reference number CE24-PR05, confirming that the research meets national and international guidelines for research on human participants. Given that the research involved minors, a rigorous informed consent and assent process was systematically implemented. In accordance with Organic Law 3/2018 on the Protection of Personal Data and Guarantee of Digital Rights (LOPDGDD), written informed assent was obtained directly from adolescents aged 14 to 17 for data processing, whereas for younger participants aged 12 and 13, written informed consent was obtained from parents or legal guardians prior to survey access. Verbal consent was not used; all participants or guardians provided written documentation before accessing the instrument. The survey instrument included an introductory section detailing the research objectives, the strictly voluntary nature of participation, the right to withdraw at any time without consequence, and the technical measures implemented to guarantee complete anonymity. All data handling by the fieldwork company ODEC and the research team adhered to UNE EN ISO/IEC 27001 standards for information security management. Participants were explicitly informed that their responses would be processed exclusively in aggregate form and used solely for scientific purposes. No individual-level data was retained by the data collection company after the cleaning and anonymisation phase.
The questionnaire was structured into nine blocks designed to capture a holistic view of the adolescent digital experience, aligned with the SMARS project’s theoretical framework. A detailed description of each block follows:
• Block 0 – Socio-demographic profile: Gender (male/female/other), age (12–18 years), autonomous community of residence, municipality size (five strata from <10,000 to >500,000 inhabitants), and current educational level.
• Block 1 – Digital technology use: Daily usage time (in minutes) across eleven platforms (WhatsApp, Telegram, YouTube, Facebook, Twitter/X, Instagram, TikTok, Snapchat, Twitch, BeReal, Discord) on a seven-point ordinal scale ranging from ‘do not use’ to ‘more than 2 hours per day’.
• Block 2 – Digital competencies: Self-assessed technical competencies (9 items), informational competencies (9 items), knowledge of data practices by technology companies (2 items), and artificial intelligence literacy (6 items), all measured on 5-point Likert scales.
• Block 3 – Perceptions of digital technologies: An adapted 9-item scale5 measuring adolescents’ perceptions of how digital technologies affect nine dimensions of their social lives, including psychological well-being, peer socialization, group belonging, and civic autonomy, rated on a 5-point scale from ‘very negatively’ to ‘very positively’.
• Block 4 – Health information seeking online: Items on health information needs, topics of interest, sources used (including social media), and decision criteria for source selection.
• Block 5 – Mental health knowledge and attitudes: A 9-item adapted attitudinal scale and a 12-item knowledge test (true/false) assessing literacy about mental health conditions and stigma.
• Block 6 – Perceptions of mental health content on social media: Eight items measuring opinions on influencer and peer-produced mental health content, perceived credibility by platform, emotional reactions, and perceived influence on mental health perceptions.
• Block 7 – TikTok content consumption (TikTok users only): Frequency of engagement with 20 content categories (e.g., comedy, beauty, fitness, psychology, news) on a 5-point Likert scale from ‘never’ to ‘always’. All 20 items were assessed on a 5-point Likert scale: 1 = Never, 2 = Almost never, 3 = Sometimes, 4 = Almost always, 5 = Always. Block 7 was administered exclusively to participants who reported using TikTok (N = 737) across the specific content groups and themes outlined in Table 2.
• Block 8 – Digital well-being : The Digital Well-being on TikTok Scale [6, adapted], comprising 13 items structured across three subscales: emotional resilience (4 items), agency (4 items), and social connection and communion (5 items), measured on a 5-point agreement scale.
• Block 9 – Emotion self-regulation strategies: A 25-item instrument measuring a range of cognitive, behavioral, and social strategies used by adolescents when experiencing negative emotions, including prosocial, avoidant, creative, and potentially maladaptive responses.
Multiple quality control measures were implemented during and after survey administration to ensure the reliability and validity of the dataset. During fieldwork, the online survey incorporated attention-check items designed to detect and exclude respondents providing inconsistent or inattentive responses. Incomplete submissions were automatically discarded by the data collection system prior to export. Following data acquisition, a systematic cleaning and anonymisation protocol was applied. Open-ended responses in ‘other’ category fields were reviewed to remove any incidental personal identifiers and standardised into title case strings where feasible. Ordinal scales were harmonised across questionnaire blocks to ensure coding consistency. Missing values were labelled uniformly. The final dataset includes only fully completed and validated interviews (N = 1,043). Following standard anonymisation procedures applied by ODEC in accordance with ISO/IEC 27001, the dataset was verified to contain no directly or indirectly identifying information before being transferred to the research team.
The technical quality of the dataset was assessed through comprehensive statistical analyses focusing on internal consistency, structural validity, and consistency with established sociological benchmarks. All analyses were conducted using IBM SPSS Statistics version 25.0. Internal consistency of the adapted Social Perception Scale (Block 3, 9 items) was evaluated using Cronbach’s α, yielding a coefficient of 0.89, indicating excellent internal consistency for this cohort.
For the Digital Well-being on TikTok Scale (Block 8, 13 items, administered to N = 737 TikTok users), an Exploratory Factor Analysis (EFA) was conducted using principal axis factoring with oblimin rotation to account for expected inter-factor correlations. The Kaiser-Meyer-Olkin measure of sampling adequacy was satisfactory (KMO = 0.87), and Bartlett’s test of sphericity was significant (p < .001), confirming the suitability of the correlation matrix for factor analysis. The EFA yielded a three-factor solution explaining 60.46% of total variance. This structure differs slightly from the four-factor model proposed in the original instrument by Prakash,6 as the social connection and communion dimensions merged into a single factor for the Spanish adolescent cohort, suggesting cultural specificity in how these constructs are experienced. Cronbach’s α coefficients for all subscales exceeded the threshold of 0.70 conventionally accepted in social science research ( Table 3).
Descriptive and inferential statistical audits were performed to validate expected patterns in the data and ensure consistency with established sociological findings. Regarding gander differences, the data confirm that adolescent girls report significantly more negative perceptions of the impact of digital technologies on their psychological well-being than boys ( Table 4), consistent with prior literature on gander-differentiated vulnerability on visual platforms.3 Independent samples t-tests further confirmed that TikTok and Instagram users perceive the social impact of digital technologies significantly more positively than non-users on most scale dimensions, validating the social capital value attributed to platform participation within the adolescent peer group.
| Dimension of Perception | Total mean | Girls mean | Boys mean | t-test (p) |
|---|---|---|---|---|
| Psychological Well-being | 3.06 | 2.99 | 3.13 | < .001** |
| Ability to Argue and Debate | 3.39 | 3.37 | 3.40 | .52 (ns) |
| Peer Socialization | 3.45 | 3.46 | 3.44 | .80 (ns) |
| Acceptance of Social Norms | 3.35 | 3.33 | 3.38 | .36 (ns) |
| Decision-making Autonomy | 3.31 | 3.29 | 3.33 | .49 (ns) |
| Block | Content Domain | No. variables | Measurement |
|---|---|---|---|
| Block 0 | Socio-demographic profile | 5 | Nominal/Ordinal |
| Block 1 | Platform usage (11 platforms) | 12 | Ordinal (7-point) |
| Block 2 | Digital competencies | 26 | Likert 5-point/Nominal |
| Block 3 | Social perceptions of technology | 9 | Likert 5-point |
| Block 4 | Health information seeking | 7 | Mixed (nominal/ordinal) |
| Block 5 | Mental health knowledge & attitudes | 22 | Likert 5-point/Binary |
| Block 6 | Mental health content on social media | 10 | Likert 5-point |
| Block 7 | TikTok content consumption* | 20 | Likert 5-point |
| Block 8 | Digital well-being (TikTok)* | 13 | Likert 5-point |
| Block 9 | Emotion self-regulation strategies | 25 | Likert 5-point |
Analysis of digital well-being dimensions revealed that the Agency subscale (mean = 3.22, SD = 0.91) scored lowest of the three subscales, compared to Social Connection (mean = 3.64, SD = 0.57) and Emotional Resilience (mean = 3.31, SD = 0.60). A one-way analysis of variance (ANOVA) confirmed a significant effect of daily TikTok usage time on agency scores, F(5, 731) = 7.48, p < .05, with post-hoc tests (Tukey HSD) indicating that heavy users (>2 hours per day) reported significantly lower agency than light users. This pattern provides convergent internal validity, demonstrating that self-perceived capacity to set boundaries on TikTok use decreases as usage time increases.
No custom or proprietary code was developed to generate or process the dataset described in this Data Note. Data collection and fieldwork were managed by ODEC using standardised commercial survey administration infrastructure. Data cleaning, anonymisation, and quality control procedures were performed using standard functions in Microsoft Excel. Statistical validation analyses were conducted using IBM SPSS Statistics version 25.0 following standard documented procedures.
Figshare: Adolescent Digital Engagement and Well-being Patterns Among Spanish Youth: A Dataset on Social Media Use, Mental Health Perceptions, and Digital Well-being (SMARS, 2024). https://doi.org/10.6084/m9.figshare.32626206.7
This repository contains the following underlying data files distributed under the terms of the Creative Commons Attribution 4.0 International License (CC BY 4.0):
• SMARS_Adolescent_Social_Media_Spain_2024.csv: Primary quantitative dataset containing raw, fully anonymised responses from 1,043 participants across 117 columns.
• Codebook_SMARS_2024.xlsx: Metadata variable codebook detailing item labels, measurement types, valid value codes, and routing logic constraints.
Figshare: Adolescent Digital Engagement and Well-being Patterns Among Spanish Youth: Supplementary Materials. https://doi.org/10.6084/m9.figshare.32626206
This repository contains the following extended data files distributed under the terms of the Creative Commons Attribution 4.0 International License (CC BY 4.0). The structural blueprint and comprehensive cross-sectional matrix mapping of these items are visually cross-referenced in Table 5:
| Views | Downloads | |
|---|---|---|
| F1000Research | - | - |
|
PubMed Central
Data from PMC are received and updated monthly.
|
- | - |
Provide sufficient details of any financial or non-financial competing interests to enable users to assess whether your comments might lead a reasonable person to question your impartiality. Consider the following examples, but note that this is not an exhaustive list:
Sign up for content alerts and receive a weekly or monthly email with all newly published articles
Already registered? Sign in
The email address should be the one you originally registered with F1000.
You registered with F1000 via Google, so we cannot reset your password.
To sign in, please click here.
If you still need help with your Google account password, please click here.
You registered with F1000 via Facebook, so we cannot reset your password.
To sign in, please click here.
If you still need help with your Facebook account password, please click here.
If your email address is registered with us, we will email you instructions to reset your password.
If you think you should have received this email but it has not arrived, please check your spam filters and/or contact for further assistance.
Comments on this article Comments (0)