<?xml version="1.0" encoding="UTF-8"?><!DOCTYPE article PUBLIC "-//NLM//DTD JATS (Z39.96) Journal Publishing DTD v1.2 20190208//EN" "http://jats.nlm.nih.gov/publishing/1.2/JATS-journalpublishing1.dtd"><article xmlns:mml="http://www.w3.org/1998/Math/MathML" xmlns:xlink="http://www.w3.org/1999/xlink" article-type="research-article" dtd-version="1.2" xml:lang="en">
    <front>
        <journal-meta>
            <journal-id journal-id-type="pmc">F1000Research</journal-id>
            <journal-title-group>
                <journal-title>F1000Research</journal-title>
            </journal-title-group>
            <issn pub-type="epub">2046-1402</issn>
            <publisher>
                <publisher-name>F1000 Research Limited</publisher-name>
                <publisher-loc>London, UK</publisher-loc>
            </publisher>
        </journal-meta>
        <article-meta>
            <article-id pub-id-type="doi">10.12688/f1000research.145845.3</article-id>
            <article-categories>
                <subj-group subj-group-type="heading">
                    <subject>Research Article</subject>
                </subj-group>
                <subj-group>
                    <subject>Articles</subject>
                </subj-group>
            </article-categories>
            <title-group>
                <article-title>Sentiment analysis of internet posts on vaccination using ChatGPT and comparison with actual vaccination rates in South Korea</article-title>
                <fn-group content-type="pub-status">
                    <fn>
                        <p>[version 3; peer review: 1 approved, 2 approved with reservations]</p>
                    </fn>
                </fn-group>
            </title-group>
            <contrib-group>
                <contrib contrib-type="author" corresp="yes">
                    <name>
                        <surname>Park</surname>
                        <given-names>Sunyoung</given-names>
                    </name>
                    <role content-type="http://credit.niso.org/">Conceptualization</role>
                    <role content-type="http://credit.niso.org/">Data Curation</role>
                    <role content-type="http://credit.niso.org/">Formal Analysis</role>
                    <role content-type="http://credit.niso.org/">Investigation</role>
                    <role content-type="http://credit.niso.org/">Methodology</role>
                    <role content-type="http://credit.niso.org/">Project Administration</role>
                    <role content-type="http://credit.niso.org/">Resources</role>
                    <role content-type="http://credit.niso.org/">Software</role>
                    <role content-type="http://credit.niso.org/">Supervision</role>
                    <role content-type="http://credit.niso.org/">Validation</role>
                    <role content-type="http://credit.niso.org/">Visualization</role>
                    <role content-type="http://credit.niso.org/">Writing &#x2013; Original Draft Preparation</role>
                    <role content-type="http://credit.niso.org/">Writing &#x2013; Review &amp; Editing</role>
                    <uri content-type="orcid">https://orcid.org/0000-0003-1973-0073</uri>
                    <xref ref-type="corresp" rid="c1">a</xref>
                    <xref ref-type="aff" rid="a1">1</xref>
                </contrib>
                <aff id="a1">
                    <label>1</label>Department of Psychiatry, National Health Insurance Service Ilsan Hospital, Goyang-si, Gyeonggi-do, 10444, South Korea</aff>
            </contrib-group>
            <author-notes>
                <corresp id="c1">
                    <label>a</label>
                    <email xlink:href="mailto:bechungan@nhimc.or.kr">bechungan@nhimc.or.kr</email>
                </corresp>
                <fn fn-type="conflict">
                    <p>No competing interests were disclosed.</p>
                </fn>
            </author-notes>
            <pub-date pub-type="epub">
                <day>17</day>
                <month>1</month>
                <year>2025</year>
            </pub-date>
            <pub-date pub-type="collection">
                <year>2024</year>
            </pub-date>
            <volume>13</volume>
            <elocation-id>96</elocation-id>
            <history>
                <date date-type="accepted">
                    <day>10</day>
                    <month>1</month>
                    <year>2025</year>
                </date>
            </history>
            <permissions>
                <copyright-statement>Copyright: &#x00a9; 2025 Park S</copyright-statement>
                <copyright-year>2025</copyright-year>
                <license xlink:href="https://creativecommons.org/licenses/by/4.0/">
                    <license-p>This is an open access article distributed under the terms of the Creative Commons Attribution Licence, which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly cited.</license-p>
                </license>
            </permissions>
            <self-uri content-type="pdf" xlink:href="https://f1000research.com/articles/13-96/pdf"/>
            <abstract>
                <sec>
                    <title>Background</title>
                    <p>This study used ChatGPT for sentiment analysis to investigate the possible links between online sentiments and COVID-19 vaccination rates. It also examines Internet posts to understand the attitudes and reasons associated with vaccine-related opinions.</p>
                </sec>
                <sec>
                    <title>Methods</title>
                    <p>We collected 500,558 posts over 60 weeks from the Blind platform, mainly used by working individuals, and 854 relevant posts were analyzed. After excluding duplicates and irrelevant content, attitudes toward and reasons for vaccine opinions were studied through sentiment analysis. The study further correlated these categorized attitudes with the actual vaccination data.</p>
                </sec>
                <sec>
                    <title>Results</title>
                    <p>The proportions of posts expressing positive, negative, and neutral attitudes toward COVID-19 vaccines were 5%, 83%, and 12%, respectively. The total post count showed a positive correlation with the vaccination rate, indicating a high correlation between the number of negative posts about the vaccine and the vaccination rate. Negative attitudes were predominantly associated with societal distrust and perceived oppression.</p>
                </sec>
                <sec>
                    <title>Conclusions</title>
                    <p>This study demonstrates the interplay between public perceptions of COVID-19 vaccines as expressed through social media and vaccination behavior. These correlations can serve as useful clues for devising effective vaccination strategies.</p>
                </sec>
            </abstract>
            <kwd-group kwd-group-type="author">
                <kwd>COVID-19</kwd>
                <kwd>vaccination</kwd>
                <kwd>ChatGPT</kwd>
                <kwd>sentiment analysis</kwd>
            </kwd-group>
            <funding-group>
                <funding-statement>The author(s) declared that no grants were involved in supporting this work.</funding-statement>
            </funding-group>
        </article-meta>
        <notes>
            <sec sec-type="version-changes">
                <label>Revised</label>
                <title>Amendments from Version 2</title>
                <p>I considered several ways to fix the data imbalance and conducted additional analyses following the reviewer's comment. I have also added the results and acknowledged this as a limitation in the Discussion section.</p>
            </sec>
        </notes>
    </front>
    <body>
        <sec id="sec5" sec-type="intro">
            <title>Introduction</title>
            <p>The exchange of opinions and information on social media platforms has become vital for social interaction and communication. This transformation has created an environment in which information and opinions about societal issues spread rapidly and are shared.
                <sup>
                    <xref ref-type="bibr" rid="ref1">1</xref>
                </sup> In recent years, the global landscape has been significantly affected by the COVID-19 pandemic, leading to an abundance of Internet posts discussing various aspects of this crisis. Among these critical topics, vaccination has emerged as an indispensable tool for overcoming the challenges posed by the pandemic.
                <sup>
                    <xref ref-type="bibr" rid="ref2">2</xref>
                </sup> Despite their importance, the distribution and vaccination rates vary significantly across nations, regions, and occupations,
                <sup>
                    <xref ref-type="bibr" rid="ref3">3</xref>
                </sup> and these differences are closely related to individuals&#x2019; perceptions of and attitudes toward vaccines.
                <sup>
                    <xref ref-type="bibr" rid="ref4">4</xref>
                </sup>
                <sup>,</sup>
                <sup>
                    <xref ref-type="bibr" rid="ref5">5</xref>
                </sup>
            </p>
            <p>Gathering public opinion on vaccination through Internet posts is anticipated to contribute to providing accurate information and obtaining insights for policy formulation in infectious disease prevention.
                <sup>
                    <xref ref-type="bibr" rid="ref6">6</xref>
                </sup> Sentiment analysis is commonly employed to analyze extensive and diverse online content. It is a form of text classification that focuses on subjective statements and is often called opinion mining. The goal was to analyze opinions to gain insights into public perceptions.
                <sup>
                    <xref ref-type="bibr" rid="ref7">7</xref>
                </sup> However, the complexity and necessity of meticulous processes to enhance accuracy pose challenges for sentiment analysis.
                <sup>
                    <xref ref-type="bibr" rid="ref8">8</xref>
                </sup> Moreover, the language used on social media platforms can vary in style among users and may require understanding within the context. In such cases, existing sentiment-analysis models may find it challenging to adapt to diversity and changes.
                <sup>
                    <xref ref-type="bibr" rid="ref9">9</xref>
                </sup>
            </p>
            <p>Therefore, in this study, we employed ChatGPT, a state-of-the-art language model recognized for its natural language understanding capabilities, to explore the nuanced insights derived from the analysis.
                <sup>
                    <xref ref-type="bibr" rid="ref10">10</xref>
                </sup> ChatGPT excels in contextual understanding, allowing comprehension of the nuanced meanings within the context of a conversation. This contextual awareness is particularly valuable in sentiment analysis, where the interpretation of sentiments often relies on an understanding of the surrounding text. In addition, ChatGPT is adaptable to diverse language nuances, capturing the intricacies of expression across different linguistic styles, cultural variations, and demographic factors.
                <sup>
                    <xref ref-type="bibr" rid="ref11">11</xref>
                </sup> This adaptability can be crucial in sentiment analysis, especially when dealing with social media discourse, where language can be highly dynamic and varied.</p>
            <p>In particular, the sentiment analysis conducted in this study using ChatGPT holds several strengths. Firstly, the objective was to compare the attitudes towards vaccines identified in online content with actual vaccination rates. While analyzing data extracted from internet posts through sentiment analysis contributes to understanding a multitude of opinions, there has been a limitation in verifying how these expressed opinions relate to real-world behavioral outcomes. Analyzing the relationship with societal behavioral outcomes, as reflected in actual vaccination rates, will aid in comprehending the results of sentiment analysis and considering future analytical directions.
                <sup>
                    <xref ref-type="bibr" rid="ref12">12</xref>
                </sup> Secondly, sentiment analysis conducted in Korean, a relatively smaller cultural sphere, has encountered limitations despite various approaches. This study sought to explore the extent to which ChatGPT could contribute to understanding the subtle nuances in internet posts written by diverse individuals in the Korean language.
                <sup>
                    <xref ref-type="bibr" rid="ref13">13</xref>
                </sup>
            </p>
            <p>This study aimed to conduct a sentiment analysis of Internet postings using OpenAI GPT-3.5-turbo model, exploring the potential associations between diverse opinions expressed on social media and actual vaccination rates. The goal was to investigate the causal relationship between sentiments observed through sentiment analysis and real-world behavior. We hypothesized that a higher prevalence of positive vaccine-related posts correlates with elevated vaccination rates. Additionally, the research intended to further analyze the reasons associated with attitudes toward vaccines as expressed in internet posts.</p>
        </sec>
        <sec id="sec1.2">
            <title>Literature review</title>
            <p>Recent studies have provided significant insights by analyzing public sentiment towards COVID-19 vaccines on social media. These studies primarily identify positive or negative reactions to evaluate public perceptions of the vaccine.
                <sup>
                    <xref ref-type="bibr" rid="ref32">14</xref>
                </sup> Sentiment analysis of tweets using natural language processing techniques has been effective in analyzing changing attitudes toward vaccines over time and across different countries. Analyzing 11 million tweets from 180 countries revealed an increasing trend in positive responses to vaccines over time.
                <sup>
                    <xref ref-type="bibr" rid="ref33">15</xref>
                </sup> Furthermore, it was found that providing sufficient information is essential for fostering positive attitudes toward vaccines.
                <sup>
                    <xref ref-type="bibr" rid="ref34">16</xref>
                </sup>
            </p>
            <p>Such data is crucial for formulating public health policies and communication strategies related to vaccines. When examining the correlation between these results and actual vaccination rates, it was reported that higher numbers of positive tweets correlate with higher vaccination rates.
                <sup>
                    <xref ref-type="bibr" rid="ref33">15</xref>
                </sup> Additionally, studies have shown that regions with more positive tweets tend to have higher vaccination rates, especially among populations aged 40 and above.
                <sup>
                    <xref ref-type="bibr" rid="ref36">17</xref>
                </sup> However, distrust and misinformation also contribute to lower vaccination rates.
                <sup>
                    <xref ref-type="bibr" rid="ref37">18</xref>
                </sup> Therefore, providing accurate information and fostering positive public sentiment are essential to increase vaccination rates.</p>
            <p>To achieve reliable results in these studies, effective sentiment analysis is essential. However, sentiment analysis of tweets presents several challenges, such as sarcasm, irony, and language-specific challenges.
                <sup>
                    <xref ref-type="bibr" rid="ref38">19</xref>
                </sup> Current natural language processing technologies like Vader and TextBlob are mostly optimized for English data, making them difficult to use for analyzing minor languages like Korean.
                <sup>
                    <xref ref-type="bibr" rid="ref39">20</xref>
                </sup> Even when new Korean lexicons are developed for research, their accuracy ranges from 40-70%, which leads to inconsistent results.
                <sup>
                    <xref ref-type="bibr" rid="ref40">21</xref>
                </sup> Therefore, advanced AI-based natural language processing tools like ChatGPT, which excel in understanding complex language patterns and nuances, show promise in achieving high accuracy across various languages and dialects.
                <sup>
                    <xref ref-type="bibr" rid="ref41">22</xref>
                </sup> This capability is expected to enhance the reliability of sentiment analysis.</p>
        </sec>
        <sec id="sec6" sec-type="methods">
            <title>Methods</title>
            <p>This observational study analyzed users&#x2019; perceptions of the COVID-19 vaccine on Internet platforms targeting working individuals and their relationship with actual vaccination rates. This study, involving the collection and analysis of publicly available internet posts without containing personal information, received approval for a consent waiver and exemption from review through the National Health Insurance Service Ilsan Hospital Institutional Review Board (IRB) (NHIMC-2023-08-028).</p>
            <sec id="sec7">
                <title>Web crawling and data collection</title>
                <p>On the Internet, there are cases in which individuals intentionally post multiple messages to emphasize their claims or engage in actions with specific intentions, such as marketing.
                    <sup>
                        <xref ref-type="bibr" rid="ref14">23</xref>
                    </sup> To counteract this form of online manipulation, we utilized web crawling on the social network service (SNS) &#x2018;Blind (
                    <ext-link ext-link-type="uri" xlink:href="https://www.teamblind.com/kr/">https://www.teamblind.com/kr/</ext-link>),&#x2019; where individuals involved in employment, job-seeking, and workplace organizations actively participate and interact. Users join this SNS by using their individual email accounts associated with their respective workplaces, anonymously. The posts were web-scraped using the Python Selenium package, adhering to the Blind&#x2019;s Access Restriction Protocol (robots. txt). The data collection period spanned between March 23, 2022, and May 16, 2023, totaling 60 weeks, and a total of 500,558 posts were gathered. Information such as post number, posting date, publicly available workplace information of the post author, post title, and content was collected through web scraping. To ensure the integrity of the dataset, we implemented a robust method for excluding duplicate posts. Each post was uniquely identified based on a combination of attributes, including post number, posting date, and author details. Posts sharing identical attributes were flagged as duplicates, and only the earliest occurrence was retained for analysis. This process aimed to eliminate redundancy and maintain the diversity of opinions within the dataset. After then, posts were filtered based on the presence of keywords associated with COVID-19, such as &#x201c;COVID,&#x201d; &#x201c;coronavirus,&#x201d; and vaccine-specific terms.scraping. To ensure the integrity of the dataset, we implemented a robust method for excluding duplicate posts. Each post was uniquely identified based on a combination of attributes, including post number, posting date, and author details. After then, posts were filtered on whether they contained the following keywords. Keywords related to COVID-19 were checked, such as COVID-19, COVID, and coronavirus, as well as keywords related to vaccination, for example, vaccination, vaccine, and inoculation. Python&#x2019;s Beautiful Soup libarary and Request module were used for this purpose. Keyword searches were conducted on the title and content of each post.</p>
            </sec>
            <sec id="sec8">
                <title>Data refinement and pre-processing
</title>
                <p>After excluding duplicate posts and those less relevant to COVID-19, 4,419 posts were curated, with 854 posts specifically mentioning the vaccines chosen for the analysis. The posts underwent text preprocessing, involving UTF-8 encoding, stop word and URL removal, as well as the removal of emojis and special characters.</p>
            </sec>
            <sec id="sec9">
                <title>Sentiment analysis and reasoning extraction using ChatGPT</title>
                <p>Subsequently, the posts were analyzed using the OpenAI GPT-3.5-turbo model. This model, pretrained on a large corpus of language data by OpenAI, classified the attitude expressed in each post toward the vaccine as positive, negative, or neutral, following the criteria commonly used in the sentiment analysis of conditional statements, evaluating whether the sentiment toward a specific topic is positive, negative, or neutral.
                    <sup>
                        <xref ref-type="bibr" rid="ref7">7</xref>
                    </sup> In cases where the model could not confidently assign an attitude, we instructed it to respond with &#x2018;unclear&#x2019;.</p>
                <p>Using ChatGPT, we extracted the reasons for positive or negative attitudes toward vaccines from Internet posts. The system&#x2019;s role was an AI assistant tasked with identifying the reasons behind the positive or negative views on COVID-19 vaccines in the given posts. The goal was to extract up to three reasons per post. Cases in which the reasons were unknown or difficult to classify were noted accordingly.</p>
            </sec>
            <sec id="sec10">
                <title>Statistical analysis</title>
                <p>COVID-19 vaccination information was collected from the Korea Disease Control and Prevention Agency daily vaccination status (
                    <ext-link ext-link-type="uri" xlink:href="https://ncv.kdca.go.kr/vaccineStatus.es?mid=a11710000000">https://ncv.kdca.go.kr/vaccineStatus.es?mid=a11710000000</ext-link>). The counts for the first and second doses were collected over the 60-week research period, including additional winter booster vaccination counts for 31 weeks starting on October 11, 2022.</p>
                <p>Considering the working patterns of employees and the lower frequency of vaccination on weekends, a correlation analysis was performed on the sum of the posts and weekly vaccination counts. Furthermore, an association analysis was conducted to explore relationships among the reasons mentioned in the posts, specifically focusing on understanding the relationships between key reasons behind positive and negative attitudes. All analyses were performed using R version 4.2.2.</p>
            </sec>
        </sec>
        <sec id="sec11" sec-type="results">
            <title>Results</title>
            <p>The analysis used OpenAI&#x2019;s gpt-3.5-turbo model, which yielded 851 emotional evaluations. In three cases, the model reported uncertainty; upon manual review, these instances were deemed uncertain and excluded from the analysis. Two psychiatrists evaluated 100 posts, resulting in 85% agreement with the results provided by the gpt-3.5-turbo model. The results of the confusion matrix are presented in the Extended data.</p>
            <p>Over the 60-week analysis period, the sentiment distribution was as follows: 44 positive (5%), 704 negative (83%), and 103 neutral (12%) posts. For posts collected over 31 weeks starting from October 11, 2022, and associated with additional vaccinations during the winter season, the sentiment distribution was 20 positive (6%), 254 negative (80%), and 43 neutral (14%) posts. The total vaccination counts, weekly vaccination averages, and weekly post-vaccination averages for each study period are presented in 
                <xref ref-type="table" rid="T1">
Table 1</xref>.</p>
            <table-wrap id="T1" orientation="portrait" position="float">
                <label>
Table 1. </label>
                <caption>
                    <title>The mean of posts by sentiment and vaccination types.</title>
                </caption>
                <table content-type="article-table" frame="hsides">
                    <thead>
                        <tr>
                            <th align="left" colspan="8" rowspan="1" valign="top">A. Whole study period (for 60 weeks)</th>
                        </tr>
                        <tr>
                            <th align="left" colspan="1" rowspan="2" valign="top"/>
                            <th align="left" colspan="4" rowspan="1" valign="top">Posts</th>
                            <th align="left" colspan="3" rowspan="1" valign="top">Vaccination</th>
                        </tr>
                        <tr>
                            <th align="left" colspan="1" rowspan="1" valign="top">Positive 
(N=44)</th>
                            <th align="left" colspan="1" rowspan="1" valign="top">Negative 
(N=704)</th>
                            <th align="left" colspan="1" rowspan="1" valign="top">Neutral 
(N=103)</th>
                            <th align="left" colspan="1" rowspan="1" valign="top">Total 
(N=851)</th>
                            <th align="left" colspan="1" rowspan="1" valign="top">1
                                <sup>st</sup> Dose 
(N=179053)</th>
                            <th align="left" colspan="1" rowspan="1" valign="top">2
                                <sup>nd</sup> Dose 
(N=224735)</th>
                            <th align="left" colspan="1" rowspan="1" valign="top">
Total 
(N=403788)</th>
                        </tr>
                    </thead>
                    <tbody>
                        <tr>
                            <td align="left" colspan="1" rowspan="1" valign="top">Mean &#x00b1; SD (per week)</td>
                            <td align="left" colspan="1" rowspan="1" valign="top">0.73 &#x00b1; 1.01</td>
                            <td align="left" colspan="1" rowspan="1" valign="top">11.73 &#x00b1; 8.31</td>
                            <td align="left" colspan="1" rowspan="1" valign="top">1.72 &#x00b1; 1.74</td>
                            <td align="left" colspan="1" rowspan="1" valign="top">14.18 &#x00b1; 9.67</td>
                            <td align="left" colspan="1" rowspan="1" valign="top">2984.22 &#x00b1; 4520.73</td>
                            <td align="left" colspan="1" rowspan="1" valign="top">3745.58 &#x00b1; 4038.60</td>
                            <td align="left" colspan="1" rowspan="1" valign="top">6729.80 &#x00b1; 8053.79</td>
                        </tr>
                    </tbody>
                </table>
                <table content-type="article-table" frame="hsides">
                    <thead>
                        <tr>
                            <th align="left" colspan="6" rowspan="1" valign="top">B. Winter booster vaccination period (for 31 weeks)</th>
                        </tr>
                        <tr>
                            <th align="left" colspan="1" rowspan="2" valign="top"/>
                            <th align="left" colspan="4" rowspan="1" valign="top">Postings</th>
                            <th align="left" colspan="1" rowspan="2" valign="top">
Winter booster vaccination 
(N=6682925)</th>
                        </tr>
                        <tr>
                            <th align="left" colspan="1" rowspan="1" valign="top">Positive 
(N=20)</th>
                            <th align="left" colspan="1" rowspan="1" valign="top">Negative 
(N=254)</th>
                            <th align="left" colspan="1" rowspan="1" valign="top">Neutral 
(N=43)</th>
                            <th align="left" colspan="1" rowspan="1" valign="top">
Total 
(N=317)</th>
                        </tr>
                    </thead>
                    <tbody>
                        <tr>
                            <td align="left" colspan="1" rowspan="1" valign="top">Mean &#x00b1; SD (per week)</td>
                            <td align="left" colspan="1" rowspan="1" valign="top">0.65 &#x00b1; 0.97</td>
                            <td align="left" colspan="1" rowspan="1" valign="top">8.19 &#x00b1; 6.28</td>
                            <td align="left" colspan="1" rowspan="1" valign="top">1.39 &#x00b1; 1.45</td>
                            <td align="left" colspan="1" rowspan="1" valign="top">10.23 &#x00b1; 7.32</td>
                            <td align="left" colspan="1" rowspan="1" valign="top">228841.41 &#x00b1; 205373.47</td>
                        </tr>
                    </tbody>
                </table>
                <table-wrap-foot>
                    <p>SD, Standard Deviation.</p>
                </table-wrap-foot>
            </table-wrap>
            <sec id="sec12">
                <title>Relationship between attitudes toward COVID-19 vaccination in internet postings and actual vaccination rates</title>
                <p>Positive posts regarding vaccination attitudes showed a weak positive correlation with the first-dose vaccination counts. Negative posts on vaccines exhibited a moderately positive correlation with both first- and second-dose vaccinations. Posts expressing a neutral attitude showed a weak positive correlation with second-dose vaccinations (
                    <xref ref-type="table" rid="T2">
Table 2</xref>, 
                    <xref ref-type="fig" rid="f1">
Figure 1</xref>).</p>
                <table-wrap id="T2" orientation="portrait" position="float">
                    <label>
Table 2. </label>
                    <caption>
                        <title>Correlation between number of vaccinations and postings by attitude toward vaccines.</title>
                    </caption>
                    <table content-type="article-table" frame="hsides">
                        <thead>
                            <tr>
                                <th align="left" colspan="1" rowspan="1" valign="top"/>
                                <th align="left" colspan="1" rowspan="1" valign="top">1
                                    <sup>st</sup> Dose</th>
                                <th align="left" colspan="1" rowspan="1" valign="top">2
                                    <sup>nd</sup> Dose</th>
                                <th align="left" colspan="1" rowspan="1" valign="top">Total</th>
                                <th align="left" colspan="1" rowspan="1" valign="top">
Booster vaccination</th>
                            </tr>
                        </thead>
                        <tbody>
                            <tr>
                                <td align="left" colspan="1" rowspan="1" valign="top">Positive attitude</td>
                                <td align="left" colspan="1" rowspan="1" valign="top">0.27
                                    <sup>
                                        <xref ref-type="table-fn" rid="tfn2">*</xref>
                                    </sup>
                                </td>
                                <td align="left" colspan="1" rowspan="1" valign="top">0.22</td>
                                <td align="left" colspan="1" rowspan="1" valign="top">0.26
                                    <sup>
                                        <xref ref-type="table-fn" rid="tfn2">*</xref>
                                    </sup>
                                </td>
                                <td align="left" colspan="1" rowspan="1" valign="top">0.50
                                    <sup>
                                        <xref ref-type="table-fn" rid="tfn1">**</xref>
                                    </sup>
                                </td>
                            </tr>
                            <tr>
                                <td align="left" colspan="1" rowspan="1" valign="top">Negative attitude</td>
                                <td align="left" colspan="1" rowspan="1" valign="top">0.59
                                    <sup>
                                        <xref ref-type="table-fn" rid="tfn1">**</xref>
                                    </sup>
                                </td>
                                <td align="left" colspan="1" rowspan="1" valign="top">0.66
                                    <sup>
                                        <xref ref-type="table-fn" rid="tfn1">**</xref>
                                    </sup>
                                </td>
                                <td align="left" colspan="1" rowspan="1" valign="top">0.66
                                    <sup>
                                        <xref ref-type="table-fn" rid="tfn1">**</xref>
                                    </sup>
                                </td>
                                <td align="left" colspan="1" rowspan="1" valign="top">0.69
                                    <sup>
                                        <xref ref-type="table-fn" rid="tfn1">**</xref>
                                    </sup>
                                </td>
                            </tr>
                            <tr>
                                <td align="left" colspan="1" rowspan="1" valign="top">Neutral attitude</td>
                                <td align="left" colspan="1" rowspan="1" valign="top">0.24</td>
                                <td align="left" colspan="1" rowspan="1" valign="top">0.31
                                    <sup>
                                        <xref ref-type="table-fn" rid="tfn2">*</xref>
                                    </sup>
                                </td>
                                <td align="left" colspan="1" rowspan="1" valign="top">0.29
                                    <sup>
                                        <xref ref-type="table-fn" rid="tfn2">*</xref>
                                    </sup>
                                </td>
                                <td align="left" colspan="1" rowspan="1" valign="top">0.18</td>
                            </tr>
                            <tr>
                                <td align="left" colspan="1" rowspan="1" valign="top">Total number of postings</td>
                                <td align="left" colspan="1" rowspan="1" valign="top">0.58
                                    <sup>
                                        <xref ref-type="table-fn" rid="tfn1">**</xref>
                                    </sup>
                                </td>
                                <td align="left" colspan="1" rowspan="1" valign="top">0.65
                                    <sup>
                                        <xref ref-type="table-fn" rid="tfn1">**</xref>
                                    </sup>
                                </td>
                                <td align="left" colspan="1" rowspan="1" valign="top">0.65
                                    <sup>
                                        <xref ref-type="table-fn" rid="tfn1">**</xref>
                                    </sup>
                                </td>
                                <td align="left" colspan="1" rowspan="1" valign="top">0.69
                                    <sup>
                                        <xref ref-type="table-fn" rid="tfn1">**</xref>
                                    </sup>
                                </td>
                            </tr>
                        </tbody>
                    </table>
                    <table-wrap-foot>
                        <fn-group content-type="footnotes">
                            <fn id="tfn1">
                                <label>**</label>
                                <p>The correlation is significant at the 0.01 level (2-tailed).</p>
                            </fn>
                            <fn id="tfn2">
                                <label>*</label>
                                <p>The correlation is significant at the 0.05 level (2-tailed).</p>
                            </fn>
                        </fn-group>
                    </table-wrap-foot>
                </table-wrap>
                <fig fig-type="figure" id="f1" orientation="portrait" position="float">
                    <label>
Figure 1. </label>
                    <caption>
                        <title>Comparison between the number of posts about vaccination (left axis) and actual vaccination (right axis) for 60 weeks.</title>
                    </caption>
                    <graphic id="gr1" orientation="portrait" position="float" xlink:href="https://f1000research-files.f1000.com/manuscripts/176895/41a6b79a-b57e-481d-999b-ca225b9f1161_figure1.gif"/>
                </fig>
                <p>Regarding the booster vaccination count during the winter season, a strong correlation was observed between positive and negative posts regarding vaccination attitudes.</p>
                <p>The total counts of positive, negative, and neutral posts showed strong correlations with the counts of the first, second, and booster vaccinations. 
                    <xref ref-type="table" rid="T2">
Table 2</xref> presents the correlation results.</p>
            </sec>
            <sec id="sec13">
                <title>Extraction of attitudinal reasons toward vaccination and association analysis between reasons</title>
                <p>Using ChatGPT, we extracted the reasons for positive or negative attitudes toward vaccines from Internet posts. Among the 44 positive posts, the most prevalent (73%) was prevention, cited in 39 instances. For the 704 negative posts, the most common reason (28%) was distrust of the social system, which was mentioned in 194 cases. Two psychiatrists manually reviewed 100 posts and compared the outcomes in the extracted results. For the first reason per post, there were 58 instances of agreement, 37 for the second reason, and 14 for the third reason.</p>
                <p>An association analysis was also conducted for the reasons mentioned in the postings. However, this aspect was excluded from further study because of the low agreement rate (14%) for the third reason per post during the manual review. Cases in which reasons showed a high frequency of repetition were classified into item categories. The positive reasons for vaccination include prevention, symptom alleviation, reduced mortality rate, effectiveness, safety, fewer side effects, containment of the spread of infection, and immune system reinforcement. Negative reasons for vaccination included six items: mistrust of the social system, antipathy toward social oppression, side effects, concerns about side effects, lack of information, and perception of insufficient efficacy. If the first and second reasons for a post fell under the aforementioned classification, they were included in the association analysis. In the study of positive posts, 36 groups were used to analyze positive posts, and 381 groups were analyzed.</p>
                <p>In the analysis of positive reasons, &#x201c;Decreasing mortality rate&#x201d; is associated with &#x201c;Symptom alleviation,&#x201d; showcasing a support of 5.71%, confidence of 50.00%, and a lift of 2.92. Additionally, instances of &#x201c;Post-recovery symptom alleviation&#x201d; (support = 5.71%, confidence = 100.00%, lift = 1.21%), &#x201c;Immune activity strengthening&#x201d; (support = 17.14%, confidence = 100.00%, lift = 1.21%), &#x201c;Inhibition of infection spread&#x201d; (support = 31.43%, confidence = 100.00%, lift = 1.21%), and &#x201c;Safety&#x201d; (support = 14.29%, confidence = 83.33%, lift = 1.01%) are correlated with the item of &#x201c;Prevention.&#x201d;</p>
                <p>In the analysis of negative reasons, &#x2018;Antipathy to social oppression&#x2019; strongly correlates with &#x2018;Mistrust&#x2019; (support = 21.78%, confidence = 94.32%, lift = 2.16). Conversely, &#x2018;Mistrust&#x2019; is moderately associated with &#x2018;Antipathy to social oppression&#x2019; (support = 21.78%, confidence = 50.00%, lift = 2.16). Instances involving concerns about underlying conditions were linked to side effects (support = 29.13%, confidence = 91.74%, lift = 1.26). Conversely, side effects were associated with concerns about the underlying conditions (support = 29.13%, confidence = 40.07%, lift = 1.26). The presence of &#x201c;lack of information&#x201d; was strongly associated with&#x201d;unknown side effects&#x201d; (support = 23.62%, confidence = 90.91%, lift = 1.25). The overall trends in the association analyses are shown in 
                    <xref ref-type="fig" rid="f2">
Figure 2</xref>.</p>
                <fig fig-type="figure" id="f2" orientation="portrait" position="float">
                    <label>
Figure 2. </label>
                    <caption>
                        <title>Association rules visualization for vaccine sentiments: positive (A) and negative (B) reasons.</title>
                    </caption>
                    <graphic id="gr2" orientation="portrait" position="float" xlink:href="https://f1000research-files.f1000.com/manuscripts/176895/41a6b79a-b57e-481d-999b-ca225b9f1161_figure2.gif"/>
                </fig>
            </sec>
        </sec>
        <sec id="sec14" sec-type="discussion">
            <title>Discussion</title>
            <p>This study explored the relationship between professionals&#x2019; attitudes toward the COVID-19 vaccine as expressed in internet posts and actual vaccination rates. Additionally, it examined the positive and negative reasons associated with vaccine attitudes and investigated the relationships between them.</p>
            <p>The sentiment analysis conducted using OpenAI&#x2019;s gpt-3.5-turbo model demonstrated an 85% concordance rate when compared with evaluations by mental health specialists. This result indicates a substantial level of accuracy of the model and affirms the utility of automated analysis using natural language processing technology. Recent studies using deep-learning-based analysis models have reported sentiment analysis accuracies ranging from approximately 70% to 90%. These studies often evaluate sentiments in posts related to COVID-19 on major social media platforms, such as Twitter.
                <sup>
                    <xref ref-type="bibr" rid="ref15">24</xref>
                </sup>
                <sup>,</sup>
                <sup>
                    <xref ref-type="bibr" rid="ref16">25</xref>
                </sup> In particular, the accuracy for positive posts was significantly decreased in this study. Data imbalances in sentiment analysis could be a cause of this, leading to models overfitting. This results in proficient detection of negative sentiments&#x2014;due to their prevalence in training datasets&#x2014;and reduced accuracy in identifying positive sentiments.
                <sup>
                    <xref ref-type="bibr" rid="ref42">26</xref>
                </sup> The reasoning analysis in this study, which aimed at extracting the reasons behind positive or negative sentiments toward vaccines, showed a lower matching rate (58%). Compared to sentiment analysis, which gauges the overall sentiment, reasoning analysis delves into specific words or phrases that explain why a sentiment is expressed. This process involves a deeper understanding of language nuances and contextual cues, making it a more intricate task.
                <sup>
                    <xref ref-type="bibr" rid="ref17">27</xref>
                </sup> Furthermore, ChatGPT tends to include irrelevant content when producing results for short posts, making it challenging to infer the reasons. This behavior is likely attributable to the generative nature of ChatGPT, a characteristic inherent in creative AI models.
                <sup>
                    <xref ref-type="bibr" rid="ref11">11</xref>
                </sup>
            </p>
            <p>Previous survey studies have shown that negative opinions towards vaccines accounted for 39.8%,
                <sup>
                    <xref ref-type="bibr" rid="ref21">28</xref>
                </sup> whereas in the present study, negative opinions are markedly higher at 82%. These results are associated with certain social phenomena in which negative or extreme online content is prevalent. Cyber venting
                <sup>
                    <xref ref-type="bibr" rid="ref22">29</xref>
                </sup>
                <sup>,</sup>
                <sup>
                    <xref ref-type="bibr" rid="ref23">30</xref>
                </sup> is a phenomenon in which internet users express dissatisfaction, stress, or anger online, making it easy to express various negative emotions related to vaccination, discomfort about side effects, and social pressure. Moreover, the freedom to swiftly respond and exchange opinions through comments in anonymous online spaces can result in uninhibited emotional expressions.
                <sup>
                    <xref ref-type="bibr" rid="ref24">31</xref>
                </sup>
                <sup>,</sup>
                <sup>
                    <xref ref-type="bibr" rid="ref25">32</xref>
                </sup> This phenomenon was evident in the posts analyzed in this study, in which various expressions of anxiety and discomfort regarding vaccine side effects, derogatory remarks about compliant attitudes toward vaccination, and rumors related to vaccines were observed.</p>
            <p>In this study, the correlation analysis between the number of posts and vaccine doses administered indicated a strong correlation between the total number of positive, negative, and neutral posts and the counts of the first, second, and additional vaccine doses. Numerous studies have explored the relevance of social media platforms to epidemiological patterns and medical information.
                <sup>
                    <xref ref-type="bibr" rid="ref18">33</xref>
                </sup> For instance, even before the COVID-19 pandemic, research suggested a strong correlation between the regional distribution of social media posts related to infectious diseases and their actual spread.
                <sup>
                    <xref ref-type="bibr" rid="ref19">34</xref>
                </sup> Additionally, studies on trend analysis using search queries have shown associations with the spread of infectious diseases.
                <sup>
                    <xref ref-type="bibr" rid="ref20">35</xref>
                </sup> In this study, the diverse opinions expressed online before and after the vaccination could be considered a reflection of the various perspectives emerging on online platforms. The strong correlation between these negative opinions and actual vaccination rates can be inferred from the content of internet posts. In Korea, the total vaccination rate reported by the Korea Disease Control and Prevention Agency was very high at 96.9%. However, the content of internet posts revealed instances where individuals were vaccinated due to social discomfort or job requirements, as well as due to social pressure. Additionally, posts expressing social discomfort due to refusal to vaccinate were also found.</p>
            <p>In this study, implications were derived by analyzing the underlying factors influencing attitudes toward vaccines. For positive attitudes toward vaccines, individuals expected preventive effects, personal relief from symptoms and after-effects, and societal benefits, such as preventing the spread of infection. In contrast, negative attitudes toward vaccines were associated with resentment and distrust toward social oppression. This aligns with discussions in numerous studies during the pandemic, indicating that public distrust of efforts to prevent epidemics at the societal level can lead to strong resistance.
                <sup>
                    <xref ref-type="bibr" rid="ref26">36</xref>
                </sup>
                <sup>&#x2013;</sup>
                <sup>
                    <xref ref-type="bibr" rid="ref28">38</xref>
                </sup> Information gaps in various media channels regarding vaccines can reinforce public anxiety and contribute to negative attitudes.
                <sup>
                    <xref ref-type="bibr" rid="ref29">39</xref>
                </sup> Therefore, it is crucial to implement effective communication strategies and educational initiatives to address these concerns and promote a more informed perspective on vaccination.
                <sup>
                    <xref ref-type="bibr" rid="ref30">40</xref>
                </sup>
                <sup>,</sup>
                <sup>
                    <xref ref-type="bibr" rid="ref31">41</xref>
                </sup>
            </p>
            <p>The limitations of this study include its focus on SNS users among employed individuals, warranting future research that encompasses diverse demographic groups and regions for a more comprehensive understanding. Furthermore, research incorporating various factors is essential to investigate the relationship between opinion formation on Internet platforms and vaccination behavior. Additionally, limited number of posts poses significant constraints on this study, compared to other studies.
                <sup>
                    <xref ref-type="bibr" rid="ref6">6</xref>
                </sup>
                <sup>,</sup>
                <sup>
                    <xref ref-type="bibr" rid="ref19">34</xref>
                </sup> This limitation may be attributed to the fact that the BLIND community, which was the focus of the analysis, primarily targets professionals, resulting in a predominance of posts related to salary, career, and other job-related topics. The period of the study, which occurred about one year after the initial vaccine rollout in February 2022, coincided with a significant decrease in public interest in vaccines, which also likely contributed to the fewer posts related to vaccines.</p>
            <p>Moreover, the analysis of sentiment analysis models, including the gpt-3.5-turbo model, reveals additional limitations. These models may not fully capture the subtle nature of emotions, and the interpretation of emotional expressions can vary among individuals.
                <sup>
                    <xref ref-type="bibr" rid="ref11">11</xref>
                </sup> Therefore, involving a diverse range of reviewers across different age groups and backgrounds for manual review is crucial. Considering the gpt-3.5-turbo model, there is a limitation in fully understanding subtle nuances and cultural contexts. This is particularly evident in contexts like Korea, characterized by unique language styles, cultural norms, and demographic characteristics.
                <sup>
                    <xref ref-type="bibr" rid="ref13">13</xref>
                </sup>
            </p>
            <p>Building on these limitations, this study also encountered challenges in addressing data imbalance during sentiment analysis. Traditional undersampling methods were unsuitable due to the limited dataset size, as reducing the number of negative postings could compromise the reliability of correlation analysis. Similarly, oversampling techniques such as SMOTE, which generate synthetic data in vector form, were impractical for sentiment analysis using language models like ChatGPT. To address these issues, data augmentation was performed using ChatGPT
                <sup>
                    <xref ref-type="bibr" rid="ref45">42</xref>
                </sup> with controlled prompts to diversify positive and neutral data while maintaining contextual consistency.</p>
            <p>Despite these efforts, the sentiment analysis results did not improve after augmentation. Positive and neutral data proportions were increased to 25-30% of the total dataset, yet the accuracy declined, with neutral data frequently misclassified as negative. This decline likely stemmed from the nondeterministic nature of ChatGPT,
                <sup>
                    <xref ref-type="bibr" rid="ref46">43</xref>
                </sup> leading to inconsistencies in the augmented data, and potential quality degradation caused by inaccuracies in the initial analysis, particularly in ambiguous or sarcastic cases. These findings further underscore the complexities and limitations of applying language models like ChatGPT to sentiment analysis, particularly in culturally nuanced contexts.</p>
            <p>In conclusion, this study revealed the interaction between the public&#x2019;s perception of the COVID-19 vaccine expressed on social media and their actual vaccination behavior. The perception of risk and willingness to be vaccinated can be influenced by various mass media sources, such as the news. Opinions encountered on SNS, which people use, are also likely to significantly impact individuals&#x2019; perceptions of vaccines due to biased approaches to information and the phenomenon of conformity. Therefore, implementing social strategies that provide appropriate vaccine information in an accessible manner is crucial.</p>
            <sec id="sec15">
                <title>Ethical considerations</title>
                <p>This study was exempted from review by the National Health Insurance Ilsan Hospital Institutional Review Board (NHIMC-2023-08-028) 04/09/2023.</p>
            </sec>
        </sec>
    </body>
    <back>
        <sec id="sec16" sec-type="data-availability">
            <title>Data availability</title>
            <sec id="sec17">
                <title>Vaccination data</title>
                <p>Zenodo: Korea Disease Control and Prevention Agency daily COVID19 vaccination status, 
                    <ext-link ext-link-type="uri" xlink:href="https://zenodo.org/doi/10.5281/zenodo.10252895">https://zenodo.org/doi/10.5281/zenodo.10252895</ext-link>.
                    <sup>
                        <xref ref-type="bibr" rid="ref43">44</xref>
                    </sup>
                </p>
                <p>Vaccination data is accessible in the form of an Excel file. This file comprehensively includes vaccination rate information relevant to the research findings. COVID-19 vaccination information was collected from the Korea Disease Control and Prevention Agency daily vaccination status (
                    <ext-link ext-link-type="uri" xlink:href="https://ncv.kdca.go.kr/vaccineStatus.es?mid=a11710000000">https://ncv.kdca.go.kr/vaccineStatus.es?mid=a11710000000</ext-link>). For additional details or specific requests regarding data provision, feel free to contact us.</p>
            </sec>
            <sec id="sec18">
                <title>SNS crawling data</title>
                <p>The data used for SNS crawling in this study, acquired through social media crawling, cannot be shared due to ethical and copyright restrictions related to social media content. A comprehensive description of the methodology is presented in the Methods section, along with the Python code below, facilitating the replication of the study. For inquiries concerning the methodology, please direct any questions to the corresponding author.</p>
            </sec>
            <sec id="sec19">
                <title>Python code for data scraping and sentiment analysis</title>
                <p>

                    <ext-link ext-link-type="uri" xlink:href="https://github.com/bechungan/Scraping-and-Sentiment-Analysis-using-CGPT">https://github.com/bechungan/Scraping-and-Sentiment-Analysis-using-CGPT
</ext-link>.</p>
            </sec>
            <sec id="sec20">
                <title>Reporting guidelines</title>
                <p>Zenodo: STROBE checklist for Sentiment analysis of internet posts on vaccination using ChatGPT and comparison with actual vaccination rates in South Korea, 
                    <ext-link ext-link-type="uri" xlink:href="https://doi.org/10.5281/zenodo.10429910">https://doi.org/10.5281/zenodo.10429910</ext-link>.</p>
            </sec>
            <sec id="sec1.3">
                <title>Extended data</title>
                <p>Zenodo: The result of the confusion matrix for the accuracy evaluation of 100 posts. This project contains the data: confusion matrix.docx. 
                    <ext-link ext-link-type="uri" xlink:href="https://doi.org/10.5281/zenodo.13381133">https://doi.org/10.5281/zenodo.13381133</ext-link>.
                    <sup>
                        <xref ref-type="bibr" rid="ref44">45</xref>
                    </sup>
                </p>
            </sec>
        </sec>
        <ack>
            <title>Acknowledgments</title>
            <p>We extend our sincere appreciation to G-J Park at Cheongdam SL Clinic for invaluable guidance and assistance in data scraping and analysis. Additionally, we express our gratitude to Dr. J. Ahn, a psychiatrist, at Paju Psychiatric Clinic for providing valuable assistance in reviewing the data. Both individuals mentioned above have consented to being mentioned in the acknowledgment section.</p>
        </ack>
        <ref-list>
            <title>References</title>
            <ref id="ref1">
                <label>1</label>
                <mixed-citation publication-type="journal">
                    <person-group person-group-type="author">

                        <name name-style="western">
                            <surname>Choi</surname>
                            <given-names>S</given-names>
                        </name>
</person-group>:
                    <article-title>The roles of media capabilities of smartphone-based SNS in developing social capital.</article-title>
                    <source>

                        <italic toggle="yes">Behav. Inform. Technol.</italic>
</source>
                    <year>2019</year>;<volume>38</volume>(<issue>6</issue>):<fpage>609</fpage>&#x2013;<lpage>620</lpage>.
                    <pub-id pub-id-type="doi">10.1080/0144929X.2018.1546903</pub-id>
                </mixed-citation>
            </ref>
            <ref id="ref2">
                <label>2</label>
                <mixed-citation publication-type="journal">
                    <person-group person-group-type="author">

                        <name name-style="western">
                            <surname>Razai</surname>
                            <given-names>MS</given-names>
                        </name>

                        <name name-style="western">
                            <surname>Chaudhry</surname>
                            <given-names>UAR</given-names>
                        </name>

                        <name name-style="western">
                            <surname>Doerholt</surname>
                            <given-names>K</given-names>
                        </name>

                        <etal/>
</person-group>:
                    <article-title>Covid-19 vaccination hesitancy.</article-title>
                    <source>

                        <italic toggle="yes">BMJ.</italic>
</source>
                    <year>2021</year>;<volume>373</volume>.
                    <pub-id pub-id-type="doi">10.1136/bmj.n1138</pub-id>
                </mixed-citation>
            </ref>
            <ref id="ref3">
                <label>3</label>
                <mixed-citation publication-type="journal">
                    <person-group person-group-type="author">

                        <name name-style="western">
                            <surname>Noushad</surname>
                            <given-names>M</given-names>
                        </name>

                        <name name-style="western">
                            <surname>Rastam</surname>
                            <given-names>S</given-names>
                        </name>

                        <name name-style="western">
                            <surname>Nassani</surname>
                            <given-names>MZ</given-names>
                        </name>

                        <etal/>
</person-group>:
                    <article-title>A global survey of COVID-19 vaccine acceptance among healthcare workers.</article-title>
                    <source>

                        <italic toggle="yes">Front. Public Health.</italic>
</source>
                    <year>2021</year>;<volume>9</volume>:<fpage>794673</fpage>.</mixed-citation>
            </ref>
            <ref id="ref4">
                <label>4</label>
                <mixed-citation publication-type="journal">
                    <person-group person-group-type="author">

                        <name name-style="western">
                            <surname>Adane</surname>
                            <given-names>M</given-names>
                        </name>

                        <name name-style="western">
                            <surname>Ademas</surname>
                            <given-names>A</given-names>
                        </name>

                        <name name-style="western">
                            <surname>Kloos</surname>
                            <given-names>H</given-names>
                        </name>
</person-group>:
                    <article-title>Knowledge, attitudes, and perceptions of COVID-19 vaccine and refusal to receive COVID-19 vaccine among healthcare workers in northeastern Ethiopia.</article-title>
                    <source>

                        <italic toggle="yes">BMC Public Health.</italic>
</source>
                    <year>2022</year>;<volume>22</volume>(<issue>1</issue>):<fpage>128</fpage>.
                    <pub-id pub-id-type="pmid">35042476</pub-id>
                    <pub-id pub-id-type="doi">10.1186/s12889-021-12362-8</pub-id>
                    <pub-id pub-id-type="pmcid">PMC8765812</pub-id>
                </mixed-citation>
            </ref>
            <ref id="ref5">
                <label>5</label>
                <mixed-citation publication-type="journal">
                    <person-group person-group-type="author">

                        <name name-style="western">
                            <surname>Jing</surname>
                            <given-names>R</given-names>
                        </name>

                        <name name-style="western">
                            <surname>Fang</surname>
                            <given-names>H</given-names>
                        </name>

                        <name name-style="western">
                            <surname>Wang</surname>
                            <given-names>H</given-names>
                        </name>

                        <etal/>
</person-group>:
                    <article-title>The role of general attitudes and perceptions towards vaccination on the newly-developed vaccine: Results from a survey on COVID-19 vaccine acceptance in China.</article-title>
                    <source>

                        <italic toggle="yes">Front. Psychol.</italic>
</source>
                    <year>2022</year>;<volume>13</volume>:<fpage>841189</fpage>.
                    <pub-id pub-id-type="pmid">35712143</pub-id>
                    <pub-id pub-id-type="doi">10.3389/fpsyg.2022.841189</pub-id>
                    <pub-id pub-id-type="pmcid">PMC9194573</pub-id>
                </mixed-citation>
            </ref>
            <ref id="ref6">
                <label>6</label>
                <mixed-citation publication-type="journal">
                    <person-group person-group-type="author">

                        <name name-style="western">
                            <surname>Griffith</surname>
                            <given-names>J</given-names>
                        </name>

                        <name name-style="western">
                            <surname>Marani</surname>
                            <given-names>H</given-names>
                        </name>

                        <name name-style="western">
                            <surname>Monkman</surname>
                            <given-names>H</given-names>
                        </name>
</person-group>:
                    <article-title>COVID-19 vaccine hesitancy in Canada: Content analysis of tweets using the theoretical domains framework.</article-title>
                    <source>

                        <italic toggle="yes">J. Med. Internet Res.</italic>
</source>
                    <year>2021</year>;<volume>23</volume>(<issue>4</issue>):<fpage>e26874</fpage>.
                    <pub-id pub-id-type="pmid">33769946</pub-id>
                    <pub-id pub-id-type="doi">10.2196/26874</pub-id>
                    <pub-id pub-id-type="pmcid">PMC8045776</pub-id>
                </mixed-citation>
            </ref>
            <ref id="ref7">
                <label>7</label>
                <mixed-citation publication-type="journal">
                    <person-group person-group-type="author">

                        <name name-style="western">
                            <surname>Taboada</surname>
                            <given-names>M</given-names>
                        </name>
</person-group>:
                    <article-title>Sentiment Analysis: An Overview from Linguistics.</article-title>
                    <source>

                        <italic toggle="yes">Annu. Rev. Linguist.</italic>
</source>
                    <year>2016</year>;<volume>2</volume>:<fpage>325</fpage>&#x2013;<lpage>347</lpage>.
                    <pub-id pub-id-type="doi">10.1146/annurev-linguistics-011415-040518</pub-id>
                </mixed-citation>
            </ref>
            <ref id="ref8">
                <label>8</label>
                <mixed-citation publication-type="other">
                    <person-group person-group-type="author">

                        <name name-style="western">
                            <surname>Kenyon-Dean</surname>
                            <given-names>K</given-names>
                        </name>

                        <name name-style="western">
                            <surname>Ahmed</surname>
                            <given-names>E</given-names>
                        </name>

                        <name name-style="western">
                            <surname>Fujimoto</surname>
                            <given-names>S</given-names>
                        </name>

                        <etal/>
</person-group>, editors.
                    <article-title>Sentiment analysis: It&#x2019;s complicated!</article-title>
                    <source>

                        <italic toggle="yes">Proceedings of the 2018 Conference of the North American Chapter of the Association for Computational Linguistics: Human Language Technologies, Volume 1 (Long Papers).</italic>
</source>
                    <year>2018</year>.</mixed-citation>
            </ref>
            <ref id="ref9">
                <label>9</label>
                <mixed-citation publication-type="journal">
                    <person-group person-group-type="author">

                        <name name-style="western">
                            <surname>Birjali</surname>
                            <given-names>M</given-names>
                        </name>

                        <name name-style="western">
                            <surname>Kasri</surname>
                            <given-names>M</given-names>
                        </name>

                        <name name-style="western">
                            <surname>Beni-Hssane</surname>
                            <given-names>A</given-names>
                        </name>
</person-group>:
                    <article-title>A comprehensive survey on sentiment analysis: Approaches, challenges and trends.</article-title>
                    <source>

                        <italic toggle="yes">Knowl. Based Syst.</italic>
</source>
                    <year>2021</year>;<volume>226</volume>:<fpage>107134</fpage>.
                    <pub-id pub-id-type="doi">10.1016/j.knosys.2021.107134</pub-id>
                </mixed-citation>
            </ref>
            <ref id="ref10">
                <label>10</label>
                <mixed-citation publication-type="journal">
                    <person-group person-group-type="author">

                        <name name-style="western">
                            <surname>Orr&#x00f9;</surname>
                            <given-names>G</given-names>
                        </name>

                        <name name-style="western">
                            <surname>Piarulli</surname>
                            <given-names>A</given-names>
                        </name>

                        <name name-style="western">
                            <surname>Conversano</surname>
                            <given-names>C</given-names>
                        </name>

                        <etal/>
</person-group>:
                    <article-title>Human-like problem-solving abilities in large language models using ChatGPT.</article-title>
                    <source>

                        <italic toggle="yes">Front. Artif. Intell.</italic>
</source>
                    <year>2023</year>;<volume>6</volume>:<fpage>1199350</fpage>.
                    <pub-id pub-id-type="pmid">37293238</pub-id>
                    <pub-id pub-id-type="doi">10.3389/frai.2023.1199350</pub-id>
                    <pub-id pub-id-type="pmcid">PMC10244637</pub-id>
                </mixed-citation>
            </ref>
            <ref id="ref11">
                <label>11</label>
                <mixed-citation publication-type="journal">
                    <person-group person-group-type="author">

                        <name name-style="western">
                            <surname>Kalla</surname>
                            <given-names>D</given-names>
                        </name>

                        <name name-style="western">
                            <surname>Smith</surname>
                            <given-names>N</given-names>
                        </name>
</person-group>:
                    <article-title>Study and Analysis of Chat GPT and its Impact on Different Fields of Study.</article-title>
                    <source>

                        <italic toggle="yes">Int. J. Innov. Sci. Res. Technol.</italic>
</source>
                    <year>2023</year>;<volume>8</volume>(<issue>3</issue>).
                    <pub-id pub-id-type="doi">10.1007/978-3-031-43803-5</pub-id>
                </mixed-citation>
            </ref>
            <ref id="ref12">
                <label>12</label>
                <mixed-citation publication-type="journal">
                    <person-group person-group-type="author">

                        <name name-style="western">
                            <surname>Wankhade</surname>
                            <given-names>M</given-names>
                        </name>

                        <name name-style="western">
                            <surname>Rao</surname>
                            <given-names>ACS</given-names>
                        </name>

                        <name name-style="western">
                            <surname>Kulkarni</surname>
                            <given-names>C</given-names>
                        </name>
</person-group>:
                    <article-title>A survey on sentiment analysis methods, applications, and challenges.</article-title>
                    <source>

                        <italic toggle="yes">Artif. Intell. Rev.</italic>
</source>
                    <year>2022</year>;<volume>55</volume>(<issue>7</issue>):<fpage>5731</fpage>&#x2013;<lpage>5780</lpage>.
                    <pub-id pub-id-type="doi">10.1007/s10462-022-10144-1</pub-id>
                </mixed-citation>
            </ref>
            <ref id="ref13">
                <label>13</label>
                <mixed-citation publication-type="journal">
                    <person-group person-group-type="author">

                        <name name-style="western">
                            <surname>Lee</surname>
                            <given-names>G-m</given-names>
                        </name>

                        <name name-style="western">
                            <surname>Song</surname>
                            <given-names>S</given-names>
                        </name>
</person-group>:
                    <article-title>Can Korean Language Models Detect Social Registers in Utterances?</article-title>
                    <source>

                        <italic toggle="yes">Korean J. Appl. Linguist.</italic>
</source>
                    <year>2023</year>;<volume>48</volume>(<issue>2</issue>):<fpage>585</fpage>&#x2013;<lpage>605</lpage>.</mixed-citation>
            </ref>
            <ref id="ref32">
                <label>14</label>
                <mixed-citation publication-type="journal">
                    <person-group person-group-type="author">

                        <name name-style="western">
                            <surname>&#x00c7;&#x0131;lg&#x0131;n</surname>
                            <given-names>C</given-names>
                        </name>

                        <name name-style="western">
                            <surname>G&#x00f6;k&#x00e7;en</surname>
                            <given-names>H</given-names>
                        </name>

                        <name name-style="western">
                            <surname>G&#x00f6;k&#x015f;en</surname>
                            <given-names>Y</given-names>
                        </name>
</person-group>:
                    <article-title>Sentiment analysis of public sensitivity to COVID-19 vaccines on Twitter by majority voting classifier-based machine learning Twitter&#x2019;da COVID-19 a&#x015f;&#x0131;lar&#x0131;na kar&#x015f;&#x0131; kamu duyarl&#x0131;l&#x0131;&#x011f;&#x0131;n&#x0131;n &#x00e7;o&#x011f;unluk oylama s&#x0131;n&#x0131;fland&#x0131;r&#x0131;c&#x0131;s&#x0131; temelli makine &#x00f6;&#x011f;renmesi ile duygu analizi.</article-title>
                    <source>

                        <italic toggle="yes">J. Fac. Eng. Archit. Gazi Univ.</italic>
</source>
                    <year>2023</year>;<volume>38</volume>(<issue>2</issue>):<fpage>1093</fpage>&#x2013;<lpage>1104</lpage>.
                    <pub-id pub-id-type="doi">10.17341/gazimmfd.1030198</pub-id>
                </mixed-citation>
            </ref>
            <ref id="ref33">
                <label>15</label>
                <mixed-citation publication-type="journal">
                    <person-group person-group-type="author">

                        <name name-style="western">
                            <surname>Wang</surname>
                            <given-names>H</given-names>
                        </name>

                        <name name-style="western">
                            <surname>Li</surname>
                            <given-names>Y</given-names>
                        </name>

                        <name name-style="western">
                            <surname>Hutch</surname>
                            <given-names>MR</given-names>
                        </name>

                        <etal/>
</person-group>:
                    <article-title>Patterns of diverse and changing sentiments towards COVID-19 vaccines: a sentiment analysis study integrating 11 million tweets and surveillance data across over 180 countries.</article-title>
                    <source>

                        <italic toggle="yes">J. Am. Med. Inform. Assoc.</italic>
</source>
                    <year>2023</year>;<volume>30</volume>(<issue>5</issue>):<fpage>923</fpage>&#x2013;<lpage>931</lpage>.
                    <pub-id pub-id-type="pmid">36821435</pub-id>
                    <pub-id pub-id-type="doi">10.1093/jamia/ocad029</pub-id>
                    <pub-id pub-id-type="pmcid">PMC10114113</pub-id>
                </mixed-citation>
            </ref>
            <ref id="ref34">
                <label>16</label>
                <mixed-citation publication-type="journal">
                    <person-group person-group-type="author">

                        <name name-style="western">
                            <surname>Greyling</surname>
                            <given-names>T</given-names>
                        </name>

                        <name name-style="western">
                            <surname>Rossouw</surname>
                            <given-names>S</given-names>
                        </name>
</person-group>:
                    <article-title>Positive attitudes towards COVID-19 vaccines: A cross-country analysis.</article-title>
                    <source>

                        <italic toggle="yes">PLoS One.</italic>
</source>
                    <year>2022</year>;<volume>17</volume>(<issue>3</issue>): e0264994.
                    <pub-id pub-id-type="pmid">35271637</pub-id>
                    <pub-id pub-id-type="doi">10.1371/journal.pone.0264994</pub-id>
                    <pub-id pub-id-type="pmcid">PMC8912241</pub-id>
                </mixed-citation>
            </ref>
            <ref id="ref36">
                <label>17</label>
                <mixed-citation publication-type="journal">
                    <person-group person-group-type="author">

                        <name name-style="western">
                            <surname>Cheng</surname>
                            <given-names>T</given-names>
                        </name>

                        <name name-style="western">
                            <surname>Han</surname>
                            <given-names>B</given-names>
                        </name>

                        <name name-style="western">
                            <surname>Liu</surname>
                            <given-names>Y</given-names>
                        </name>
</person-group>:
                    <article-title>Exploring public sentiment and vaccination uptake of COVID-19 vaccines in England: a spatiotemporal and sociodemographic analysis of Twitter data.</article-title>
                    <source>Front. Public Health.</source>
                    <year>2023</year>;<volume>11</volume>:<fpage>1193750</fpage>.
                    <pub-id pub-id-type="pmid">37663835</pub-id>
                    <pub-id pub-id-type="doi">10.3389/fpubh.2023.1193750</pub-id>
                    <pub-id pub-id-type="pmcid">PMC10470640</pub-id>
                </mixed-citation>
            </ref>
            <ref id="ref37">
                <label>18</label>
                <mixed-citation publication-type="journal">
                    <person-group person-group-type="author">

                        <name name-style="western">
                            <surname>Osuji</surname>
                            <given-names>VC</given-names>
                        </name>

                        <name name-style="western">
                            <surname>Galante</surname>
                            <given-names>EM</given-names>
                        </name>

                        <name name-style="western">
                            <surname>Mischoulon</surname>
                            <given-names>D</given-names>
                        </name>

                        <etal/>
</person-group>:
                    <article-title>COVID-19 vaccine: A 2021 analysis of perceptions on vaccine safety and promise in a US sample.</article-title>
                    <source>

                        <italic toggle="yes">PLoS One.</italic>
</source>
                    <year>2022</year>;<volume>17</volume>(<issue>5</issue>): e0268784.
                    <pub-id pub-id-type="pmid">35587947</pub-id>
                    <pub-id pub-id-type="doi">10.1371/journal.pone.0268784</pub-id>
                    <pub-id pub-id-type="pmcid">PMC9119541</pub-id>
                </mixed-citation>
            </ref>
            <ref id="ref38">
                <label>19</label>
                <mixed-citation publication-type="journal">
                    <person-group person-group-type="author">

                        <name name-style="western">
                            <surname>Wankhade</surname>
                            <given-names>M</given-names>
                        </name>

                        <name name-style="western">
                            <surname>Rao</surname>
                            <given-names>ACS</given-names>
                        </name>

                        <name name-style="western">
                            <surname>Kulkarni</surname>
                            <given-names>C</given-names>
                        </name>
</person-group>:
                    <article-title>A survey on sentiment analysis methods, applications, and challenges.</article-title>
                    <source>

                        <italic toggle="yes">Artif. Intell. Rev.</italic>
</source>
                    <year>2022</year>;<volume>55</volume>(<issue>7</issue>):<fpage>5731</fpage>&#x2013;<lpage>5780</lpage>.
                    <pub-id pub-id-type="doi">10.1007/s10462-022-10144-1</pub-id>
                </mixed-citation>
            </ref>
            <ref id="ref39">
                <label>20</label>
                <mixed-citation publication-type="journal">
                    <person-group person-group-type="author">

                        <name name-style="western">
                            <surname>Kim</surname>
                            <given-names>Y</given-names>
                        </name>

                        <name name-style="western">
                            <surname>Kang</surname>
                            <given-names>M</given-names>
                        </name>

                        <name name-style="western">
                            <surname>Jeong</surname>
                            <given-names>SR</given-names>
                        </name>
</person-group>:
                    <article-title>Text mining and sentiment analysis for predicting box office success.</article-title>
                    <source>

                        <italic toggle="yes">KSII T Internet Info.</italic>
</source>
                    <year>2018</year>;<volume>12</volume>(<issue>8</issue>):<fpage>4090</fpage>&#x2013;<lpage>4102</lpage>.</mixed-citation>
            </ref>
            <ref id="ref40">
                <label>21</label>
                <mixed-citation publication-type="other">
                    <person-group person-group-type="author">

                        <name name-style="western">
                            <surname>Jang</surname>
                            <given-names>H</given-names>
                        </name>

                        <name name-style="western">
                            <surname>Shin</surname>
                            <given-names>H</given-names>
                        </name>
</person-group>:
                    <article-title>Effective use of linguistic features for sentiment analysis of korean. Proceedings of the 24th Pacific Asia Conference on Language, Information and Computation.</article-title>Waseda University.<year>2011</year>.</mixed-citation>
            </ref>
            <ref id="ref41">
                <label>22</label>
                <mixed-citation publication-type="journal">
                    <person-group person-group-type="author">

                        <name name-style="western">
                            <surname>Sudirjo</surname>
                            <given-names>F</given-names>
                        </name>

                        <name name-style="western">
                            <surname>Diantoro</surname>
                            <given-names>K</given-names>
                        </name>

                        <name name-style="western">
                            <surname>Al-Gasawneh</surname>
                            <given-names>JA</given-names>
                        </name>

                        <etal/>
</person-group>:
                    <article-title>Application of ChatGPT in Improving Customer Sentiment Analysis for Businesses.</article-title>
                    <source>

                        <italic toggle="yes">JTEKSIS.</italic>
</source>
                    <year>2023</year>;<volume>5</volume>(<issue>3</issue>):<fpage>283</fpage>&#x2013;<lpage>288</lpage>.
                    <pub-id pub-id-type="doi">10.47233/jteksis.v5i3.871</pub-id>
                </mixed-citation>
            </ref>
            <ref id="ref14">
                <label>23</label>
                <mixed-citation publication-type="journal">
                    <person-group person-group-type="author">

                        <name name-style="western">
                            <surname>Lee</surname>
                            <given-names>S</given-names>
                        </name>
</person-group>:
                    <article-title>Detection of political manipulation in online communities through measures of effort and collaboration.</article-title>
                    <source>

                        <italic toggle="yes">ACM Trans. Web.</italic>
</source>
                    <year>2015</year>;<volume>9</volume>(<issue>3</issue>):<fpage>1</fpage>&#x2013;<lpage>24</lpage>.
                    <pub-id pub-id-type="doi">10.1145/2767134</pub-id>
                </mixed-citation>
            </ref>
            <ref id="ref15">
                <label>24</label>
                <mixed-citation publication-type="other">
                    <person-group person-group-type="author">

                        <name name-style="western">
                            <surname>Chakraborty</surname>
                            <given-names>K</given-names>
                        </name>

                        <name name-style="western">
                            <surname>Bhatia</surname>
                            <given-names>S</given-names>
                        </name>

                        <name name-style="western">
                            <surname>Bhattacharyya</surname>
                            <given-names>S</given-names>
                        </name>

                        <etal/>
</person-group>:
                    <article-title>Sentiment Analysis of COVID-19 tweets by Deep Learning Classifiers&#x2014;A study to show how popularity is affecting accuracy in social media.</article-title>
                    <year>2020</year>;<volume>97</volume>:<fpage>106754</fpage>.</mixed-citation>
            </ref>
            <ref id="ref16">
                <label>25</label>
                <mixed-citation publication-type="journal">
                    <person-group person-group-type="author">

                        <name name-style="western">
                            <surname>Liu</surname>
                            <given-names>C</given-names>
                        </name>

                        <name name-style="western">
                            <surname>Fang</surname>
                            <given-names>F</given-names>
                        </name>

                        <name name-style="western">
                            <surname>Lin</surname>
                            <given-names>X</given-names>
                        </name>

                        <etal/>
</person-group>:
                    <article-title>Improving sentiment analysis accuracy with emoji embedding.</article-title>
                    <source>

                        <italic toggle="yes">J. Saf. Sci. Resil.</italic>
</source>
                    <year>2021</year>;<volume>2</volume>(<issue>4</issue>):<fpage>246</fpage>&#x2013;<lpage>252</lpage>.
                    <pub-id pub-id-type="doi">10.1016/j.jnlssr.2021.10.003</pub-id>
                </mixed-citation>
            </ref>
            <ref id="ref42">
                <label>26</label>
                <mixed-citation publication-type="journal">
                    <person-group person-group-type="author">

                        <name name-style="western">
                            <surname>Kochanek</surname>
                            <given-names>M</given-names>
                        </name>

                        <name name-style="western">
                            <surname>Cichecki</surname>
                            <given-names>I</given-names>
                        </name>

                        <name name-style="western">
                            <surname>Kaszyca</surname>
                            <given-names>O</given-names>
                        </name>

                        <etal/>
</person-group>:
                    <article-title>Improving Training Dataset Balance with ChatGPT Prompt Engineering.</article-title>
                    <source>

                        <italic toggle="yes">Electronics.</italic>
</source>
                    <year>2024</year>;<volume>13</volume>(<issue>12</issue>):<fpage>2255</fpage>.
                    <pub-id pub-id-type="doi">10.3390/electronics13122255</pub-id>
                </mixed-citation>
            </ref>
            <ref id="ref17">
                <label>27</label>
                <mixed-citation publication-type="journal">
                    <person-group person-group-type="author">

                        <name name-style="western">
                            <surname>Zhou</surname>
                            <given-names>F</given-names>
                        </name>

                        <name name-style="western">
                            <surname>Jianxin Jiao</surname>
                            <given-names>R</given-names>
                        </name>

                        <name name-style="western">
                            <surname>Linsey</surname>
                            <given-names>JS</given-names>
                        </name>
</person-group>:
                    <article-title>Latent customer needs elicitation by use case analogical reasoning from sentiment analysis of online product reviews.</article-title>
                    <source>

                        <italic toggle="yes">J. Mech. Des.</italic>
</source>
                    <year>2015</year>;<volume>137</volume>(<issue>7</issue>):<fpage>071401</fpage>.
                    <pub-id pub-id-type="doi">10.1115/1.4030159</pub-id>
                </mixed-citation>
            </ref>
            <ref id="ref21">
                <label>28</label>
                <mixed-citation publication-type="journal">
                    <person-group person-group-type="author">

                        <name name-style="western">
                            <surname>Hwang</surname>
                            <given-names>SE</given-names>
                        </name>

                        <name name-style="western">
                            <surname>Kim</surname>
                            <given-names>W-H</given-names>
                        </name>

                        <name name-style="western">
                            <surname>Heo</surname>
                            <given-names>J</given-names>
                        </name>
</person-group>:
                    <article-title>Socio-demographic, psychological, and experiential predictors of COVID-19 vaccine hesitancy in South Korea, October-December 2020.</article-title>
                    <source>

                        <italic toggle="yes">Hum. Vaccin. Immunother.</italic>
</source>
                    <year>2022</year>;<volume>18</volume>(<issue>1</issue>):<fpage>1</fpage>&#x2013;<lpage>8</lpage>.
                    <pub-id pub-id-type="pmid">34614382</pub-id>
                    <pub-id pub-id-type="doi">10.1080/21645515.2021.1983389</pub-id>
                    <pub-id pub-id-type="pmcid">PMC8920123</pub-id>
                </mixed-citation>
            </ref>
            <ref id="ref22">
                <label>29</label>
                <mixed-citation publication-type="journal">
                    <person-group person-group-type="author">

                        <name name-style="western">
                            <surname>R&#x00f6;sner</surname>
                            <given-names>L</given-names>
                        </name>

                        <name name-style="western">
                            <surname>Kr&#x00e4;mer</surname>
                            <given-names>NC</given-names>
                        </name>
</person-group>:
                    <article-title>Verbal venting in the social web: Effects of anonymity and group norms on aggressive language use in online comments.</article-title>
                    <source>

                        <italic toggle="yes">Soc. Media Soc.</italic>
</source>
                    <year>2016</year>;<volume>2</volume>(<issue>3</issue>):<fpage>205630511666422</fpage>.
                    <pub-id pub-id-type="doi">10.1177/2056305116664220</pub-id>
                </mixed-citation>
            </ref>
            <ref id="ref23">
                <label>30</label>
                <mixed-citation publication-type="journal">
                    <person-group person-group-type="author">

                        <name name-style="western">
                            <surname>Rodr&#x00ed;guez-Hidalgo</surname>
                            <given-names>C</given-names>
                        </name>

                        <name name-style="western">
                            <surname>Tan</surname>
                            <given-names>ES</given-names>
                        </name>

                        <name name-style="western">
                            <surname>Verlegh</surname>
                            <given-names>PW</given-names>
                        </name>
</person-group>:
                    <article-title>Expressing emotions in blogs: The role of textual paralinguistic cues in online venting and social sharing posts.</article-title>
                    <source>

                        <italic toggle="yes">Comput. Hum. Behav.</italic>
</source>
                    <year>2017</year>;<volume>73</volume>:<fpage>638</fpage>&#x2013;<lpage>649</lpage>.
                    <pub-id pub-id-type="doi">10.1016/j.chb.2017.04.007</pub-id>
                </mixed-citation>
            </ref>
            <ref id="ref24">
                <label>31</label>
                <mixed-citation publication-type="journal">
                    <person-group person-group-type="author">

                        <name name-style="western">
                            <surname>Suler</surname>
                            <given-names>J</given-names>
                        </name>
</person-group>:
                    <article-title>The online disinhibition effect.</article-title>
                    <source>

                        <italic toggle="yes">Cyberpsychol. Behav.</italic>
</source>
                    <year>2004</year>;<volume>7</volume>(<issue>3</issue>):<fpage>321</fpage>&#x2013;<lpage>326</lpage>.
                    <pub-id pub-id-type="doi">10.1089/1094931041291295</pub-id>
                </mixed-citation>
            </ref>
            <ref id="ref25">
                <label>32</label>
                <mixed-citation publication-type="journal">
                    <person-group person-group-type="author">

                        <name name-style="western">
                            <surname>Lapidot-Lefler</surname>
                            <given-names>N</given-names>
                        </name>

                        <name name-style="western">
                            <surname>Barak</surname>
                            <given-names>A</given-names>
                        </name>
</person-group>:
                    <article-title>Effects of anonymity, invisibility, and lack of eye-contact on toxic online disinhibition.</article-title>
                    <source>

                        <italic toggle="yes">Comput. Hum. Behav.</italic>
</source>
                    <year>2012</year>;<volume>28</volume>(<issue>2</issue>):<fpage>434</fpage>&#x2013;<lpage>443</lpage>.
                    <pub-id pub-id-type="doi">10.1016/j.chb.2011.10.014</pub-id>
                </mixed-citation>
            </ref>
            <ref id="ref18">
                <label>33</label>
                <mixed-citation publication-type="journal">
                    <person-group person-group-type="author">

                        <name name-style="western">
                            <surname>Mkhize</surname>
                            <given-names>PL</given-names>
                        </name>
</person-group>:
                    <article-title>Effect of social trust on health information exchange in social network sites.</article-title>
                    <source>

                        <italic toggle="yes">S. Afr. J. Inf. Manag.</italic>
</source>
                    <year>2023</year>;<volume>25</volume>(<issue>1</issue>):<fpage>1539</fpage>.
                    <pub-id pub-id-type="doi">10.4102/sajim.v25i1.1539</pub-id>
                </mixed-citation>
            </ref>
            <ref id="ref19">
                <label>34</label>
                <mixed-citation publication-type="journal">
                    <person-group person-group-type="author">

                        <name name-style="western">
                            <surname>Signorini</surname>
                            <given-names>A</given-names>
                        </name>

                        <name name-style="western">
                            <surname>Segre</surname>
                            <given-names>AM</given-names>
                        </name>

                        <name name-style="western">
                            <surname>Polgreen</surname>
                            <given-names>PM</given-names>
                        </name>
</person-group>:
                    <article-title>The Use of Twitter to Track Levels of Disease Activity and Public Concern in the U.S. during the Influenza A H1N1 Pandemic.</article-title>
                    <source>

                        <italic toggle="yes">PLoS One.</italic>
</source>
                    <year>2011</year>;<volume>6</volume>(<issue>5</issue>):<fpage>e19467</fpage>.
                    <pub-id pub-id-type="pmid">21573238</pub-id>
                    <pub-id pub-id-type="doi">10.1371/journal.pone.0019467</pub-id>
                    <pub-id pub-id-type="pmcid">PMC3087759</pub-id>
                </mixed-citation>
            </ref>
            <ref id="ref20">
                <label>35</label>
                <mixed-citation publication-type="journal">
                    <person-group person-group-type="author">

                        <name name-style="western">
                            <surname>Mavragani</surname>
                            <given-names>A</given-names>
                        </name>

                        <name name-style="western">
                            <surname>Gkillas</surname>
                            <given-names>K</given-names>
                        </name>
</person-group>:
                    <article-title>COVID-19 predictability in the United States using Google Trends time series.</article-title>
                    <source>

                        <italic toggle="yes">Sci. Rep.</italic>
</source>
                    <year>2020</year>;<volume>10</volume>(<issue>1</issue>):<fpage>20693</fpage>.
                    <pub-id pub-id-type="pmid">33244028</pub-id>
                    <pub-id pub-id-type="doi">10.1038/s41598-020-77275-9</pub-id>
                    <pub-id pub-id-type="pmcid">PMC7692493</pub-id>
                </mixed-citation>
            </ref>
            <ref id="ref26">
                <label>36</label>
                <mixed-citation publication-type="journal">
                    <person-group person-group-type="author">

                        <name name-style="western">
                            <surname>Lin</surname>
                            <given-names>C</given-names>
                        </name>

                        <name name-style="western">
                            <surname>Parker</surname>
                            <given-names>T</given-names>
                        </name>

                        <name name-style="western">
                            <surname>Pejavara</surname>
                            <given-names>K</given-names>
                        </name>

                        <etal/>
</person-group>:
                    <article-title>&#x201c;I Would Never Push a Vaccine on You&#x201d;: A Qualitative Study of Social Norms and Pressure in Vaccine Behavior in the US.</article-title>
                    <source>

                        <italic toggle="yes">Vaccines.</italic>
</source>
                    <year>2022</year>;<volume>10</volume>(<issue>9</issue>):<fpage>1402</fpage>.
                    <pub-id pub-id-type="pmid">36146480</pub-id>
                    <pub-id pub-id-type="doi">10.3390/vaccines10091402</pub-id>
                    <pub-id pub-id-type="pmcid">PMC9502292</pub-id>
                </mixed-citation>
            </ref>
            <ref id="ref27">
                <label>37</label>
                <mixed-citation publication-type="journal">
                    <person-group person-group-type="author">

                        <name name-style="western">
                            <surname>Wu</surname>
                            <given-names>Z-X</given-names>
                        </name>

                        <name name-style="western">
                            <surname>Zhang</surname>
                            <given-names>H-F</given-names>
                        </name>
</person-group>:
                    <article-title>Peer pressure is a double-edged sword in vaccination dynamics.</article-title>
                    <source>

                        <italic toggle="yes">EPL.</italic>
</source>
                    <year>2013</year>;<volume>104</volume>(<issue>1</issue>):<fpage>10002</fpage>.
                    <pub-id pub-id-type="doi">10.1209/0295-5075/104/10002</pub-id>
                </mixed-citation>
            </ref>
            <ref id="ref28">
                <label>38</label>
                <mixed-citation publication-type="journal">
                    <person-group person-group-type="author">

                        <name name-style="western">
                            <surname>Decoteau</surname>
                            <given-names>CL</given-names>
                        </name>

                        <name name-style="western">
                            <surname>Sweet</surname>
                            <given-names>PL</given-names>
                        </name>
</person-group>:
                    <article-title>Vaccine Hesitancy and the Accumulation of Distrust.</article-title>
                    <source>

                        <italic toggle="yes">Soc. Probl.</italic>
</source>
                    <year>2023</year>;<fpage>spad006</fpage>.
                    <pub-id pub-id-type="doi">10.1093/socpro/spad006</pub-id>
                </mixed-citation>
            </ref>
            <ref id="ref29">
                <label>39</label>
                <mixed-citation publication-type="journal">
                    <person-group person-group-type="author">

                        <name name-style="western">
                            <surname>Lee</surname>
                            <given-names>W</given-names>
                        </name>

                        <name name-style="western">
                            <surname>Ahn</surname>
                            <given-names>S</given-names>
                        </name>
</person-group>:
                    <article-title>Risk Perception and Vaccination Intention towards COVID-19 News.</article-title>
                    <source>

                        <italic toggle="yes">Korean Journal of Journalism &amp; Communication Studies.</italic>
</source>
                    <year>2022</year>;<volume>66</volume>(<issue>6</issue>):<fpage>388</fpage>&#x2013;<lpage>425</lpage>.
                    <pub-id pub-id-type="doi">10.20879/kjjcs.2022.66.6.011</pub-id>
                </mixed-citation>
            </ref>
            <ref id="ref30">
                <label>40</label>
                <mixed-citation publication-type="journal">
                    <person-group person-group-type="author">

                        <name name-style="western">
                            <surname>Palmedo</surname>
                            <given-names>PC</given-names>
                        </name>

                        <name name-style="western">
                            <surname>Rauh</surname>
                            <given-names>L</given-names>
                        </name>

                        <name name-style="western">
                            <surname>Lathan</surname>
                            <given-names>HS</given-names>
                        </name>

                        <etal/>
</person-group>:
                    <article-title>Exploring distrust in the wait and see: Lessons for vaccine communication.</article-title>
                    <source>

                        <italic toggle="yes">Am. Behav. Sci.</italic>
</source>
                    <year>2022</year>;<fpage>000276422110628</fpage>.
                    <pub-id pub-id-type="doi">10.1177/00027642211062865</pub-id>
                </mixed-citation>
            </ref>
            <ref id="ref31">
                <label>41</label>
                <mixed-citation publication-type="journal">
                    <person-group person-group-type="author">

                        <name name-style="western">
                            <surname>Larson</surname>
                            <given-names>HJ</given-names>
                        </name>

                        <name name-style="western">
                            <surname>Jarrett</surname>
                            <given-names>C</given-names>
                        </name>

                        <name name-style="western">
                            <surname>Eckersberger</surname>
                            <given-names>E</given-names>
                        </name>

                        <etal/>
</person-group>:
                    <article-title>Understanding vaccine hesitancy around vaccines and vaccination from a global perspective: a systematic review of published literature, 2007&#x2013;2012.</article-title>
                    <source>

                        <italic toggle="yes">Vaccine.</italic>
</source>
                    <year>2014</year>;<volume>32</volume>(<issue>19</issue>):<fpage>2150</fpage>&#x2013;<lpage>2159</lpage>.
                    <pub-id pub-id-type="pmid">24598724</pub-id>
                    <pub-id pub-id-type="doi">10.1016/j.vaccine.2014.01.081</pub-id>
                </mixed-citation>
            </ref>
            <ref id="ref45">
                <label>42</label>
                <mixed-citation publication-type="other">
                    <person-group person-group-type="author">

                        <name name-style="western">
                            <surname>Dai</surname>
                            <given-names>H</given-names>
                        </name>

                        <name name-style="western">
                            <surname>Liu</surname>
                            <given-names>Z</given-names>
                        </name>

                        <name name-style="western">
                            <surname>Liao</surname>
                            <given-names>W</given-names>
                        </name>

                        <etal/>
</person-group>:
                    <article-title>Auggpt: Leveraging chatgpt for text data augmentation.</article-title>
                    <source>

                        <italic toggle="yes">arXiv.</italic>
</source>
                    <year>2023</year>; preprint arXiv:230213007.
                    <pub-id pub-id-type="doi">10.48550/arXiv.2302.13007</pub-id>
                </mixed-citation>
            </ref>
            <ref id="ref46">
                <label>43</label>
                <mixed-citation publication-type="journal">
                    <person-group person-group-type="author">

                        <name name-style="western">
                            <surname>Ouyang</surname>
                            <given-names>T</given-names>
                        </name>

                        <name name-style="western">
                            <surname>MaungMaung</surname>
                            <given-names>A</given-names>
                        </name>

                        <name name-style="western">
                            <surname>Konishi</surname>
                            <given-names>K</given-names>
                        </name>

                        <etal/>
</person-group>:
                    <article-title>Stability analysis of chatgpt-based sentiment analysis in ai quality assurance.</article-title>
                    <source>

                        <italic toggle="yes">Electronics.</italic>
</source>
                    <year>2024</year>;<volume>13</volume>(<issue>24</issue>):<fpage>5043</fpage>.
                    <pub-id pub-id-type="doi">10.3390/electronics13245043</pub-id>
                </mixed-citation>
            </ref>
            <ref id="ref43">
                <label>44</label>
                <mixed-citation publication-type="data">
                    <collab>Korea Disease Control and Prevention Agency</collab>:
                    <data-title>Korea Disease Control and Prevention Agency daily COVID19 vaccination status (https://ncv.kdca.go.kr/vaccineStatus.es?mid=a11710000000). Zenodo.</data-title>[Dataset].<year>2023</year>.
                    <pub-id pub-id-type="doi">10.5281/zenodo.10252896</pub-id>
                </mixed-citation>
            </ref>
            <ref id="ref44">
                <label>45</label>
                <mixed-citation publication-type="data">
                    <person-group person-group-type="author">

                        <name name-style="western">
                            <surname>Sunyoung</surname>
                            <given-names>P</given-names>
                        </name>
</person-group>:
                    <data-title>The result of the confusion matrix for the accuracy evaluation of 100 posts. Zenodo.</data-title>
                    <year>2024</year>.
                    <pub-id pub-id-type="doi">10.5281/zenodo.13381133</pub-id>
                </mixed-citation>
            </ref>
        </ref-list>
    </back>
    <sub-article article-type="reviewer-report" id="report382353">
        <front-stub>
            <article-id pub-id-type="doi">10.5256/f1000research.176895.r382353</article-id>
            <title-group>
                <article-title>Reviewer response for version 3</article-title>
            </title-group>
            <contrib-group>
                <contrib contrib-type="author">
                    <name>
                        <surname>Argyris</surname>
                        <given-names>Young Anna</given-names>
                    </name>
                    <xref ref-type="aff" rid="r382353a1">1</xref>
                    <role>Referee</role>
                    <uri content-type="orcid">https://orcid.org/0000-0003-2415-3223</uri>
                </contrib>
                <aff id="r382353a1">
                    <label>1</label>Michigan State University, East Lansing, Michigan, USA</aff>
            </contrib-group>
            <author-notes>
                <fn fn-type="conflict">
                    <p>
                        <bold>Competing interests: </bold>No competing interests were disclosed.</p>
                </fn>
            </author-notes>
            <pub-date pub-type="epub">
                <day>19</day>
                <month>6</month>
                <year>2025</year>
            </pub-date>
            <permissions>
                <copyright-statement>Copyright: &#x00a9; 2025 Argyris YA</copyright-statement>
                <copyright-year>2025</copyright-year>
                <license xlink:href="https://creativecommons.org/licenses/by/4.0/">
                    <license-p>This is an open access peer review report distributed under the terms of the Creative Commons Attribution Licence, which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly cited.</license-p>
                </license>
            </permissions>
            <related-article ext-link-type="doi" id="relatedArticleReport382353" related-article-type="peer-reviewed-article" xlink:href="10.12688/f1000research.145845.3"/>
            <custom-meta-group>
                <custom-meta>
                    <meta-name>recommendation</meta-name>
                    <meta-value>approve-with-reservations</meta-value>
                </custom-meta>
            </custom-meta-group>
        </front-stub>
        <body>
            <p>
                <bold>Overview</bold>
            </p>
            <p> This paper addresses a crucial issue&#x2014;public sentiment toward COVID-19 vaccination in South Korea&#x2014;and examines it through a novel approach: utilizing ChatGPT-based sentiment analysis on social media data. The study's attempt to connect sentiment trends to real-world vaccination rates is timely and commendable, and the use of advanced language models is a noteworthy methodological innovation.</p>
            <p> However, the manuscript has several substantive issues that prevent it from being indexed in its current form. While I highly appreciate the authors&#x2019; effort and believe this line of inquiry is valuable, the concerns outlined below&#x2014;including methodological limitations and unclear contributions &#x2014;need to be addressed more fully. I share these critiques in the hope that they will help the authors improve their paper.</p>
            <p> </p>
            <p> 
                <bold>Literature Review</bold>
            </p>
            <p> The authors may also consider referencing a similar approach in Argyris et al.'s study published in Social Science &amp; Medicine, which also used natural language processing to analyze online vaccine discourse: (Ref.1). They also demonstrate significant correlations between social media posts and vaccination rates, and elaborate potential reasons for the associations, which might be helpful for the authors to contextualize the contribution of this paper within the broader literature.&#x00a0;</p>
            <p> </p>
            <p> 
                <bold>Methods</bold>
            </p>
            <p> 
                <italic>Note: Other reviewers have already addressed other issues, such as data imbalance and small sample size, and are not reiterated here.</italic>
            </p>
            <p> The authors specify that posts were collected from Blind, an anonymous social media community targeted at working professionals. While this is helpful, the implications of using a niche platform with a specific user demographic should be discussed more thoroughly, particularly regarding the representativeness of the data, ethical considerations related to consent and privacy, and the generalizability of the study&#x2019;s findings. To their credit, the authors do acknowledge this limitation in the discussion section, noting that the Blind platform primarily targets professionals and may not capture broader public sentiment. However, merely acknowledging this limitation does not fully address the methodological implications. The restricted demographic may systematically bias the types of sentiments expressed, especially given that workplace pressures and professional norms likely influence how vaccination is discussed on the platform. Blind is organized into numerous company- and industry-specific groups, which may affect the topics and tone of discussions, as well as amplify particular workplace-related sentiments such as coercion or pressure to comply with vaccination mandates.</p>
            <p> </p>
            <p> The posts underwent text preprocessing, which involved UTF-8 encoding, removal of stop words and URLs, as well as removal of emojis and special characters. While such cleaning is common, the removal of emojis may be questionable, as emojis&#x2014;though not crucial&#x2014;can be useful for sentiment analysis and emotional nuance detection. The authors should justify this decision or acknowledge its potential implications for the analysis.</p>
            <p> </p>
            <p> A major concern is the reliability of ChatGPT in accurately classifying sentiment in the analyzed posts. The authors report an 85% agreement rate between the model and two psychiatrists, but this evaluation was conducted on only 100 posts out of 854 analyzed. Given the complexity and nuance of language in social media discourse&#x2014;particularly when dealing with sarcasm, irony, or emotionally charged content&#x2014;a validation sample of 100 posts is insufficient to establish confidence in the model's overall performance. A more rigorous evaluation, ideally with interrater comparisons across a larger and more diverse subset of posts, is necessary to support the claim that the model accurately reflects user sentiment.</p>
            <p> The authors should clarify the coding scheme.</p>
            <p> </p>
            <p> 
                <bold>Results and Visualization</bold>
            </p>
            <p> Figure 2 is very difficult to understand. Based on the accompanying description, the authors appear to be using association rule mining to identify co-occurring sentiment categories within the reasons given for or against vaccination. For example, they report support, confidence, and lift values to indicate the strength and reliability of relationships like &#x201c;Decreasing mortality rate&#x201d; being associated with &#x201c;Symptom alleviation,&#x201d; or &#x201c;Antipathy to social oppression&#x201d; with &#x201c;Mistrust.&#x201d; However, the figure itself lacks clear labels, an intuitive layout, and an adequate explanation in the caption. The connection between the visual elements and these statistical measures is not readily apparent. The figure would benefit from a simplified design, a clearer legend, and a more detailed narrative in the main text to help readers accurately interpret the findings. For guidance on more effective visualizations of association rule mining results, the authors may refer to Dolores et al. (2023) - (Ref 2).&#x00a0;Table 4, which presents a clear and practical summary of association rule metrics and their interpretations.</p>
            <p> </p>
            <p> 
                <bold>Discussion</bold>
            </p>
            <p> Interestingly, the study finds a strong correlation between the number of negative posts and actual vaccination rates, despite the overwhelmingly negative sentiment expressed in the data. The authors acknowledge that many individuals were vaccinated not out of conviction, but because of job requirements or social pressure. In this context, the findings suggest that high vaccination rates in Korea (reported at 96.9%) may reflect compliance driven by external pressures rather than positive public sentiment. This nuance is important: public health behavior may not align with expressed beliefs, particularly in environments where vaccination is perceived as socially or professionally obligatory. The authors could emphasize this tension more clearly to avoid the mistaken interpretation that high uptake equates to high trust or positive attitudes toward vaccines.</p>
            <p>Is the work clearly and accurately presented and does it cite the current literature?</p>
            <p>No</p>
            <p>If applicable, is the statistical analysis and its interpretation appropriate?</p>
            <p>Partly</p>
            <p>Are all the source data underlying the results available to ensure full reproducibility?</p>
            <p>Partly</p>
            <p>Is the study design appropriate and is the work technically sound?</p>
            <p>No</p>
            <p>Are the conclusions drawn adequately supported by the results?</p>
            <p>No</p>
            <p>Are sufficient details of methods and analysis provided to allow replication by others?</p>
            <p>No</p>
            <p>Reviewer Expertise:</p>
            <p>Information Systems, Health Communication, Computational method</p>
            <p>I confirm that I have read this submission and believe that I have an appropriate level of expertise to confirm that it is of an acceptable scientific standard, however I have significant reservations, as outlined above.</p>
        </body>
        <back>
            <ref-list>
                <title>References</title>
                <ref id="rep-ref-382353-1">
                    <label>1</label>
                    <mixed-citation publication-type="journal">
                        <person-group person-group-type="author"/>:
                        <article-title>Vaccine rhetoric on social media and COVID-19 vaccine uptake rates: A triangulation using self-reported vaccine acceptance</article-title>.
                        <source>
                            <italic>Social Science &amp; Medicine</italic>
                        </source>.<year>2024</year>;<volume>348</volume>:
                        <elocation-id>10.1016/j.socscimed.2024.116775</elocation-id>
                        <pub-id pub-id-type="doi">10.1016/j.socscimed.2024.116775</pub-id>
                    </mixed-citation>
                </ref>
                <ref id="rep-ref-382353-2">
                    <label>2</label>
                    <mixed-citation publication-type="journal">
                        <person-group person-group-type="author"/>:
                        <article-title>A big data association rule mining based approach for energy building behaviour analysis in an IoT environment</article-title>.
                        <source>
                            <italic>Scientific Reports</italic>
                        </source>.<year>2023</year>;<volume>13</volume>(<issue>1</issue>) :
                        <elocation-id>10.1038/s41598-023-47056-1</elocation-id>
                        <pub-id pub-id-type="doi">10.1038/s41598-023-47056-1</pub-id>
                    </mixed-citation>
                </ref>
            </ref-list>
        </back>
        <sub-article article-type="response" id="comment14323-382353">
            <front-stub>
                <contrib-group>
                    <contrib contrib-type="author">
                        <name>
                            <surname>Park</surname>
                            <given-names>Sunyoung</given-names>
                        </name>
                        <aff>Psychiatry, National Health Insurance Service Ilsan Hospital, Goyang-si, Gyeonggi-do, South Korea</aff>
                    </contrib>
                </contrib-group>
                <author-notes>
                    <fn fn-type="conflict">
                        <p>
                            <bold>Competing interests: </bold>None.</p>
                    </fn>
                </author-notes>
                <pub-date pub-type="epub">
                    <day>6</day>
                    <month>8</month>
                    <year>2025</year>
                </pub-date>
            </front-stub>
            <body>
                <p>1.&#x00a0;Literature Review: Reference to Argyris et al. : Thank you for the suggestion. We agree that the study by Argyris et al. provides important contextual support for our work. We have revised the Literature Review section to include and discuss this reference. Specifically, we highlight how our study contributes to the growing body of research connecting social media sentiment with real-world vaccine behaviors, with a distinct focus on Korean-language data and the use of ChatGPT.</p>
                <p> </p>
                <p> 2.&#x00a0;Methods &#x2013; Use of Blind and Demographic Limitations:&#x00a0;This is an important point. While we acknowledged this limitation, we agree that the methodological implications should be expanded. We have added a detailed discussion in both the Methods and Discussion sections, emphasizing how Blind&#x2019;s focus on professionals may introduce bias, particularly regarding expressions of workplace-related pressure to vaccinate. We also discuss how company- and industry-specific groupings may affect discourse tone and topic.</p>
                <p> </p>
                <p> 3.&#x00a0;Removal of Emojis in Preprocessing:&#x00a0;We thank the reviewer for this valuable observation. We have now acknowledged in the Methods section that while emojis can contribute emotional context to sentiment analysis, they were removed to reduce preprocessing complexity and ensure consistency in textual inputs for the language model. We have also included a note in the Limitations section about the possible loss of emotional signals due to this decision.</p>
                <p> </p>
                <p> 4.&#x00a0;Validation Sample Size and Coding Scheme:&#x00a0;Thank you for this insightful comment. In response, we have expanded the validation sample from 100 to 200 posts to improve the reliability of our sentiment classification. The additional validation yielded a concordance rate of 84% between the two board-certified psychiatrists, which is consistent with the initial agreement rate.</p>
                <p> We have also provided a more detailed description of our coding scheme in the Methods section. Posts were categorized as positive, negative, or neutral based on the emotional tone and content related to COVID-19 vaccination. Annotation was guided by clinical criteria commonly used in psychiatric evaluation of affective expression, including linguistic cues (e.g., appraisal, sarcasm, urgency), expressed intentions (e.g., compliance, avoidance), and context (e.g., workplace influence, medical risk). Disagreements between annotators were resolved through discussion and consensus. These revisions are now reflected in the updated manuscript.</p>
                <p> </p>
                <p> 5.&#x00a0;Figure 2 &#x2013; Difficult Interpretation of Association Rules:&#x00a0;Thank you for pointing this out. We have redesigned Figure 2 using a more intuitive layout and clearer legends. The caption has been expanded to explain what each element represents (support, confidence, lift), and we&#x2019;ve added a narrative in the Results section that walks through key associations.</p>
                <p> </p>
                <p> 6.&#x00a0;Discussion &#x2013; Clarify the Tension Between Negative Sentiment and High Vaccination Rates:&#x00a0;We fully agree. In the revised Discussion, we now highlight this discrepancy more explicitly. We discuss how high vaccine uptake in Korea may have been driven by workplace mandates or social expectations rather than positive sentiment, reinforcing the complex dynamics between public behavior and online discourse.</p>
                <p> </p>
                <p> 7.&#x00a0;Reproducibility and Methodological Transparency:&#x00a0;To enhance reproducibility, we have updated our GitHub repository to include the full set of preprocessing scripts, sentiment analysis prompts used with ChatGPT, and R code used for association rule mining. We have described the process of categorizing variables for reasoning analysis in more detail in the main text. We have also clarified in the manuscript that the SNS post data cannot be shared due to ethical and copyright concerns. However, we ensured that sufficient methodological details are provided to allow replication of our process using alternative datasets.</p>
            </body>
        </sub-article>
    </sub-article>
    <sub-article article-type="reviewer-report" id="report336916">
        <front-stub>
            <article-id pub-id-type="doi">10.5256/f1000research.169571.r336916</article-id>
            <title-group>
                <article-title>Reviewer response for version 2</article-title>
            </title-group>
            <contrib-group>
                <contrib contrib-type="author">
                    <name>
                        <surname>Mohd Bahrin</surname>
                        <given-names>Ummu Fatihah</given-names>
                    </name>
                    <xref ref-type="aff" rid="r336916a1">1</xref>
                    <role>Referee</role>
                </contrib>
                <aff id="r336916a1">
                    <label>1</label>Universiti Teknologi MARA Cawangan Terengganu, Kuala Terengganu, Malaysia</aff>
            </contrib-group>
            <author-notes>
                <fn fn-type="conflict">
                    <p>
                        <bold>Competing interests: </bold>No competing interests were disclosed.</p>
                </fn>
            </author-notes>
            <pub-date pub-type="epub">
                <day>25</day>
                <month>11</month>
                <year>2024</year>
            </pub-date>
            <permissions>
                <copyright-statement>Copyright: &#x00a9; 2024 Mohd Bahrin UF</copyright-statement>
                <copyright-year>2024</copyright-year>
                <license xlink:href="https://creativecommons.org/licenses/by/4.0/">
                    <license-p>This is an open access peer review report distributed under the terms of the Creative Commons Attribution Licence, which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly cited.</license-p>
                </license>
            </permissions>
            <related-article ext-link-type="doi" id="relatedArticleReport336916" related-article-type="peer-reviewed-article" xlink:href="10.12688/f1000research.145845.2"/>
            <custom-meta-group>
                <custom-meta>
                    <meta-name>recommendation</meta-name>
                    <meta-value>approve-with-reservations</meta-value>
                </custom-meta>
            </custom-meta-group>
        </front-stub>
        <body>
            <p>The study notes that the sentiment analysis faced challenges due to the data imbalance, which led to overfitting, in detecting negative sentiments while reducing accuracy for positive feelings. However, the document does not elaborate on specific strategies to address the imbalance. Recommendations for Handling Imbalanced Datasets</p>
            <p> Resampling Techniques:</p>
            <p> 1)Oversampling: Increase the number of positive and neutral posts using techniques like SMOTE (Synthetic Minority Oversampling Technique).</p>
            <p> 2)Undersampling: Reduce the number of negative posts to balance the dataset while ensuring that critical information is retained.</p>
            <p>Is the work clearly and accurately presented and does it cite the current literature?</p>
            <p>Yes</p>
            <p>If applicable, is the statistical analysis and its interpretation appropriate?</p>
            <p>I cannot comment. A qualified statistician is required.</p>
            <p>Are all the source data underlying the results available to ensure full reproducibility?</p>
            <p>Partly</p>
            <p>Is the study design appropriate and is the work technically sound?</p>
            <p>Yes</p>
            <p>Are the conclusions drawn adequately supported by the results?</p>
            <p>Yes</p>
            <p>Are sufficient details of methods and analysis provided to allow replication by others?</p>
            <p>Yes</p>
            <p>Reviewer Expertise:</p>
            <p>Setimnet Mining, machine learning, deep learning.</p>
            <p>I confirm that I have read this submission and believe that I have an appropriate level of expertise to confirm that it is of an acceptable scientific standard, however I have significant reservations, as outlined above.</p>
        </body>
        <sub-article article-type="response" id="comment13109-336916">
            <front-stub>
                <contrib-group>
                    <contrib contrib-type="author">
                        <name>
                            <surname>Park</surname>
                            <given-names>Sunyoung</given-names>
                        </name>
                        <aff>Psychiatry, National Health Insurance Service Ilsan Hospital, Goyang-si, Gyeonggi-do, South Korea</aff>
                    </contrib>
                </contrib-group>
                <author-notes>
                    <fn fn-type="conflict">
                        <p>
                            <bold>Competing interests: </bold>None</p>
                    </fn>
                </author-notes>
                <pub-date pub-type="epub">
                    <day>9</day>
                    <month>1</month>
                    <year>2025</year>
                </pub-date>
            </front-stub>
            <body>
                <p>Thank you so much for taking the time to review my manuscript and for providing such valuable and thoughtful comments. Your insights have been incredibly helpful and offered me a great opportunity to deepen my understanding of the topic. I truly appreciate the effort you put into making this a better piece of research.</p>
                <p> </p>
                <p> Following your comment, I considered several ways to fix the data imbalance and conducted additional analyses. I have also added the results and acknowledged this as a limitation in the Discussion section.</p>
                <p> </p>
                <p> 
                    <bold>1. Limitations of Oversampling and Undersampling</bold> 
                    <list list-type="order">
                        <list-item>
                            <p>
                                <bold>Undersampling</bold>: Since the total amount of data in this study is not sufficient, reducing the number of negative postings would lead to a lack of data for correlation analysis, which could undermine the reliability of the results. Therefore, undersampling was deemed unsuitable for this study.</p>
                        </list-item>
                        <list-item>
                            <p>
                                <bold>Oversampling</bold>: The recommended method, such as SMOTE, produces results in vector form, making it difficult to apply to sentiment analysis using language models like ChatGPT. For this reason, we attempted data augmentation using ChatGPT
                                <sup>1</sup>.</p>
                        </list-item>
                    </list> </p>
                <p> 
                    <bold>2. Data Augmentation</bold> 
                    <list list-type="bullet">
                        <list-item>
                            <p>Data augmentation using ChatGPT was conducted with controlled prompts to encourage the use of synonyms and increase the diversity of positive and neutral data while maintaining contextual consistency.</p>
                        </list-item>
                        <list-item>
                            <p>The augmented data was adjusted so that positive and neutral data accounted for 25-30% of the total dataset, with the following distribution after augmentation: 
                                <list list-type="bullet">
                                    <list-item>
                                        <p>Positive: Increased from 44 to 264 (220 added)</p>
                                    </list-item>
                                    <list-item>
                                        <p>Neutral: Increased from 103 to 295 (192 added)</p>
                                    </list-item>
                                    <list-item>
                                        <p>Negative: Maintained at 704</p>
                                    </list-item>
                                </list> </p>
                        </list-item>
                        <list-item>
                            <p>The augmented data was randomly mixed with the original data, forming the full dataset, and sentiment analysis was conducted again using ChatGPT.</p>
                        </list-item>
                    </list> </p>
                <p> </p>
                <p> 
                    <bold>3. Results of Sentiment Analysis After Data Augmentation</bold> 
                    <list list-type="bullet">
                        <list-item>
                            <p>
                                <bold>Results</bold>: After data augmentation, the accuracy of the sentiment analysis did not improve. On the contrary, actual neutral data was more frequently misclassified as negative.</p>
                        </list-item>
                        <list-item>
                            <p>
                                <bold>Analysis of Causes</bold>: We propose the following hypotheses to explain these results: 
                                <list list-type="order">
                                    <list-item>
                                        <p>
                                            <bold>Nondeterministic Nature of ChatGPT</bold>: ChatGPT generates nondeterministic responses
                                            <sup>2</sup>, which may have led to the augmented minor dataset having no significant impact on the analysis results. This nondeterministic characteristic is particularly prone to inaccuracies in borderline cases (where the sentiment is ambiguous between positive and negative) and sarcasm.</p>
                                    </list-item>
                                    <list-item>
                                        <p>
                                            <bold>Quality Degradation in Augmented Data</bold>: The initial sentiment analysis accuracy was approximately 84%, suggesting possible inaccuracies in borderline cases and sarcastic expressions. If data augmentation was based on these inaccurate results, it likely generated lower-quality data that further reduced the overall dataset accuracy. The increase in cases where neutral data was misclassified as negative indicates potential challenges in accurately identifying ambiguous boundaries.</p>
                                    </list-item>
                                </list> </p>
                        </list-item>
                    </list> 
                    <bold>References</bold>
                </p>
                <p> 1.&#x00a0;&#x00a0;&#x00a0;&#x00a0;&#x00a0;&#x00a0;&#x00a0;&#x00a0; Dai H, Liu Z, Liao W, Huang X, Cao Y, Wu Z, et al. Auggpt: Leveraging chatgpt for text data augmentation. 
                    <italic>arXiv. </italic>2023; preprint arXiv:230213007. 
                    <ext-link ext-link-type="uri" xlink:href="https://doi.org/10.48550/arXiv.2302.13007">10.48550/arXiv.2302.13007</ext-link>
                </p>
                <p> 2.&#x00a0;&#x00a0;&#x00a0;&#x00a0;&#x00a0;&#x00a0;&#x00a0;&#x00a0; Ouyang T, MaungMaung A, Konishi K, Seo Y, Echizen I. Stability analysis of chatgpt-based sentiment analysis in ai quality assurance. 
                    <italic>Electronics</italic>. 2024;13(24):5043. 
                    <ext-link ext-link-type="uri" xlink:href="https://doi.org/10.3390/electronics13245043">10.3390/electronics13245043</ext-link>
                </p>
            </body>
        </sub-article>
    </sub-article>
    <sub-article article-type="reviewer-report" id="report319806">
        <front-stub>
            <article-id pub-id-type="doi">10.5256/f1000research.169571.r319806</article-id>
            <title-group>
                <article-title>Reviewer response for version 2</article-title>
            </title-group>
            <contrib-group>
                <contrib contrib-type="author">
                    <name>
                        <surname>Cilgin</surname>
                        <given-names>Cihan</given-names>
                    </name>
                    <xref ref-type="aff" rid="r319806a1">1</xref>
                    <role>Referee</role>
                </contrib>
                <aff id="r319806a1">
                    <label>1</label>Bolu Abant Izzet Baysal University, Bolu, Turkey</aff>
            </contrib-group>
            <author-notes>
                <fn fn-type="conflict">
                    <p>
                        <bold>Competing interests: </bold>No competing interests were disclosed.</p>
                </fn>
            </author-notes>
            <pub-date pub-type="epub">
                <day>4</day>
                <month>9</month>
                <year>2024</year>
            </pub-date>
            <permissions>
                <copyright-statement>Copyright: &#x00a9; 2024 Cilgin C</copyright-statement>
                <copyright-year>2024</copyright-year>
                <license xlink:href="https://creativecommons.org/licenses/by/4.0/">
                    <license-p>This is an open access peer review report distributed under the terms of the Creative Commons Attribution Licence, which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly cited.</license-p>
                </license>
            </permissions>
            <related-article ext-link-type="doi" id="relatedArticleReport319806" related-article-type="peer-reviewed-article" xlink:href="10.12688/f1000research.145845.2"/>
            <custom-meta-group>
                <custom-meta>
                    <meta-name>recommendation</meta-name>
                    <meta-value>approve</meta-value>
                </custom-meta>
            </custom-meta-group>
        </front-stub>
        <body>
            <p>I think it is appropriate to approve and index this study in its current, final form.</p>
            <p>Is the work clearly and accurately presented and does it cite the current literature?</p>
            <p>No</p>
            <p>If applicable, is the statistical analysis and its interpretation appropriate?</p>
            <p>Yes</p>
            <p>Are all the source data underlying the results available to ensure full reproducibility?</p>
            <p>Partly</p>
            <p>Is the study design appropriate and is the work technically sound?</p>
            <p>Partly</p>
            <p>Are the conclusions drawn adequately supported by the results?</p>
            <p>Yes</p>
            <p>Are sufficient details of methods and analysis provided to allow replication by others?</p>
            <p>Partly</p>
            <p>Reviewer Expertise:</p>
            <p>Machine Learning, Deep Learning, sentiment analysis,</p>
            <p>I confirm that I have read this submission and believe that I have an appropriate level of expertise to confirm that it is of an acceptable scientific standard.</p>
        </body>
    </sub-article>
    <sub-article article-type="reviewer-report" id="report284069">
        <front-stub>
            <article-id pub-id-type="doi">10.5256/f1000research.159851.r284069</article-id>
            <title-group>
                <article-title>Reviewer response for version 1</article-title>
            </title-group>
            <contrib-group>
                <contrib contrib-type="author">
                    <name>
                        <surname>Cilgin</surname>
                        <given-names>Cihan</given-names>
                    </name>
                    <xref ref-type="aff" rid="r284069a1">1</xref>
                    <role>Referee</role>
                </contrib>
                <aff id="r284069a1">
                    <label>1</label>Bolu Abant Izzet Baysal University, Bolu, Turkey</aff>
            </contrib-group>
            <author-notes>
                <fn fn-type="conflict">
                    <p>
                        <bold>Competing interests: </bold>No competing interests were disclosed.</p>
                </fn>
            </author-notes>
            <pub-date pub-type="epub">
                <day>17</day>
                <month>6</month>
                <year>2024</year>
            </pub-date>
            <permissions>
                <copyright-statement>Copyright: &#x00a9; 2024 Cilgin C</copyright-statement>
                <copyright-year>2024</copyright-year>
                <license xlink:href="https://creativecommons.org/licenses/by/4.0/">
                    <license-p>This is an open access peer review report distributed under the terms of the Creative Commons Attribution Licence, which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly cited.</license-p>
                </license>
            </permissions>
            <related-article ext-link-type="doi" id="relatedArticleReport284069" related-article-type="peer-reviewed-article" xlink:href="10.12688/f1000research.145845.1"/>
            <custom-meta-group>
                <custom-meta>
                    <meta-name>recommendation</meta-name>
                    <meta-value>approve-with-reservations</meta-value>
                </custom-meta>
            </custom-meta-group>
        </front-stub>
        <body>
            <p>This study used ChatGPT to perform sentiment analysis on posts about Covid-19 vaccines.</p>
            <p> </p>
            <p> 1.&#x00a0;Within the scope of the study, many studies conducted within the same scope were ignored and the literature section was not given. However, the existing studies in the literature are very important to both reveal and support the current contributions of these studies. For example, the study given below is a direct alternative to this issue:</p>
            <p> </p>
            <p> &gt;&gt; &#x00c7;&#x0131;lg&#x0131;n, C., G&#x00f6;k&#x00e7;en, H., &amp; G&#x00f6;k&#x015f;en, Y. (2023 [Ref -1]). Sentiment analysis of public sensitivity to COVID-19 vaccines on Twitter by majority voting classifier-based machine learning. 
                <italic>Journal of the Faculty of Engineering and Architecture of Gazi University</italic>,&#x00a0;
                <italic>38</italic>(2)</p>
            <p> </p>
            <p> 2.&#x00a0;There are many repetitive sentences under the title of web crawling and data collection. This title should definitely be reviewed again.</p>
            <p> </p>
            <p> 3.&#x00a0;&#x00a0;It is really interesting that only 854 relevant posts were confiscated out of approximately 500 thousand posts. Many studies in the literature used social media posts much higher than this number. The reader should be made more aware here by giving more details about this filtering process.</p>
            <p> </p>
            <p> 4.&#x00a0;Using a lexicon-based approach such as Vader or TextBlob in sentiment analysis, in addition to ChatGPT, may make the results of this study more valuable. In addition, a suitable data set is available for such a method comparison.</p>
            <p> </p>
            <p> 5.&#x00a0;Presenting a confusion matrix of the results obtained as a result of the 100 posts considered for the test data set is very important in terms of the consistency of the results.</p>
            <p> </p>
            <p> 6.&#x00a0;There are repetitive expressions, especially in the Discussion section of the study.</p>
            <p> </p>
            <p> 7.&#x00a0;There is no statement in the Discussion section that compares the findings of this study with the findings of other studies in the literature. As mentioned before, this is related to the lack of literature section. The similarity or difference of the findings with the existing literature is very important for the reader to evaluate the findings of this study. This study used ChatGPT to perform sentiment analysis on posts about Covid-19 vaccines.</p>
            <p>Is the work clearly and accurately presented and does it cite the current literature?</p>
            <p>No</p>
            <p>If applicable, is the statistical analysis and its interpretation appropriate?</p>
            <p>Yes</p>
            <p>Are all the source data underlying the results available to ensure full reproducibility?</p>
            <p>Partly</p>
            <p>Is the study design appropriate and is the work technically sound?</p>
            <p>Partly</p>
            <p>Are the conclusions drawn adequately supported by the results?</p>
            <p>Yes</p>
            <p>Are sufficient details of methods and analysis provided to allow replication by others?</p>
            <p>Partly</p>
            <p>Reviewer Expertise:</p>
            <p>Machine Learning, Deep Learning, sentiment analysis,</p>
            <p>I confirm that I have read this submission and believe that I have an appropriate level of expertise to confirm that it is of an acceptable scientific standard, however I have significant reservations, as outlined above.</p>
        </body>
        <back>
            <ref-list>
                <title>References</title>
                <ref id="rep-ref-284069-1">
                    <label>1</label>
                    <mixed-citation publication-type="journal">
                        <person-group person-group-type="author"/>:
                        <article-title>Twitter&#x2019;da COVID-19 a&#x015f;&#x0131;lar&#x0131;na kar&#x015f;&#x0131; kamu duyarl&#x0131;l&#x0131;&#x011f;&#x0131;n&#x0131;n &#x00e7;o&#x011f;unluk oylama s&#x0131;n&#x0131;fland&#x0131;r&#x0131;c&#x0131;s&#x0131; temelli makine &#x00f6;&#x011f;renmesi ile duygu analizi</article-title>.
                        <source>
                            <italic>Gazi &#x00dc;niversitesi M&#x00fc;hendislik Mimarl&#x0131;k Fak&#x00fc;ltesi Dergisi</italic>
                        </source>.<year>2022</year>;<volume>38</volume>(<issue>2</issue>) :
                        <elocation-id>10.17341/gazimmfd.1030198</elocation-id>
                        <fpage>1093</fpage>-<lpage>1104</lpage>
                        <pub-id pub-id-type="doi">10.17341/gazimmfd.1030198</pub-id>
                    </mixed-citation>
                </ref>
            </ref-list>
        </back>
        <sub-article article-type="response" id="comment12156-284069">
            <front-stub>
                <contrib-group>
                    <contrib contrib-type="author">
                        <name>
                            <surname>Park</surname>
                            <given-names>Sunyoung</given-names>
                        </name>
                        <aff>Psychiatry, National Health Insurance Service Ilsan Hospital, Goyang-si, Gyeonggi-do, South Korea</aff>
                    </contrib>
                </contrib-group>
                <author-notes>
                    <fn fn-type="conflict">
                        <p>
                            <bold>Competing interests: </bold>None</p>
                    </fn>
                </author-notes>
                <pub-date pub-type="epub">
                    <day>4</day>
                    <month>8</month>
                    <year>2024</year>
                </pub-date>
            </front-stub>
            <body>
                <p>Thank you for your comment. I have revised this article as below.</p>
                <p> </p>
                <p> Please go through the respective Author responses for Reviewer Comments made.</p>
                <p> </p>
                <p> 1.&#x00a0;As suggested, I have developed and included a comprehensive literature review related to the main topic of this study. Additionally, I have incorporated important insights from the paper you mentioned as an example, which have significantly contributed to the argument presented in my work.</p>
                <p> 2.&#x00a0;Thank you for pointing this out. There was an error in the editing process of the Methods section, which led to the repetition of content. This has now been corrected.</p>
                <p> 3.&#x00a0;Thank you for highlighting this critical point. The relatively small number of relevant posts collected in this study can be attributed to the characteristics of the website BLIND, from which the data was gathered, and the fact that data collection occurred some time after vaccines were a hot topic. I have added a detailed explanation to address this. Additionally, I have also enhanced and elaborated on the filtering process in the study.</p>
                <p> 4.&#x00a0;Thank you for your insightful suggestion. It is true that most sentiment analysis studies utilize English data, and the lexicons you mentioned are indeed based on English. However, Korean, being a less commonly used language globally, presents challenges in applying these pre-established analytic methods directly. I believe this underscores the significance of using ChatGPT in our study, which is specifically adapted to handle Korean text. This point has been detailed in the literature review and discussed further in the discussion section of our paper.</p>
                <p> 5.&#x00a0;I have added the results of the confusion matrix and included comments on the identified features in the discussion section. Additionally, while creating the confusion matrix, there were slight modifications to the previously manually evaluated accuracy results (from 86% to 85%).</p>
                <p> 6.&#x00a0;I have revised the Discussion section to eliminate repetitive expressions and streamline the content.</p>
                <p> 7.&#x00a0;Thank you for your meaningful feedback. Our study has notable differences from other studies, particularly due to the linguistic and sociocultural factors specific to Korea. I have addressed these differences and added a comparison of our findings with those in the existing literature in the Discussion section.</p>
            </body>
        </sub-article>
    </sub-article>
</article>
