Dataset of companies’ profitability, government debt, financial statements' key indicators and earnings in an emerging market: Developing a panel and time series database of value-added tax rate increase impacts

Company profitability is a crucial indicator that can be used for developing and sustaining trust in accounting information and, thus, inefficient capital markets. Companies with good financial statements’ key indicators have a more extensive customer base and can diversify their revenue streams, making them more resilient to economic downturns. Assembling and managing taxes is a critical underpinning to protecting a country’s financial intensity and developing a country’s tax-system. VAT is a primary source of financial gain in developing nations, which differs from economic income in developed countries, where economic income is primarily derived from tax income. In emerging economies, the existing practice requires firms to effectively and efficiently publish annual-reports and indicators on market-websites, as users rely heavily on timely-information and need it to make decisions. However, these practices fell short of expectations, requiring more research. These variables are crucial for most accounting/economics/taxation research models and the lack of easily attainable data in well-known databases (e.g., ARGAAM; DataStream). This article is primarily a dataset for analysing taxation, performance variables, and key financial-statement indicators. The data describes the raw, combined, and filtered information at the company level, such as company profit and government debt in Saudi Arabia. It combines a firm-level panel dataset sample of company profit that its measures scaled by total assets and include: earnings before interest, taxes, decrease and amortisation, earnings before interest and taxes, earnings after taxes and earnings before taxes—moreover, the time series dataset sample of 11 financial statements’ key indicators. The dataset results from 494 company-year observations (226-panel data sample and 268-time series data sample) from 2019 to 2020. Data has been collected from taxation reports, corporate annual reports, ARGAAM database, FinBox database, the Trading Economics database and the Tadawul-market website in Saudi Arabia.


Introduction
The reasoning and setting behind the creation of this dataset are to help the researcher to perform a comparative analysis across different disciplines to study the role of these variables in a specific phenomenon or the factors enlightening value-added tax (VAT) usefulness and effectiveness.For example, estimate the consequences of a VAT rate increase on profitability, government debt etc.This dataset helps complete two of the few studies investigating the VAT effect in the Kingdom of Saudi Arabia (KSA).There was an encouragement to observe the effect of imposing the new 15% VAT on the profitability of nonfinancial Saudi-listed companies.This dataset is in two types: a firm-level panel dataset sample and a time series sample (Mgammal, 2021;Mgammal, Al-Matari, and Alruwaili, 2023).

Methods
This dataset article contains proxies to measure the economic impacts of VAT as used by several prior researchers (Mgammal, 2021).The representatives for measuring the economic effects of VAT were manually collected and built using secondary data obtained from publicly available data from 2019 to 2020.In Table 1, ProFtEBITDA means company profit measured by earnings before interest, taxes, devaluation, and amortisation (EBITDA) and scaled by total assets and data collected from FinBox database tools.Data were hand collected from companies' tax reports.We specify companies and identify them with inclusion/exclusion criteria, as mentioned in Table 3.The inclusion/exclusion criteria of all data in this article are as follows: we included nonfinancial companies and excluded finance firms.Then we filtered the sample by excluding companies with annual reports unavailable for two years.The fiscal year-end date is not 31/12/of each year, and the accounting period is over 12 months.Consequently, the final dataset is 494 company-year observations (226-panel data sample and 268-time series data sample) from 2019 to 2020.This final sample is the foundation that can be used for analysis in future research.SIZE2020 and SIZE2019 are the mean company size in 2020 and 2019, measured by the natural logarithm of total assets and data gathered from companies' annual reports.EAT2020 and EAT2019 are the mean company earnings after tax in 2020 and 2019 and were measured by deducting all expenditures and revenue taxes from the business's revenues.Following prior research, two variables were added, GvD2020 and GvD2019, which meant government debt in 2020 and 2019 and was measured as government debt over the gross domestic product (GDP).Data for this were extracted from the Trading Economics (https://tradingeconomics.com/) database.Regarding the process for accessing the data, the data was hand collated directly from the Trading Economics website using many Saudi Arabia Indicators reports, especially GDP Indicators reports.In this context, the data collected from the Trading dataset are the same inclusion/exclusion criteria as mentioned above, and we started data collection on 23/01/2021 to 30/03/2021.As in Table 2 below, an index containing 11 items was included: BALANCE SHEET: total assets, total equity, and liabilities-equity.INCOME STATEMENT: total income, total revenues, total expenses, and net income.CASH FLOW: changes in operation.Activity, changes in investing act, changes in the financing act, and cash at the end of the period.These factors help control for potential impacts when analysing how a VAT increase will affect a firm.We take each into account as each has a component of the likely effect of a VAT increase.Collectively, these classifications were based on data available on the Tadawul market website as we clarify in inclusion/exclusion criteria above.It is a periodic data set of most Saudi registered companies from before the introduction of the new VAT rate of 15% in 2019 to after the introduction of the new VAT rate in 2020.The dataset framework was chosen when the new 15% VAT was introduced due to public access to VAT information for nonfinancial companies.Tadawul requires all public companies to publish their financial statements on the Tadawul website quarterly and annually (https://m5.gs/OXFqam)(Tadawul, 2019(Tadawul, -2020)).These data in the file can help build additional variables, such as measuring the effect of VAT before and after increasing its rate using unique techniques such as the difference-in-difference (DID) approach and the autoregressive integrated moving average (ARIMA) modelling approach.Nevertheless, since the literature does not provide common definitions or metrics, we leave the creation of these additional variables at the discretion of potential users.Thus, users can use this dataset to create these measurements from their perspective (Baatwah and Aljaaidi, 2021;Mgammal, 2021).

Data description
The dataset included in this article contains three files describing and defining the sample and variables.Excel file 1 consists of all raw and filtered data for the variables for the panel data sample.Excel file 2 depicts time-series and crosssectional data for nonfinancial firms listed on the Saudi market for the second and third quarters of 2019 and the third and fourth quarters of 2020.Excel file 3 presents the raw material of variables used in measuring the company's profitability of the panel data sample.The period of this data is selected from an extensive section of registered companies in Saudi Arabia from 2019 before imposing the new VAT rate, 15%, to 2020 after setting the new VAT rate.The major segments are consumer discretionary, information technology, energy, consumer staples, materials, health care, industrials, communication services, real estate, utilities, and financials.The sample framework was chosen due to the time of implementation new 15% VAT rate and the public access to information about VAT if nonfinancial companies, where Tadawul forces all listed companies to publish their financial statements publicly quarterly and annually on the Tadawul website.Financial companies were excluded from the sample framework as they have unique treatments, and some previous studies investigated the effects of VAT on KSA Banks.The final panel data sample framework is 131 listed companies and 268 observations for the time sires' sample, as depicted in Table 3 below.
We utilise balanced panel data as it is a more sensitive measurement of the modifications that could occur between points in time (Cavana, Delahaye, & Sekeran, 2001).Additionally, the outcomes created are more robust, consistent, and stable, enabling a generalisation of the population to be more meaningful and representative.Therefore, the final dataset sample is 226 observations were specified to be eligible for implication in the analyses.It is the basis for the research, i.e. multivariate, bivariate, additional tests and descriptive.

Descriptive statistics
To recognise and determine the situation of every concept, descriptive statistics were utilised to clarify.

Value of the data
• This dataset is essential because it covers data on variables rarely overlooked in accounting, taxation and business performance research models, collectively or individually, but appeal to a wide range of stakeholders.For example, it enables capital markets regulators, standard setters, practitioners and users of financial reporting to easily access long-term data to assess the effectiveness and efficiency of VAT in controlling the risk of tax evasion in fast-growth markets.• This dataset is valuable for interdisciplinary studies investigating the role of these variables in specific phenomena or factors enlightening VAT and company performance effectiveness.
• This dataset is beneficial because it contains data on profitability collected to measure company profit using earnings before interest, taxes, depreciation, and amortisation (EBITDA) and scaled by the total assets data set.Further, the dataset has been arranged into individual and multiple measurements.
• The data allows researchers to scrutinise the influence of VAT increase on various accounting matters, such as corporate governance mechanisms, the performance of companies and the quality of financial reporting.Furthermore, this dataset is valuable to related parties, e.g., investors, stakeholders, tax authorities, decisionmakers, managers and market regulators in assessing and reviewing the tax system in Saudi Arabia.This assessment will give them assurance in making different decisions.
• These data can be analysed and/or compared to other emerging economies and G20 countries (https://www.g20.org/en/).It can be used in discussing the tax system and VAT rate regarding which parties are responsible for adding some VAT incentives in the tax system and updating the tax system sideways with VAT implementation to advantage from the effectiveness of VAT in the KSA.Because KSA could be similar to other countries in G20 and Gulf Cooperation Council (GCC).
• The data is also beneficial for studies on VAT incentives efficiency using data in the long term.
• VAT-Final-Time seires dataset DIB-2.xlsx(Excel file 2 depicts time-series and cross-sectional data for nonfinancial firms listed on the Saudi market for the second and third quarters of 2019 and the third and fourth quarters of 2020).

Faozi Almaqtari
A'Sharqiyah University (ASU), Ibra, Oman Thank you for giving me chance review of this intriguing data article.I believe that by utilizing this dataset, researchers can explore compelling issues within the realm of value-added tax (VAT) in the specific context of Saudi Arabia.I would kindly suggest that the authors carefully review the paper, as there are a few minor typos that could be addressed.In summary, I anticipate significant value emanating from this dataset, foreseeing its potential to contribute valuable insights and implications for economic considerations.
Is the rationale for creating the dataset(s) clearly described?

Saeed Rabea Baatwah
Department of Accounting, College of Business Administration, Shaqra University, Shaqra, Riyadh Province, Saudi Arabia Thank you for reviewing this interesting data article.I think that researchers can test interesting issues in the context of and in Saudi Arabia using this dataset.I would suggest to the authors that it will be more reflective of the dataset if the title is updated to "Dataset of value-added tax, companies' profitability, government debt, financial statements' key indicators, and earnings in an emerging market: Developing a panel and time series dataset".Furthermore, I would suggest the authors review the paper, as there are some minor typos.Overall.I expect greater value from this dataset and its potential to help draw economic implications.
Is the rationale for creating the dataset(s) clearly described?Yes Are the protocols appropriate and is the work technically sound?Yes

Are sufficient details of methods and materials provided to allow replication by others? Yes
Are the datasets clearly presented in a useable and accessible format?Yes Competing Interests: No competing interests were disclosed.
Reviewer Expertise: I am interested in auditing, financial reporting, CSR, taxation, and emerging countries.
I confirm that I have read this submission and believe that I have an appropriate level of expertise to confirm that it is of an acceptable scientific standard.
this regard, we looking forward to seeing the greater value that this dataset can bring to researchers studying VAT and Saudi Arabia.
Competing Interests: No competing interests were disclosed.
The benefits of publishing with F1000Research: Your article is published within days, with no editorial bias • You can publish traditional articles, null/negative results, case reports, data notes and more • The peer review process is transparent and collaborative • Your article is indexed in PubMed after passing peer review • Dedicated customer support at every stage • For pre-submission enquiries, contact research@f1000.com

Table 1 .
Definitions of the variables.
Table 4 displays the statistics of descriptive (standard deviation, median, mean, minimum, maximum values and degrees of freedom) for 113*2 = 226 observations of all variables of the panel data sample and 268 observations for the time series sample.

Table 2 .
Financial statements' key indicators definitions.

Table 3 .
Descriptive for the samples' characteristics.
*In the Kingdom of Saudi Arabia (KSA), value-added tax (VAT) was first introduced in all industries as a 5% VAT on goods and services as of Jan. 1, 2018, and, because of COVID-19, the Kingdom of Saudi Arabia (KSA) increased the VAT from 5% to 15% on July 1, 2020.

•
Raw matiral -DiB-2.xlsx.(Excel file 3 presents the raw material of variables used in measuring the company's profitability of the panel data sample).Data are available under the terms of the Creative Commons Attribution 4.0 International license (CC-BY 4.0).

Yes Are the protocols appropriate and is the work technically sound? Yes Are sufficient details of methods and materials provided to allow replication by others? Yes Are the datasets clearly presented in a useable and accessible format? Yes Competing Interests:
No competing interests were disclosed.

have read this submission and believe that I have an appropriate level of expertise to confirm that it is of an acceptable scientific standard.
This is an open access peer review report distributed under the terms of the Creative Commons Attribution License, which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly cited.