Hybrid recommender system model for digital library from multiple online publishers

Pijitra Jomsri; Dulyawit Prangchumpol; Kittiya Poonsilp; Thammarat Panityakul

doi:10.12688/f1000research.133013.2

Research Article

Revised

[version 2; peer review: 1 approved with reservations, 1 not approved]

Pijitra Jomsri ¹, Dulyawit Prangchumpol¹, Kittiya Poonsilp¹, Thammarat Panityakul²

PUBLISHED 04 Apr 2024

¹ Suan Sunandha Rajabhat University, Dusit, Bangkok, 10300, Thailand
² Prince of Songkla University, Hat Yai, Songkhla, 90110, Thailand

Pijitra Jomsri
Roles: Conceptualization, Data Curation, Formal Analysis, Funding Acquisition, Methodology, Project Administration, Resources, Software, Validation, Writing – Original Draft Preparation, Writing – Review & Editing

Dulyawit Prangchumpol
Roles: Data Curation, Funding Acquisition, Investigation, Supervision, Validation, Visualization, Writing – Original Draft Preparation, Writing – Review & Editing

Kittiya Poonsilp
Roles: Data Curation, Formal Analysis, Investigation, Supervision, Validation, Visualization

Thammarat Panityakul
Roles: Visualization, Writing – Review & Editing

Abstract

Background

The demand for online education promotion platforms has increased. In addition, the digital library system is one of the many systems that support teaching and learning. However, most digital library systems store books in the form of libraries that were developed or purchased exclusively by the library, without connecting data with different agencies in the same system.

Methods

This study developed a prototype digital library system that connects knowledge sources from multiple digital libraries and online publishers in order to index and recommend e-books. The system uses an API-based linking process to connect data sources that are otherwise stored separately, such as educational e-books from educational institutions, e-books from government agencies, and e-books from religious organizations. A hybrid recommender system suitable for users was then developed using a Collaborative Filtering (CF) model together with Content-Based Filtering. The proposed hybrid recommender system model takes into account book category, users' reading habits, and sources of information. The experiments were evaluated by soliciting feedback from system users and comparing the results with conventional recommendation methods.

Results

NDCG scores were compared for Hybrid Score 50:50, Hybrid Score 20:80, Hybrid Score 80:20, CF-score and CB-score. The Hybrid Score 80:20 method had the highest average NDCG score.

Conclusions

Using a hybrid recommender system model that combines 80% Collaborative Filtering and 20% Content-Based Filtering can improve the recommender method, leading to better referral efficiency and greater overall efficiency compared to traditional approaches.

Keywords

Recommender systems, digital library, multiple database, user profile, hybrid recommender systems, collaborative filtering, content-based filtering

Corresponding author: Pijitra Jomsri

Competing interests: No competing interests were disclosed.

Grant information: Thank you to the Broadcasting and Telecommunications Research and Development Fund for Public Interest for supporting data and Suan Sunandha Rajabhat University for supporting scholarship for paper publication.
The funders had no role in study design, data collection and analysis, decision to publish, or preparation of the manuscript.

Copyright: © 2024 Jomsri P et al. This is an open access article distributed under the terms of the Creative Commons Attribution License, which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly cited.

How to cite: Jomsri P, Prangchumpol D, Poonsilp K and Panityakul T. Hybrid recommender system model for digital library from multiple online publishers [version 2; peer review: 1 approved with reservations, 1 not approved]. F1000Research 2024, 12:1140 (https://doi.org/10.12688/f1000research.133013.2) First published: 12 Sep 2023, 12:1140 (https://doi.org/10.12688/f1000research.133013.1) Latest published: 18 Nov 2024, 12:1140 (https://doi.org/10.12688/f1000research.133013.3)

Revised Amendments from Version 1

We conducted further evaluation of the experiment by increasing the number of participating users from 30 to 75. The results of the experiment remained consistent with the initial conclusions, and we have made adjustments based on the suggestions provided by both reviewers, such as adding detail to the model, explaining the algorithm, and adding details about the objective and hypothesis.

1) Added new version of Figure 4.
2) Added new Table 1.
3) Added equations 1-5 and rearranged several equations.
4) Added two references, numbers 37 and 38.

See the authors' detailed response to the review by Muhammad Yousuf Ali
See the authors' detailed response to the review by Asefeh Asemi

1. Introduction

Reading is important for human development in terms of education, career development, quality of life, and national development. In Bangkok, Thailand, there are areas for self-learning through books, or public libraries, available free of charge. Libraries are necessary for people at all levels to use their knowledge from books to improve themselves, enhance their quality of life, create equality, and promote reading to the public. In addition, data published by the World Bank, UNESCO, and the United Nations (UN) demographic data indicate that the Covid-19 outbreak has contributed to over 17% of children worldwide facing a learning crisis. This may affect the potential of the modern population. Also, schools around the world have had to close more than usual over the past year. School closures have resulted in students having to switch to online classes, but the learning system does not cover the world, and many children do not have access to technology to study online,¹,² which may result in a lack of basic literacy skills.

Currently, technology and telecommunication play an essential role in human life, coupled with rapidly advancing computer technology and communication systems. Therefore, the digital library system is another channel for collecting information or electronic books from multiple sources and disseminating them through the Internet. This allows children and general readers to access and search for books through a computer network without any restrictions on location, distance, and duration. It is an opportunity to extend learning resources from physical to online, without borders, and to increase access to books, media, and publications, promoting reading widely and in line with modern society. However, the current online library system is not user-friendly because e-books are scattered across separate databases developed by each agency, which forces users to download multiple applications to read e-books. In addition, most government economic development plans push for the promotion of reading and learning through modern regional library services, creating opportunities for youth groups to have access to quality services that are convenient and fast.

Based on these problems, the researcher's objective is to develop a digital library model by studying techniques for combining multiple e-book database systems. Additionally, the researcher aims to use content to create a hybrid recommender systems model for electronic books from multiple structures to serve users in the Bangkok area. This is to enable users to easily access them through a single system. Books were gathered from many important sources, and the system enables the public to access the library system in an online format through a database design that links electronic books from multiple databases together, collecting book information from many sources such as the National Library, government agencies, teaching materials, Dhamma books, novels, and short stories. Moreover, this research presents a model for recommending electronic books to users using the hybrid recommender systems model, combining Collaborative Filtering (CF), heuristic Content-based filtering, and user’s personal data. The researcher believes that the hybrid recommender system model can enhance the efficiency of book recommendations in the digital library system. The collaborative filtering in this research concentrates on user reading, while Content-based filtering concentrates on titles, authors, book categories, keywords, and book details to offer suggestions to users in the area of their interest.

The structure of this paper is as follows: Section 2 provides a background and relevant literature. Section 3 outlines the methodology and framework employed in the hybrid recommender system. Section 4 shows the experimental outcomes. Lastly, Section 5 concludes the research and offers recommendations for future research.

2. Literature review

Recommender systems are ubiquitous on the internet. Typically, news websites feature a banner that displays recommendations such as “You may also like” or “People who liked this article also enjoyed this one.” This approach aligns with the traditional definition of recommender systems as outlined by Resnick and Varian,³ that they are systems that study a user’s preferences for a given object to make suggestions that might be useful to the user. Recommender systems enable users to customize their profiles, receive tailored suggestions, and make informed decisions about products and services that align with their preferences. The five primary recommendation techniques include: collaborative filtering, content filtering, demographic filtering, knowledge-based filtering, and utility-based filtering.⁴ A more fundamental way of categorizing recommender systems is to divide techniques into three primary groups: collaborative filtering, content-based filtering, and hybrid approaches.⁵

• Content-based filtering utilizes the "Content" feature of an item to generate user profiles based on their preferences and selections. This technique suggests a list of items that are similar to those that a user has already viewed or appreciated.⁶–⁸
• Collaborative filtering relies on the exchange of opinions and feedback among users. This technique suggests a list of items that have been favored by other users with similar preferences.⁹–¹¹
• The Hybrid Approach is a blend of Content-based and Collaborative Filtering that leverages both user preferences and item attributes. This technique utilizes a matrix derived from filtering interactions and contextual data from Content-based filtering to provide personalized recommendations.¹²–¹⁴

In general, most databases are stored separately for different service providers. Users must know the source and explore various topics of interest.¹⁵–²² However, some researchers have concluded that a single database is not sufficient to retrieve knowledge for users. Several recommender systems for digital libraries have been proposed. Many researchers apply a hybrid model to improve recommender systems. Porcel et al. propose a hybrid system that combines collaborative and content-based recommendations.²³ Tejeda-Lorente et al. present a quality-based recommender system that considers the quality of an item in order to assess its relevance.²⁴ Serrano-Guerrero et al. present a fuzzy linguistic recommender model in a university digital library. This model uses the Google Wave approach that provides a shared space for different users and resources.²⁵ Morawski et al. offer a hybrid recommender system for rural libraries by combining content-based and collaborative filtering. The authors suggest the concept of a fuzzy flavor vector to deal with the "cold start" problem caused by the smaller size of this library and the usual sparse data sets.²⁶ Jomsri proposes a library book recommendation system based on user profiling and association rules.²⁷ Some researchers focus on a patron-driven hybrid library recommender system that applies machine learning techniques to recommend weeding decisions by extracting and analyzing users’ opinions and ratings.²⁸

Some researchers have tried to develop models for library services, such as Yang and Hung’s proposed recommender system for book acquisition in libraries. The authors employ a basic metric that does not consider user feedback or opinions.²⁹ Wu et al. have introduced a library book acquisition recommender system that employs a network ranking mechanism.³⁰ Cabrerizo et al. suggest an extension to the LibQUAL+ model to address users’ perceptions and evaluate the quality of library services.³¹–³² Some researchers use linked information spaces for different scientific digital libraries in Digital Humanities.³³ Another researcher conducted a study with the aim of developing a recommendation system model that integrates various types of supplementary information, apart from explicit ratings assigned to items. This supplementary information includes social connections between users and data on the items being recommended.³⁴ The main aim of researching the Hybrid Recommendation model is to overcome the issue of insufficient rating data by integrating the information from Content-Based and Collaborative Filtering models. Numerous studies have been conducted in this area, including one that implemented the Bayesian Probabilistic Matrix Factorization Framework to tackle the sparsity problem by supplementing taste data with user evaluation data stored in a matrix. Another study utilized an auto-encoder to learn side information data when user preference information is inadequate. Furthermore, a study was carried out to integrate information by utilizing an automatic encoder to learn the nonlinear activity of users and items while removing stacked noise.³⁵,³⁶ The recommendation technique in this paper applies a hybrid approach model and creates an API for connecting content from multiple e-book databases to make recommendations to users.

3. Methodology and framework of hybrid recommender system

This section describes the framework of the hybrid recommender system, including the API functions for connecting multiple publishers, the architecture of the book recommendation system, and the hybrid recommender systems model. The concept of the hybrid recommender system is shown in Figure 1, which gives a functional overview of a hybrid recommender system for a digital library from multiple online publishers. The system collects data from various publishers by creating a retrieval API and gathers important metadata for indexing. The metadata of the e-books are stored in the database of the developed system, without storing the e-book files from the publisher, in order to maintain each book’s copyright. Partnered publishers for this edition of the book collection include the Listing Agency, Arsom Silp Institute of the Arts, and The Secretariat of the House of Representatives, all of which provide valuable books in Thailand. The next step is to develop a digital library system with a channel for accessing book information. Users log in once to access all book listings linked to the system. The final step is to develop a recommendation system in the form of a hybrid recommender system and present the recommendation results to the user.

Figure 1. Framework of hybrid recommender system.

3.1 Functions for connecting multiple publishers

This process collects metadata from other sources. The system links to the book information through the database of the service provider and crawls data on each book, such as title, category details, and URL, to create an index. Users can therefore read the original e-book directly through the URL of the book provider, which respects the copyright policy of each e-book database. This prototype had a wide variety of e-books from the databases of different organizations. All organizations encouraged Thai people to have free access to reading services and research information by creating functions to connect e-book data. However, the functions may be adjusted according to the connection characteristics of different database systems. Initially, the system passes the parameters required by the service and returns values for data use through the following API functions:

• Login function: The Login function supports user login and user logout.
• Get books list function: This function retrieves a list of all books purchased by the agency, along with basic information such as the title, author, publisher, number of pages, and number of copies.
• Get Category function: This function retrieves a list of book categories that the agency purchases, along with the number of books in each category.
• Get books by category function: This function retrieves a list of books in a specific category, along with basic information such as the title, author, publisher, number of pages, and number of copies.
• Get book type function: This function retrieves a list of book types that the agency purchases, along with the number of books of each type.
• Get books by book type function: This function retrieves a list of books in a specific book type, along with basic information such as the title, author, publisher, number of pages, and number of copies.
• Get book detail function: This function retrieves detailed information about a specific book, such as the title, author name, publisher, ISBN, year of publication, number of pages, number of volumes, and description.
• Search books function: This function searches for books available in the system based on the search query, which can be by title, author, publisher, or description.
• Read book function: This function checks the number of books that can be opened for reading.
• Checkout function: This function checks the number of books that can be checked out for online borrowing.

The algorithm in Table 1 provides a structured approach to creating an API that manages user authentication, retrieves book details, and facilitates book reading functionality in a digital library system with multiple publishers. The emphasis is on securing user sessions, validating access permissions, and integrating external publisher APIs for a comprehensive digital library service.

Table 1. Algorithm of the API for multi-publisher connection.

Algorithm: Develop API for Book Reading in a Multi-Publisher Digital Library System

Input: User credentials, Book ID
Output: Access to the book content for reading

Step 1: API Endpoint for User Authentication
FUNCTION authenticateUser (username, password)
    VERIFY user credentials
    IF credentials are valid
        GENERATE a user session token
        RETURN session token
    ELSE
        RETURN authentication error
    END IF
END FUNCTION

Step 2: API Endpoint for Retrieving Book Details
FUNCTION getBookDetails (bookId, sessionToken)
    VALIDATE sessionToken
    IF sessionToken is valid
        SEARCH for bookId in the local database
        IF bookId exists in the local database
            RETRIEVE book details from the local database
            RETURN book details
        ELSE
            CALL externalPublisherAPI (bookId)
            RETRIEVE book details from the publisher's API
            STORE retrieved book details in the local database
            RETURN book details
        END IF
    ELSE
        RETURN session validation error
    END IF
END FUNCTION

Step 3: API Endpoint for Reading Book Content
FUNCTION readBookContent (bookId, sessionToken)
    VALIDATE sessionToken
    IF sessionToken is valid
        RETRIEVE the book content URL or data for bookId
        CHECK copyright and access permissions
        IF access is granted
            PROVIDE access to the book content
            LOG the reading activity
            RETURN book content access
        ELSE
            RETURN access denied message
        END IF
    ELSE
        RETURN session validation error
    END IF
END FUNCTION

Step 4: External Publisher API Interaction
FUNCTION externalPublisherAPI (bookId)
    SEND a request to the publisher's API with bookId
    RECEIVE book details from the publisher's API
    RETURN book details to the calling function
END FUNCTION
End
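
As an illustration only, Steps 1, 2 and 4 of the algorithm in Table 1 might be sketched in Python as follows. The in-memory dictionaries, the sample user, the sample book, and all function and variable names are hypothetical stand-ins rather than the system's actual implementation, and a production system would store hashed passwords rather than plain text.

```python
import secrets

# Hypothetical in-memory stand-ins for the user store, session store, local
# metadata database and publisher API. None of these names come from the paper.
USERS = {"reader01": "s3cret"}  # toy example: real systems store password hashes
SESSIONS = set()
LOCAL_DB = {}
PUBLISHER_CATALOG = {
    "B001": {"title": "Sample e-book", "url": "https://publisher.example/B001"},
}


def authenticate_user(username, password):
    """Step 1: return a session token, or None when the credentials are invalid."""
    if username not in USERS or USERS[username] != password:
        return None
    token = secrets.token_hex(16)
    SESSIONS.add(token)
    return token


def external_publisher_api(book_id):
    """Step 4: stand-in for a request to the publisher's API."""
    return PUBLISHER_CATALOG.get(book_id)


def get_book_details(book_id, session_token):
    """Step 2: serve metadata locally, falling back to the publisher's API."""
    if session_token not in SESSIONS:
        raise PermissionError("session validation error")
    if book_id not in LOCAL_DB:
        details = external_publisher_api(book_id)
        if details is None:
            raise KeyError(book_id)
        LOCAL_DB[book_id] = details  # store metadata only, never the e-book file
    return LOCAL_DB[book_id]
```

After the first lookup the metadata is served from the local store, which mirrors the "STORE retrieved book details in the local database" step of the algorithm.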

3.2 The architecture of the book recommendation system

The architecture for developing the book recommendation system in the digital library consists of several steps, which are illustrated in Figure 2:

• Crawler Data is part of the process that connects multiple publishers. The research develops programs responsible for extracting data from online databases and storing it in a database. The system collects the following information: title, author, date, month, year of publication, and ISSN, which is useful for monitoring user interest and indexing each e-book.
• Digital Library corpus is a database used to store details of the books that the crawler retrieves from an authorized database system. It is an e-book database system developed by the library itself.
• User Profile is created by storing information about each user’s reading behaviour, such as books they have read, books they have selected for their shelf, and books they have rated or reviewed. These data are then processed to find out which books and categories the user likes or dislikes. The result is fed to the Recommender System, which recommends other books that are similar in content or genre to the books the user has already read and enjoyed. The system can also suggest books based on the user’s reading history and preferences, such as authors or topics they have shown interest in.
• Hybrid recommender system combines the recommendations from Content-Based Filtering and Collaborative Filtering to generate a final list of personalized book recommendations for the user. The details are described in the next section.

Figure 2. The architecture of the hybrid book recommendation system development of the online library system.

User Profiles can be stored and collected in the form of implicit feedback, including which books users view in detail and place on their personal bookshelves. Creating user profiles is a process of building a model of user settings. Assume that n users participate in the system, m books have been read, o books have been kept on user shelves, and p books have been reviewed.

Let U be the set of all users in the system, U = {U₁, U₂, …, U_n}; R the set of books read from the digital library collection, R = {r₁, r₂, …, r_m}; K the set of books kept from the digital library collection, K = {k₁, k₂, …, k_o}; and V the set of ratings, V = {v₁, v₂, …, v_p}. URKV_ijal is the set of books read, kept, and rated by user U_i, URKV_ijal = {urkv_i1al, urkv_i2al, …, urkv_ijal}, and E(u_i, urkv_ijal) indicates a relationship between user U_i and the entry urkv_ijal. The user profile is defined as follows:

Definition [User Profile]:

For a user u_i where i = 1, …, n:

Let U_i be the user profile of user u_i.

U_i = {⟨u_i, urkv_ijal⟩ | urkv_ijal ∈ URKV ∧ u_i ∈ U ∧ E(u_i, urkv_ijal) = 1}

When a new user signs up for the digital library system, the recommender system may not be able to generate accurate recommendations since there haven’t been any interactions between the user and the books. Additionally, if the model hasn’t been updated since the user’s registration, the system may not recognize their existence and thus cannot make any predictions for them through CF. To resolve these problems, during the registration process, users are required to select one to three preferred categories. This information is used by a customized content-based filtering algorithm to provide personalized recommendations until the CF model can generate high-quality recommendations based on the user’s interactions.
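
A minimal sketch of how a user profile and the cold-start fallback described above might be represented. The class, field, and function names are hypothetical, and the catalog is assumed to be a simple mapping from book ID to category; the paper does not specify these structures.

```python
from dataclasses import dataclass, field


@dataclass
class UserProfile:
    """Hypothetical container for the read (R), kept (K) and rated (V) sets."""
    user_id: str
    preferred_categories: list  # one to three categories chosen at registration
    read: set = field(default_factory=set)
    kept: set = field(default_factory=set)
    ratings: dict = field(default_factory=dict)  # book_id -> rating

    def has_interactions(self):
        """True once the user has read, kept or rated at least one book."""
        return bool(self.read or self.kept or self.ratings)


def cold_start_recommendations(profile, catalog, limit=5):
    """Return books from the registration categories for a user with no history.

    catalog maps book_id -> category (an assumed, simplified structure).
    """
    picks = [book_id for book_id, category in catalog.items()
             if category in profile.preferred_categories]
    return picks[:limit]
```

In such a design, the recommender would check `has_interactions()` and use the category-based fallback until enough reading behaviour has been collected for CF.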

3.3 Hybrid recommender systems model

A hybrid recommender system is a process that introduces e-books by analyzing data from users’ reading behavior. The system utilizes a combination of Collaborative Filtering (CF) and Content-Based Filtering (CB) to recommend e-books to users. This involves applying a weighted score to the recommendations generated by each of these methods. The process of hybrid recommending e-books to individual users is designed to suggest related e-books or e-books that users are expected to like. This is done by considering the User Profile that is collected from the user. The User Profile includes a set of user read books, the books that are kept in the shelf, and ratings given by the user to different books.

• Collaborative Filtering (CF) is used to identify users who have similar preferences and interests based on their reading behavior. This involves analyzing the behavior of similar users to identify e-books that the user might be interested in. The User Profile is used to identify similar users who share similar interests and preferences. This method is effective in generating recommendations for users who have similar reading habits. The maximum score of user similarity is one. The Collaborative Filtering score is shown in equation 1.
(1)
$\hat{r}_{ib} = \bar{r}_{i} + \frac{\sum_{u_{j} \in U, j \neq i} \mathrm{similarity}(U_{i}, U_{j}) \cdot (r_{jb} - \bar{r}_{j})}{\sum_{u_{j} \in U, j \neq i} |\mathrm{similarity}(U_{i}, U_{j})|}$
$\hat{r}_{ib}$ is the predicted preference or rating for book b by user i
$\bar{r}_{i}$ is the average rating or preference of user i based on their interactions
$r_{jb}$ is the rating or interaction of user j with book b
$\bar{r}_{j}$ is the average rating or preference of user j
$\mathrm{similarity}(U_{i}, U_{j})$ is the similarity score between the profiles of users i and j
• Content-Based Filtering, on the other hand, recommends e-books based on the factors that the user has liked in the past. This involves analyzing factors such as the category of books, the publisher, and the year of publication. This method is useful for recommending e-books that match the user’s specific preferences. All three factors are combined, and the maximum score is one. The details of each score are as follows:
- 1) Category of books score: Let $P_{category}$ be the set of book categories that the user likes, and C the category of the book being considered. The score for the book category can then be calculated and used in a Content-Based Filtering recommendation system to suggest books that align with the user’s preferences, as shown in equation 2.
  (2)
  $\text{Category of books score} = \begin{cases} 1, & \text{if } C \in P_{category} \\ 0, & \text{otherwise} \end{cases}$
  This means that if the category of the book being considered (C) is within the set of categories that the user likes ( $P_{category}$ ), the book will receive a score of 1, indicating high relevance to the user. If the book’s category is not within the user’s preferred category set, it will receive a score of 0, indicating low or no relevance to the user. Using the category score in the recommendation system helps to accurately suggest books that match the user’s interests and preferences in specific book categories.
- 2) Publisher score: Let $P_{publisher}$ be the set of publishers that the user prefers, and P the publisher of the book under consideration. The score for the publisher can then be calculated and used in a Content-Based Filtering recommendation system to suggest books from publishers that the user likes, as shown in equation 3.
  (3)
  $\text{Publisher score} = \begin{cases} 1, & \text{if } P \in P_{publisher} \\ 0, & \text{otherwise} \end{cases}$
  This means that if the publisher of the book under consideration (P) is within the set of publishers that the user likes ( $P_{publisher}$ ), the book will receive a score of 1, indicating high relevance to the user. If the book’s publisher is not within the user’s preferred set of publishers, the book will receive a score of 0, indicating low or no relevance to the user. Moreover, using the publisher score in the recommendation system helps to accurately suggest books from publishers that match the user’s preferences and past positive experiences, enhancing the personalized recommendation process.
- 3) Year of publication score: The Year of Publication Score emphasizes the book’s novelty. The score is obtained by comparing the publication year with the current year. Previous research has integrated the year factor with other elements to facilitate book recommendations,³⁷ as shown in equation 4.
  (4)
  $\text{Year of publication score} = \frac{n}{\operatorname{arg\,max}_{n}(n)}$
- 4) Content-Based Filtering score: this score (CB-Score) is calculated as the average of the Category Score, the Publisher Score, and the Year of Publication Score. Each of these scores contributes to assessing the relevance of a book based on the category it belongs to, its publisher, and its publication date. By adding these scores together and dividing by three, the CB-Score provides a single metric that reflects the book’s overall alignment with a user’s preferences in terms of genre, the credibility or popularity of the publisher, and recency, as shown in equation 5.
  (5)
  $\text{CB-Score} = \frac{\text{Category Score} + \text{Publisher Score} + \text{Year of publication Score}}{3}$
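
The two scoring components above can be sketched in Python as follows. This is a minimal illustration under stated assumptions, not the authors' implementation: the similarity values are taken as given inputs because the similarity function is not specified here, neighbours who have not rated the book are skipped, and equation 4 is interpreted as the publication year divided by the most recent year in the collection. All function and parameter names are hypothetical.

```python
def mean_rating(user_ratings):
    """Average of one user's ratings (the r-bar terms in equation 1)."""
    return sum(user_ratings.values()) / len(user_ratings)


def predict_rating(target, book, ratings, similarity):
    """Equation 1: mean-centred, similarity-weighted rating prediction.

    ratings maps user -> {book: rating}; similarity maps (user_a, user_b) -> score.
    Assumption: neighbours who have not rated `book` are skipped, since r_jb
    is undefined for them.
    """
    target_mean = mean_rating(ratings[target])
    numerator = denominator = 0.0
    for other, other_ratings in ratings.items():
        if other == target or book not in other_ratings:
            continue
        sim = similarity[(target, other)]
        numerator += sim * (other_ratings[book] - mean_rating(other_ratings))
        denominator += abs(sim)
    if denominator == 0:
        return target_mean  # no usable neighbours: fall back to the user's mean
    return target_mean + numerator / denominator


def cb_score(book, liked_categories, liked_publishers, newest_year):
    """Equations 2 to 5: average of category, publisher and year scores."""
    category = 1.0 if book["category"] in liked_categories else 0.0
    publisher = 1.0 if book["publisher"] in liked_publishers else 0.0
    # Assumed reading of equation 4: publication year over the newest year.
    year = book["year"] / newest_year
    return (category + publisher + year) / 3
```

Because each of the three content scores is at most one, the averaged CB score also stays at or below one, matching the stated maximum.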

To generate a final list of personalized e-book recommendations for the user, the recommendations generated by both Content-Based Filtering and Collaborative Filtering are combined. The system uses a weighting scheme to determine the relevance of each recommendation, based on factors such as the user’s past behavior, the popularity of the e-book, and other relevant factors. This results in a list of e-books that are tailored to the user’s interests and preferences, increasing the likelihood that the user will find e-books that they enjoy reading. Here is a formula for a hybrid recommender system that merges collaborative filtering and content-based filtering techniques:

(6)

$\text{Hybrid Score} = (1 - \alpha) \times \text{CF-Score} + \alpha \times \text{CB-Score}$

For example:

$\text{Hybrid Score 50:50} = 0.5 \times \text{CF-Score} + 0.5 \times \text{CB-Score}$

$\text{Hybrid Score 20:80} = 0.2 \times \text{CF-Score} + 0.8 \times \text{CB-Score}$

$\text{Hybrid Score 80:20} = 0.8 \times \text{CF-Score} + 0.2 \times \text{CB-Score}$

where:

CF-Score = similarity between the target user and other users who have similar preferences

CB-Score = relevance score of recommended items based on their content

α = a weighting factor that determines the relative importance of the two scores
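
The weighted combination can be sketched as a short Python function. The function names, the `WEIGHTINGS` table, and the assumption that both input scores are already on a common 0 to 1 scale are illustrative choices, not details confirmed by the paper.

```python
def hybrid_score(cf_score, cb_score, alpha):
    """Hybrid Score = (1 - alpha) * CF-Score + alpha * CB-Score."""
    if not 0.0 <= alpha <= 1.0:
        raise ValueError("alpha must lie between 0 and 1")
    return (1 - alpha) * cf_score + alpha * cb_score


# CF:CB weighting label -> alpha (the weight placed on the CB-Score).
WEIGHTINGS = {"50:50": 0.5, "20:80": 0.8, "80:20": 0.2}


def rank_books(scores, alpha, top_k=15):
    """Rank book IDs by hybrid score; scores maps book_id -> (cf, cb)."""
    ranked = sorted(
        scores,
        key=lambda book_id: hybrid_score(*scores[book_id], alpha),
        reverse=True,
    )
    return ranked[:top_k]
```

For instance, with hypothetical scores `{"a": (0.9, 0.1), "b": (0.2, 1.0)}`, the 80:20 weighting ranks book `a` first, while the 20:80 weighting ranks book `b` first, which shows how the choice of α shifts the ordering.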

Figure 3. The process of hybrid recommender systems model.

4. Experimental approach

The environment in which the experiment is conducted is split into three distinct parts. The first section describes the data set, the second describes the evaluation metric, and the last section describes the experimental results.

4.1 The digital library corpus

The collection of e-books comprises 2,715 items, and 370 members were registered with the Library System for Learning in 2022. The digital library dataset includes the following information for each item: book ID, title, description, keywords, book category, book details, publisher, year of publication, and either an e-book file in the owner’s system or a URL for accessing the full e-book in the case of books from partners.

4.2 Evaluation metric

To evaluate the proposed model, this research invited general users to participate in an evaluation. This is in line with previous research on a hybrid approach to knowledge recommender services.³⁸ In the experimental setup, the research participants were assigned the task of exploring books from the digital library. Seventy-five subjects, members of the general public who were interested in reading digital books and were proficient in using applications, were invited and participated in the evaluation. Each participant was given six different search queries, and all queries were tested using different ranking approaches. The search engines presented the top 15 documents according to their relevance, with i representing the ranking number {i = 1, 2, 3, …, 15}. The participants were then asked to rate the relevancy of the search results using a five-point scale: Score 0 indicating “not relevant at all,” Score 1 indicating “probably not relevant,” Score 2 indicating “less relevant,” Score 3 indicating “probably relevant,” and Score 4 indicating “extremely relevant.” This paper utilized the Normalized Discounted Cumulative Gain (NDCG) metric to measure the performance of every search engine.³⁹ This measurement is specifically designed for evaluating web search performance. The NDCG was calculated using equation 7.

(7)

\mathrm{NDCG}_{q} = M_{q} \sum_{j=1}^{k} \frac{2^{r(j)} - 1}{\log(1 + j)}

The parameter k is the truncation or threshold level, j is the rank position, and the integer r(j) is the relevance score the research participant gave to the result at rank j. The normalization constant M_q is chosen so that the ideal ordering achieves an NDCG score of 1. The NDCG metric rewards relevant documents that appear near the top of the search results, while documents ranked lower contribute less to the score.
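Equation (7) can be sketched in a few lines of Python. This is a minimal illustration rather than the authors' implementation: the function names are hypothetical, the natural logarithm is used (the base cancels after normalization), and the ideal ordering is assumed to be the same ratings sorted in descending order.

```python
import math


def dcg(relevances, k):
    """Sum of (2^r(j) - 1) / log(1 + j) over ranks j = 1..k."""
    return sum((2 ** r - 1) / math.log(1 + j)
               for j, r in enumerate(relevances[:k], start=1))


def ndcg(relevances, k):
    """NDCG at cutoff k; M_q is 1 / DCG of the ideal ordering."""
    ideal = dcg(sorted(relevances, reverse=True), k)
    if ideal == 0:
        return 0.0  # assumption: a list with no relevant result scores 0
    return dcg(relevances, k) / ideal


# Hypothetical ratings (0-4) given by one participant to ranks 1..5.
ratings = [4, 2, 3, 0, 1]
print(round(ndcg(ratings, 5), 4))
```

A list that is already in descending order of relevance scores exactly 1, and moving a highly rated result further down the list lowers the score.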

4.3 Experimental results

User evaluation is the process of collecting feedback from users on the performance of a recommender system. The average NDCG score is calculated by averaging the NDCG scores of all users in the evaluation. Figure 4 compares the average NDCG of five recommender approaches: Hybrid Score50:50, Hybrid Score20:80, Hybrid Score80:20, CF-score, and CB-score. CF-score and CB-score are standalone recommendation algorithms that use CF or CB exclusively. In the graph, the x-axis represents the top 15 ranks of the search results and the y-axis shows the NDCG score. Hybrid Score80:20 has the highest average NDCG among the five approaches, which suggests that it is the most effective at recommending relevant items to users.

Figure 4. Comparison of the average NDCG score.
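A curve of the kind plotted in Figure 4 can be produced by averaging NDCG over all rated result lists at each cutoff. The sketch below assumes the gain and discount of equation (7); the function names and the sample ratings are hypothetical.

```python
import math


def ndcg_at(relevances, k):
    """NDCG at cutoff k with gain 2^r - 1 and discount log(1 + j)."""
    def dcg(rs):
        return sum((2 ** r - 1) / math.log(1 + j)
                   for j, r in enumerate(rs[:k], start=1))
    ideal = dcg(sorted(relevances, reverse=True))
    return dcg(relevances) / ideal if ideal > 0 else 0.0


def mean_ndcg_curve(rating_lists, max_k=15):
    """Average NDCG over all rated lists for each cutoff k = 1..max_k."""
    return [sum(ndcg_at(r, k) for r in rating_lists) / len(rating_lists)
            for k in range(1, max_k + 1)]


# Hypothetical ratings (0-4) for two result lists of one approach.
rated_lists = [[4, 3, 0, 2, 1], [0, 4, 4, 1, 0]]
curve = mean_ndcg_curve(rated_lists, max_k=5)
```

Computing one such curve per approach and plotting the five curves against k gives a comparison in the style of the figure.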

This research applied a One Way ANOVA to NDCG at the top fifteen ranks (K = 1, 1-2, 1-3, …, 1-15) to test whether the mean NDCG differs among three recommender system approaches. The null hypothesis was that there is no statistically significant difference between the CF-score and CB-score. The results indicate that the mean NDCG scores of the three approaches were not equal at a significance level of α = 0.05. In simpler terms, there was a statistically significant difference in the search results. As Table 2 shows, statistically significant differences were observed between the CB-score and CF-score, and between the CB-score and Hybrid Score80:20.

Table 2. Result of multiple comparisons.

Rank (K)	Indexing (I)	Indexing (J)	Mean Difference (I-J)	Std. Error	Sig. (2-tailed)
1-15	CB-score	CF-score	-0.38	0.287	0.014
1-15	CB-score	Hybrid Score80:20	-0.25	0.081	0.003
1-15	Hybrid Score80:20	CF-score	0.47	0.052	0.276
1-15	Hybrid Score80:20	CB-score	0.25	0.081	0.003
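For readers unfamiliar with the test, the F statistic of a One Way ANOVA can be computed as sketched below. This is a plain-Python illustration with hypothetical data, not the analysis behind Table 2; it stops at the F statistic and degrees of freedom, and it does not perform the pairwise comparisons reported in the table. A statistics package such as SciPy (`scipy.stats.f_oneway`) would normally be used to obtain the p-value.

```python
def one_way_anova(groups):
    """Return (F, df_between, df_within) for a list of samples."""
    k = len(groups)
    n = sum(len(g) for g in groups)
    grand_mean = sum(sum(g) for g in groups) / n
    means = [sum(g) / len(g) for g in groups]
    # Variation of the group means around the grand mean.
    ss_between = sum(len(g) * (m - grand_mean) ** 2
                     for g, m in zip(groups, means))
    # Variation of the observations around their own group mean
    # (must be non-zero, otherwise F is undefined).
    ss_within = sum((x - m) ** 2
                    for g, m in zip(groups, means) for x in g)
    df_between, df_within = k - 1, n - k
    f_stat = (ss_between / df_between) / (ss_within / df_within)
    return f_stat, df_between, df_within


# Hypothetical NDCG scores for three approaches (not the study data).
cb = [0.41, 0.45, 0.39, 0.44]
cf = [0.62, 0.58, 0.65, 0.60]
hybrid = [0.66, 0.70, 0.64, 0.69]
f_stat, df_b, df_w = one_way_anova([cb, cf, hybrid])
```

The null hypothesis of equal means is rejected at α = 0.05 when F exceeds the critical value of the F distribution with the returned degrees of freedom.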

5. Conclusion

This paper focused on a heuristic recommender system built on a Hybrid model. Seventy-five participants from the general public took part in the study, and each worked with six queries to examine the e-books returned by the recommender system. The top 15 documents from each search engine were displayed, and the participants rated the relevance of the search results on a five-point scale. The results indicate that the Hybrid model achieves a higher NDCG score than the other models, with Hybrid Score80:20 performing best. A One Way ANOVA was also used to analyze the mean differences among the approaches. The statistical tests indicate that the mean NDCG scores differ among the Hybrid model, CF-score, and CB-score at K = 1-15. However, the mean NDCG scores of the Hybrid model and the CF-score do not differ significantly. Further experiments should explore different Hybrid models.

Aggregating data from multiple online publishers is a formal procedure that requires close collaboration and coordination with the agencies and publishers involved in order to secure comprehensive and accurate data. Challenges such as disparities in data formats, access rights, and data privacy protection must be managed during the system’s development. Handling them well is crucial for the system to draw on diverse data sources and generate valuable, relevant recommendations for users.

The paper has some limitations. The sample of 75 participants may not be representative of the wider population, which may limit the generalizability of the findings. In addition, the participants may have differed in their familiarity with the e-books, which could have influenced their relevance ratings.

The study highlights the importance of using a hybrid model to improve the effectiveness of recommender systems. Future research should examine deep learning techniques for strengthening the personalization of the hybrid model.

Data availability

Underlying data

This research cannot provide the underlying data because it involves copyrighted data from multiple publishers, and all publishers have agreements that prohibit developers from disseminating book information and user experimentation data under the principles of the Personal Data Protection Act (PDPA). The data set was sourced from the Bangkok Digital Library System at https://www.bangkoklibrary.go.th/digital/. To access the dataset, please contact us via email at addigitallibrarybkk@gmail.com.

Extended data

Figshare: Evaluation form for Subject Test.pdf. https://doi.org/10.6084/m9.figshare.22308823.v1.⁴⁰

This project contains the following extended data:

- Evaluation form for Subject Test.pdf

Data are available under the terms of the Creative Commons Attribution 4.0 International license (CC-BY 4.0).

References

1. Li Y, Nishimura N, Yagami H, et al.: An Empirical Study on Online Learners’ Continuance Intentions in China. Sustainability. 2021; 13: 889.
2. Bao W: COVID-19 and online teaching in higher education: A case study of Peking University. Hum. Behav. Emerg. Technol. 2020; 2: 113–115.
3. Resnick P, Varian HR: Recommender systems. Commun. ACM. 1997; 40(3): 56–58.
4. Burke R: Hybrid recommender systems: survey and experiments. User Model. User-Adapt. Interact. 2002; 12(4): 331–370.
5. Adomavicius G, Tuzhilin A: Toward the next generation of recommender systems: a survey of the state-of-the-art and possible extensions. IEEE Trans. Knowl. Data Eng. 2005; 17(6): 734–749.
6. Mooney RJ, Roy L: Content-based book recommending using learning for text categorization. In: Proceedings of the Fifth ACM Conference on Digital Libraries, DL’00. New York, NY, USA: ACM; 2000; pp. 195–204.
7. Pazzani MJ, Billsus D: Content-Based Recommendation Systems. Berlin, Heidelberg: Springer Berlin Heidelberg; 2007; pp. 325–341.
8. Lops P, De Gemmis M, Semeraro G: Content-based recommender systems: State of the art and trends. Recommender Systems Handbook. Springer; 2011; pp. 73–105.
9. Linden G, Smith B, York J: Amazon.com recommendations: Item-to-item collaborative filtering. IEEE Internet Comput. 2003; 7: 76–80.
10. Hu Y, Koren Y, Volinsky C: Collaborative filtering for implicit feedback datasets. 2008 Eighth IEEE International Conference on Data Mining. IEEE; 2008; pp. 263–272.
11. He X, Liao L, Zhang H, et al.: Neural collaborative filtering. Proceedings of the 26th International Conference on World Wide Web, WWW’17, International World Wide Web Conferences Steering Committee, Republic and Canton of Geneva, Switzerland. 2017; pp. 173–182.
12. Balabanović M, Shoham Y: Fab: content-based, collaborative recommendation. Commun. ACM. 1997; 40(3): 66–72.
13. Burke R: Hybrid Web Recommender Systems. Berlin, Heidelberg: Springer Berlin Heidelberg; 2007; pp. 377–408.
14. Strub F, Gaudel R, Mary J: Hybrid recommender system based on autoencoders. Proceedings of the 1st Workshop on Deep Learning for Recommender Systems, DLRS 2016. New York, NY, USA: ACM; 2016; pp. 11–16.
15. Wright K, Golder S, Lewis-Light K: What value is the CINAHL database when searching for systematic reviews of qualitative studies? Syst. Rev. 2015; 4: 104.
16. Wilkins T, Gillies RA, Davies K: EMBASE versus MEDLINE for family medicine searches: can MEDLINE searches find the forest or a tree? Can. Fam. Physician. 2005; 51: 848–849.
17. Halladay CW, Trikalinos TA, Schmid IT, et al.: Using data sources beyond PubMed has a modest impact on the results of systematic reviews of therapeutic interventions. J. Clin. Epidemiol. 2015; 68: 1076–1084.
18. Ahmadi M, Ershad-Sarabi R, Jamshidiorak R, et al.: Comparison of bibliographic databases in retrieving information on telemedicine. J. Kerman Univ. Med. Sci. 2014; 21: 343–354.
19. Lorenzetti DL, Topfer L-A, Dennett L, et al.: Value of databases other than MEDLINE for rapid health technology assessments. Int. J. Technol. Assess. Health Care. 2014; 30: 173–178.
20. Beckles Z, Glover S, Ashe J, et al.: Searching CINAHL did not add value to clinical questions posed in NICE guidelines. J. Clin. Epidemiol. 2013; 66: 1051–1057.
21. Hartling L, Featherstone R, Nuspl M, et al.: The contribution of databases to the results of systematic reviews: a cross-sectional study. BMC Med. Res. Methodol. 2016; 16: 1–13.
22. Aagaard T, Lund H, Juhl C: Optimizing literature search in systematic reviews – are MEDLINE, EMBASE and CENTRAL enough for identifying effect studies within the area of musculoskeletal disorders? BMC Med. Res. Methodol. 2016; 16: 161.
23. Porcel C, Moreno JM, Herrera-Viedma E: A multi-disciplinar recommender system to advice research resources in university digital libraries. Expert Syst. Appl. 2009; 36(10): 12520–12528.
24. Tejeda-Lorente Á, Porcel C, Peis E, Sanz R, et al.: A quality based recommender system to disseminate information in a university digital library. Inf. Sci. 2014; 261: 52–69.
25. Serrano-Guerrero J, Herrera-Viedma E, Olivas JA, et al.: A google wave-based fuzzy recommender system to disseminate information in university digital libraries 2.0. Inf. Sci. 2011; 181(9): 1503–1516.
26. Morawski J, Stepan T, Dick S, et al.: A fuzzy recommender system for public library catalogs. Int. J. Intell. Syst. 2017; 32(10): 1062–1084.
27. Jomsri P: Book recommendation system for digital library based on user profiles by using association rule. 2014 Fourth International Conference on Innovative Computing Technology (INTECH). IEEE; 2014; pp. 130–134.
28. Rhanoui M, Mikram M, Yousfi S, et al.: A hybrid recommender system for patron driven library acquisition and weeding. J. King Saud Univ.-Comput. Inf. Sci. 2020.
29. Yang S-T, Hung M-C: A model for book inquiry history analysis and book-acquisition recommendation of libraries. Libr. Collect. Acquis. Tech. Serv. 2012; 36(3–4): 127–142.
30. Wu F, Hu Y-H, Wang P-R: Developing a novel recommender network-based ranking mechanism for library book acquisition. Electron. Libr. 2017; 35(1): 50–68.
31. Cabrerizo FJ, Morente-Molinera JA, Pérez IJ, et al.: A decision support system to develop a quality management in academic digital libraries. Inf. Sci. 2015; 323: 48–58.
32. Cabrerizo FJ, López-Gijón J, Martínez M, et al.: A fuzzy linguistic extended libqual+ model to assess service quality in academic libraries. Int. J. Inf. Technol. Decis. Mak. 2017; 16(01): 225–244.
33. Bartalesi V, Pratelli N, Lenzi E: Linking different scientific digital libraries in Digital Humanities: the IMAGO case study. Int. J. Digit. Libr. 2022; 23: 303–317.
34. Zhao H, Yao Q, Song Y, et al.: Side Information Fusion for Recommender Systems over Heterogeneous Information Network. ACM Trans. Knowl. Discov. Data. 2021; 15: 1–32.
35. Kim YM, Choi S: Scalable Variational Bayesian Matrix Factorization with Side Information. Proceedings of the Seventeenth International Conference on Artificial Intelligence and Statistics, Reykjavik, Iceland. 22–25 April 2014; pp. 493–502.
36. Strub F, Gaudel R, Mary J: Hybrid Recommender System Based on Autoencoders. Proceedings of the 1st Workshop on Deep Learning for Recommender Systems, Boston, MA, USA. 2016; pp. 11–16.
37. Jomsri P: FUCL mining technique for book recommender system in library service. Procedia Manuf. 2018; 22: 550–557.
38. Niranatlamphong W, Choochaiwattana W: Hybrid Approach for a Knowledge Recommender Service: A Combination of Item-Based and Tag-Based Recommendation. Walailak J. Sci. Tech. 2017; 14(10): 791–799.
39. Kekäläinen J, Järvelin K: Evaluating information retrieval systems under the challenges of interaction and multidimensional dynamic relevance. Proceedings of the 4th CoLIS conference. 2002; pp. 253–270.
40. Jomsri P: Evaluation form for Subject Test.pdf. figshare. Online resource. 2023.

Competing interests

No competing interests were disclosed.

Grant information

The authors thank the Broadcasting and Telecommunications Research and Development Fund for Public Interest for supporting the data, and Suan Sunandha Rajabhat University for a scholarship supporting publication of the paper.
The funders had no role in study design, data collection and analysis, decision to publish, or preparation of the manuscript.

Article Versions (3)

- Version 3 (revised). Published: 18 Nov 2024, 12:1140. https://doi.org/10.12688/f1000research.133013.3
- Version 2 (revised). Published: 04 Apr 2024, 12:1140. https://doi.org/10.12688/f1000research.133013.2
- Version 1. Published: 12 Sep 2023, 12:1140. https://doi.org/10.12688/f1000research.133013.1

© 2024 Jomsri P et al. This is an open access article distributed under the terms of the Creative Commons Attribution License, which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly cited.

how to cite this article

Jomsri P, Prangchumpol D, Poonsilp K and Panityakul T. Hybrid recommender system model for digital library from multiple online publishers [version 2; peer review: 1 approved with reservations, 1 not approved]. F1000Research 2024, 12:1140 (https://doi.org/10.12688/f1000research.133013.2)

NOTE: If applicable, it is important to ensure the information in square brackets after the title is included in all citations of this article.

Open Peer Review

Version 2

Reviewer Report 23 Jul 2024

Asefeh Asemi, Corvinus University of Budapest, Budapest, Hungary

Not Approved

https://doi.org/10.5256/f1000research.164054.r263085

As a reviewer, there are several suggestions to improve the paper. Firstly, the methodology section could be condensed to provide a succinct overview of the research approach, focusing on essential details such as data collection, system development, and experimental design. This would help maintain a better balance between sections and improve readability. Secondly, the evaluation of the hybrid recommender system could be strengthened by increasing the sample size of participants and incorporating additional evaluation metrics to provide a more comprehensive assessment of system performance. Furthermore, discussing the limitations of the study in more detail and exploring their implications for the findings would enhance the paper's credibility. Additionally, providing clearer statements regarding the contribution of the proposed hybrid recommender system compared to existing literature or systems would help clarify the paper's significance.

Competing Interests: No competing interests were disclosed.

Reviewer Expertise: Library and Information Science, Recommender Systems

I confirm that I have read this submission and believe that I have an appropriate level of expertise to state that I do not consider it to be of an acceptable scientific standard, for reasons outlined above.

Author Response 18 Nov 2024

pijitra jomsri, Suan Sunandha Rajabhat University, Dusit, 10300, Thailand

Thank you very much for the suggestions. I have implemented the recommended revisions as follows:

1. The API functions section has been condensed, while other sections have been expanded, as recommended by the committee, to provide clearer details on the model itself.

2. Data collection was expanded from 30 to 150 participants to enhance the credibility of the research findings. Additional evaluation metrics, specifically precision, were incorporated for further assessment, as detailed in Section 4. I have also elaborated on the limitations in Section 5. Note that the user base for eBook services in these libraries remains limited in Thailand, which led to an extended data collection period.

3. I have added content to the conclusion section and provided a more comprehensive comparison with existing research in Section 5 to clarify how this study aligns with previous findings.

Competing Interests: No competing interests were disclosed.

Version 1

Reviewer Report 30 Nov 2023

Muhammad Yousuf Ali, Aga Khan University, Karachi, Karachi, Sindh, Pakistan

Approved with Reservations

https://doi.org/10.5256/f1000research.145977.r222585

I appreciate the authors' research work on the recommender system. However, the following points are recommended to broaden the readership and audience of this article.

1. The research objective is one of the key points of any research, but the authors did not state it. I recommend adding the research objective.

2. The paper does not state the research questions the authors are trying to answer or the hypothesis they are trying to test.

3. The Methodology section is missing the basic concept of the research. The authors appear to carry out quasi-experimental research, but they neither describe this methodology nor include any citations for it.

4. N = 30, as the authors mention in Section 5.0, line 1, but the population and sample characteristics are not defined. Define the sample, for example whether students, researchers, the general public, children, or women took part in this study.

5. In Section 4.3, "Experimental Results", the authors applied a One Way ANOVA test. I recommend stating a non-directional hypothesis, such as "Is there any significant relationship between the Collaborative Filtering (CF) score and the Content-Based (CB) score?", and validating the hypothesis accordingly.

Is the work clearly and accurately presented and does it cite the current literature? Yes
Is the study design appropriate and is the work technically sound? Partly
Are sufficient details of methods and analysis provided to allow replication by others? Partly
If applicable, is the statistical analysis and its interpretation appropriate? Partly
Are all the source data underlying the results available to ensure full reproducibility? Yes
Are the conclusions drawn adequately supported by the results? Partly

Competing Interests: No competing interests were disclosed.

I confirm that I have read this submission and believe that I have an appropriate level of expertise to confirm that it is of an acceptable scientific standard, however I have significant reservations, as outlined above.

Author Response 18 Nov 2024

pijitra jomsri, Suan Sunandha Rajabhat University, Dusit, 10300, Thailand

I am deeply grateful for your valuable guidance on improving my paper. I am pleased to inform you that I have revised the paper (Version 2) according to all your recommendations. The details of the revisions are as follows:

1. Objective of Paper: Added the objective, "To develop a digital library model by studying techniques for combining multiple e-book database systems," which now appears in Section 1. Introduction.

2. Research Questions: Added the research question, "The researcher believes that a hybrid recommender system model can enhance the efficiency of book recommendations in the digital library system," also in Section 1. Introduction.

3. Methodology: Expressed detailed methodology and added citations in Section 4.2 Evaluation Metric.

4. Sample Characteristics: Defined the characteristics of the sample and increased the sample size from 30 to 75 participants, as specified in Section 4.2 Evaluation Metric.

5. Hypothesis: In Section 4.3, I have formulated the hypothesis as recommended by the reviewer.

Competing Interests: No competing interests were disclosed.

Reviewer Report 10 Nov 2023

Asefeh Asemi, Corvinus University of Budapest, Budapest, Hungary

Approved with Reservations

https://doi.org/10.5256/f1000research.145977.r216613

Clarity and Accuracy: The text is well-organized and provides a clear structure for the research paper. The abstract, introduction, methodology, and results sections are clearly defined.

Study Design and Technical Soundness: The study design seems appropriate for the research question of developing a hybrid recommender system for a digital library. The technical components, such as the API connection and hybrid recommender system model, are well-described.

Details of Methods and Analysis: The methods section provides an overview of the digital library corpus, evaluation metric (NDCG), and experimental results. However, specific details about the algorithms used in Collaborative Filtering (CF) and Content-Based Filtering (CB) are not provided. More information on these aspects would enhance the clarity.

Statistical Analysis: The use of the Normalized Discounted Cumulative Gain (NDCG) as an evaluation metric is appropriate for assessing the performance of the recommender system. The statistical analysis, particularly the One Way ANOVA, adds a quantitative aspect to the evaluation.

Source Data Availability: The authors mention that the underlying data cannot be provided due to copyright restrictions from multiple publishers. However, they offer a link to the Bangkok Digital Library System for accessing the dataset.

Support for Conclusions: The conclusion suggests that the Hybrid Score 80:20 method outperforms other models. The provided graph supports this claim.

Constructive Feedback: To enhance clarity, provide more details about the algorithms used in Collaborative Filtering (CF) and Content-Based Filtering (CB). Consider expanding on the limitations of the study, especially regarding the small sample size of 30 participants and potential biases in their familiarity with e-books. Encourage the authors to provide more context on the specific challenges faced in connecting data from multiple online publishers and how these challenges were addressed in the development of the hybrid recommender system.

The work appears promising, but additional details, especially regarding the specific algorithms used, and further context on challenges faced during implementation would contribute to a more comprehensive understanding of the research.

Is the work clearly and accurately presented and does it cite the current literature? Partly
Is the study design appropriate and is the work technically sound? Partly
Are sufficient details of methods and analysis provided to allow replication by others? Partly
If applicable, is the statistical analysis and its interpretation appropriate? Yes
Are all the source data underlying the results available to ensure full reproducibility? Partly
Are the conclusions drawn adequately supported by the results? Partly

Competing Interests: No competing interests were disclosed.

Reviewer Expertise: Library and Information Science, Recommender Systems

Author Response 18 Nov 2024

pijitra jomsri, Suan Sunandha Rajabhat University, Dusit, 10300, Thailand

Thank you for your valuable guidance on improving my paper. I am pleased to inform you that I have revised the paper (Version 2) according to your recommendations. The following improvements have been made:

1. Increased Sample Size: The sample size has been increased to 75 participants to expand the dataset and provide additional comparative results. The experimental findings, as shown in Table 2, indicate that the Hybrid Score 80:20 still performs better.

2. Enhanced Algorithm Details: Additional details about the algorithms have been provided in Section 3.1 (Function for Connecting Multiple Publishers) and Section 3.3 (Hybrid Recommender Systems Model) to clarify the methodology.

3. Challenges and Solutions: Additional information regarding the challenges faced in connecting data from multiple online publishers has been included in the Conclusion.
Competing Interests: No competing interests were disclosed.

Version 3

VERSION 3 PUBLISHED 12 Sep 2023

Open Peer Review

Reviewer Status

Reviewer Reports

	Invited Reviewers
	1	2	3
Version 3 (revision) 18 Nov 24	read		read
Version 2 (revision) 04 Apr 24	read
Version 1 12 Sep 23	read	read

Asefeh Asemi, Corvinus University of Budapest, Budapest, Hungary
Muhammad Yousuf Ali, Aga Khan University, Karachi, Karachi, Pakistan
Monika Verma, Bhilai Institute of Technology, Bhilai, India

Reviewer Report

16 Jan 2025 | for Version 3

Monika Verma, Bhilai Institute of Technology, Bhilai, Chhattisgarh, India

Approved With Reservations

Major:

1) The data source should be provided for a better understanding of the methodology.
2) The methodology section lacks sufficient detail on how the sample was selected, which raises concerns about the generalizability of the results. The methodology needs to be discussed in more detail.
3) Increase the sample size for the performance evaluation.
4) Use at least one more performance metric in the evaluation.

Minor:
1) Cite a few good papers; for example, add the references below (References 1 and 2) and cite them properly in Section 2.

Is the work clearly and accurately presented and does it cite the current literature?

Partly
Is the study design appropriate and is the work technically sound?

Yes
Are sufficient details of methods and analysis provided to allow replication by others?

Yes
If applicable, is the statistical analysis and its interpretation appropriate?

I cannot comment. A qualified statistician is required.
Are all the source data underlying the results available to ensure full reproducibility?

Yes
Are the conclusions drawn adequately supported by the results?

Partly

References

1. Verma M, Patnaik P: An automatic college library book recommendation system using optimized Hidden Markov based weighted fuzzy ranking model. Engineering Applications of Artificial Intelligence. 2024; 130.
2. Verma M, Rawal A: An Enhanced Item-Based Collaborative Filtering Approach for Book Recommender System Design. ECS Transactions. 2022; 107(1): 15439-15449.

Competing Interests

No competing interests were disclosed.

Reviewer Expertise

Recommender System, machine learning, Deep learning, Text Mining, Academic recommender system

Reviewer Report

20 Nov 2024 | for Version 3

Asefeh Asemi, Corvinus University of Budapest, Budapest, Hungary

Approved

Author revisions are acceptable.

Competing Interests

No competing interests were disclosed.

Reviewer Expertise

Library and Information Science, Recommender Systems

I confirm that I have read this submission and believe that I have an appropriate level of expertise to confirm that it is of an acceptable scientific standard.

Reviewer Report

23 Jul 2024 | for Version 2

Asefeh Asemi, Corvinus University of Budapest, Budapest, Hungary

Not Approved

Competing Interests

No competing interests were disclosed.

Reviewer Expertise

Library and Information Science, Recommender Systems

Author Response

18 Nov 2024

pijitra jomsri, Suan Sunandha Rajabhat University, Dusit, 10300, Thailand

Thank you very much for the suggestions. I have implemented the recommended revisions as follows:

The API functions section has been condensed, while other sections have been expanded, as recommended by the committee, to provide clearer details on the model itself.
Data collection was expanded from 30 to 150 participants to enhance the credibility of the research findings. An additional evaluation metric, precision, was incorporated for further assessment, as detailed in Section 4. I have also elaborated on the limitations in Section 5. Note that the user base for e-book services in these libraries remains limited in Thailand, which led to an extended data collection period.
I have added content to the conclusion section and provided a more comprehensive comparison with existing research in Section 5 to clarify how this study aligns with previous findings.

Competing Interests

No competing interests were disclosed.

Reviewer Report

30 Nov 2023 | for Version 1

Muhammad Yousuf Ali, Aga Khan University, Karachi, Karachi, Sindh, Pakistan

Approved With Reservations

Is the work clearly and accurately presented and does it cite the current literature?

Yes
Is the study design appropriate and is the work technically sound?

Partly
Are sufficient details of methods and analysis provided to allow replication by others?

Partly
If applicable, is the statistical analysis and its interpretation appropriate?

Partly
Are all the source data underlying the results available to ensure full reproducibility?

Yes
Are the conclusions drawn adequately supported by the results?

Partly

Competing Interests

No competing interests were disclosed.

Author Response

18 Nov 2024

pijitra jomsri, Suan Sunandha Rajabhat University, Dusit, 10300, Thailand

I am deeply grateful for your valuable guidance on improving my paper. I am pleased to inform you that I have revised the paper (Version 2) according to all your recommendations. The revisions are as follows:

1. Objective of Paper: Added the objective, "To develop a digital library model by studying techniques for combining multiple e-book database systems," which now appears in Section 1. Introduction.

2. Research Questions: Added the research question, "The researcher believes that a hybrid recommender system model can enhance the efficiency of book recommendations in the digital library system," also in Section 1. Introduction.

3. Methodology: Described the methodology in detail and added citations in Section 4.2 Evaluation Metric.

4. Sample Characteristics: Defined the characteristics of the sample and increased the sample size from 30 to 75 participants, as specified in Section 4.2 Evaluation Metric.

5. Hypothesis: In Section 4.3, I have formulated the hypothesis as recommended by the reviewer.

Competing Interests

No competing interests were disclosed.

Reviewer Report

10 Nov 2023 | for Version 1

Asefeh Asemi, Corvinus University of Budapest, Budapest, Hungary

Approved With Reservations

Is the work clearly and accurately presented and does it cite the current literature?

Partly
Is the study design appropriate and is the work technically sound?

Partly
Are sufficient details of methods and analysis provided to allow replication by others?

Partly
If applicable, is the statistical analysis and its interpretation appropriate?

Yes
Are all the source data underlying the results available to ensure full reproducibility?

Partly
Are the conclusions drawn adequately supported by the results?

Partly

Competing Interests

No competing interests were disclosed.

Reviewer Expertise

Library and Information Science, Recommender Systems

Author Response

18 Nov 2024

pijitra jomsri, Suan Sunandha Rajabhat University, Dusit, 10300, Thailand

Thank you for your valuable guidance on improving my paper. I am pleased to inform you that I have revised the paper (Version 2) according to your recommendations. The following improvements have been made:

1. Increased Sample Size: The sample size has been increased to 75 participants to expand the dataset and provide additional comparative results. The experimental findings, as shown in Table 2, indicate that the Hybrid Score 80:20 still performs better.

2. Enhanced Algorithm Details: Additional details about the algorithms have been provided in Section 3.1 (Function for Connecting Multiple Publishers) and Section 3.3 (Hybrid Recommender Systems Model) to clarify the methodology.

3. Challenges and Solutions: Additional information regarding the challenges faced in connecting data from multiple online publishers has been included in the Conclusion.

Competing Interests

No competing interests were disclosed.

Alongside their report, reviewers assign a status to the article:

Approved - the paper is scientifically sound in its current form and only minor, if any, improvements are suggested

Approved with reservations - a number of small changes, or sometimes more significant revisions, are required to address specific details and improve the paper's academic merit

Not approved - fundamental flaws in the paper seriously undermine the findings and conclusions
