ALL Metrics
-
Views
-
Downloads
Get PDF
Get XML
Cite
Export
Track
Research Article
Revised

Hybrid recommender system model for digital library from multiple online publishers

[version 3; peer review: 1 approved, 2 approved with reservations]
PUBLISHED 18 Nov 2024
Author details Author details
OPEN PEER REVIEW
REVIEWER STATUS

Abstract

Background

The demand for online education promotion platforms has increased. In addition, the digital library system is one of the many systems that support teaching and learning. However, most digital library systems store books in the form of libraries that were developed or purchased exclusively by the library, without connecting data with different agencies in the same system.

Methods

A hybrid recommender system model for digital libraries, developed from multiple online publishers, has created a prototype digital library system that connects various important knowledge sources from multiple digital libraries and online publishers to create an index and recommend e-books. The developed system utilizes an API-based linking process to connect various important sources of knowledge from multiple data sources such as e-books on education from educational institutions, e-books from government agencies, and e-books from religious organizations are stored separately. Then, a hybrid recommender system suitable for users was developed using Collaborative Filtering (CF) model together with Content-Based Filtering. This research proposed the hybrid recommender system model, which took into account the factors of book category, reading habits of users, and sources of information. The evaluation of the experiments involved soliciting feedback from system users and comparing the results with conventional recommendation methods.

Results

A comparison of NDCG scores, and Precision scores were conducted for Hybrid Score 50:50, Hybrid Score 20:80, Hybrid Score 80:20, CF-score and CB-score. The experimental result was found that the Hybrid Score 80:20 method had the highest average NDCG score.

Conclusions

Using a hybrid recommender system model that combines 80% Collaborative Filtering, and 20% Content-Based Filtering can improve the recommender method, leading to better referral efficiency and greater overall efficiency compared to traditional approaches.

Keywords

Recommender systems, digital library, multiple database, user profile, hybrid recommender systems, collaborative filtering, content-based filtering

Revised Amendments from Version 2

This research has been revised from the previous version according to reviewer recommendations, with the following updates:
1. sample size: The sample size of the user evaluation process was expanded, increasing the sample size from version1 is 30 to 150 participants to enhance the reliability of the findings and e-book data was updated to 3,512 items
2. Evaluation Metrics: Additional evaluation metrics, specifically precision, were added to Section 4 for a more comprehensive assessment, and NDCG values were reprocessed. The results remained consistent with the initial findings, and the Precision results are aded in the abstract.
3. Limitations: The limitations of the research were further elaborated in Section 5.
4. Methodology: In version 3, the content has been refined based on feedback by condensing the description of API functionality for system development and providing a clearer, more concise explanation. Additionally, in response to previous feedback, descriptions of the algorithms and models for the Hybrid CF and CB approaches, initially recommended for inclusion, were incorporated in version 2.
5. Figures and Tables: Figure 4 and Table 2 were updated to reflect the new NDCG results based on the assessment of 150 participants. Additionally, a precision table and chart were added, presented in Table 3 and Figure 5.
6. Conclusion: The conclusion in Section 5 was expanded to provide a clearer summary of findings and a more comprehensive comparison with existing literature, emphasizing how this study aligns with prior research.
These changes were made to address the reviewers' suggestions thoroughly and to enhance the clarity for readers.

See the authors' detailed response to the review by Muhammad Yousuf Ali
See the authors' detailed response to the review by Asefeh Asemi

1. Introduction

Reading is important for human development in terms of education, career development, quality of life, and national development. In Bangkok, Thailand, there are areas for self-learning through books, or public libraries, available free of charge. Libraries are necessary for people at all levels to use their knowledge from books to improve themselves, enhance their quality of life, create equality, and promote reading to the public. In addition, data published by the World Bank, UNESCO, and the United Nations (UN) demographic data indicate that the Covid-19 outbreak has contributed to over 17% of children worldwide facing a learning crisis. This may affect the potential of the modern population. Also, schools around the world have had to close more than usual over the past year. School closures have resulted in students having to switch to online classes, but the learning system does not cover the world, and many children do not have access to technology to study online,1,2 which may result in a lack of basic literacy skills.

Currently, technology and telecommunication play an essential role in human life, coupled with rapidly advancing computer technology and communication systems. Therefore, the digital library system is another channel for collecting information or electronic books from multiple sources and disseminating them through the Internet. This allows children and general readers to access and search for books through a computer network without any restrictions on location, distance, and duration. It is an opportunity to expand the results of learning resources from physical to online, without borders, to expand opportunities and increase access to books, media, and publications, promoting reading widely and in line with modern society. However, the current online library system is not user-friendly because e-books are stored scattered in separate databases developed by each agency, and this causes users to have to download multiple applications for reading e-books. In addition, most government economic development plans push for the promotion of reading and learning through modern regional library services, creating opportunities for youth groups to have access to quality services that are convenient and fast-quality services.

Based on such problems, the researcher has objective to developed a digital library model by studying techniques for combining multiple e-book database systems. Additionally, the researcher aims to use content to create hybrid recommender systems model for electronic books from multiple structures to serve users in the Bangkok area. This is to enable users to easily access them through a single system. Books were gathered from many important sources, and this system was able to enable the public to access the library system in an online format by designing a database system that linked electronic books from multiple databases together, collecting book information from many sources such as the National Library, government agencies, teaching materials, Dhamma books, novels, and short stories. Moreover, this research presents a model for recommending electronic books to users using the hybrid recommender systems model, combining Collaborative Filtering (CF), heuristic Content-based filtering, and user’s personal data. The researcher believes that hybrid recommender system models can enhance the efficiency of book recommendations in the digital library system. The collaborative filtering of this research concentrates on user reading, while Content-based filtering concentrates on titles, authors, book categories, keywords, and book details to offer suggestions to users in the area of their interest.

The structure of this paper is as follows: Section 2 provides a background and relevant literature. Section 3 outlines the methodology and framework employed in the hybrid recommender system. Section 4 shows the experimental outcomes. Lastly, Section 5 concludes the research and offers recommendations for future research.

2. Literature review

Recommender systems are ubiquitous on the internet. Typically, news websites feature a banner that displays recommendations such as “You may also like” or “People who liked this article also enjoyed this one.” This approach aligns with the traditional definition of recommender systems as outlined by Resnick and Varian,3 that they are systems that study a user’s preferences for a given object to make suggestions that might be useful to the user. Recommender systems enable users to customize their profiles, receive tailored suggestions, and make informed decisions about products and services that align with their preferences. The five primary recommendation techniques include: collaborative filtering, content filtering, demographic filtering, knowledge-based filtering, and utility-based filtering.4 A more fundamental way of categorizing recommender systems is to divide techniques into three primary groups: collaborative filtering, content-based filtering, and hybrid approaches.5

  • Content-based filtering utilizes the “Content” feature of an item to generate user profiles based on their preferences and selections. This technique suggests a list of items that are similar to those that a user has already viewed or appreciated.68

  • Collaborative filtering relies on the exchange of opinions and feedback among users. This technique suggests a list of items that have been favored by other users with similar preferences.911

  • The Hybrid Approach is a blend of Content-based and Collaborative Filtering that leverages both user preferences and item attributes. This technique utilizes a matrix derived from filtering interactions and contextual data from Content-based filtering to provide personalized recommendations.1214

In general, most databases are stored separately for different service providers. Users must know the source and explore various topics of interest.1522 However, some researchers have concluded that a single database is not sufficient to retrieve knowledge for users. Several referral systems for digital libraries have been proposed. Many researchers apply a hybrid model to improve recommender systems. Porcel et al. propose a hybrid system by combines collaborative recommendations and content-based.23 Tejeda-Lorente et al. present a quality-based recommender system that considers the quality of an item in order to assess its relevance.24 Serrano-Guerrero et al. present a fuzzy linguistic recommender model in a university digital library. This model uses the Google Wave approach that provides a shared space for different users and resources.25 Morawski et al. offer a hybrid recommender system for rural libraries by combining content-based and collaborative filtering. The authors suggest the concept of a fuzzy flavor vector to deal with the problem of “cold start” problems caused by the smaller size of this library and the usual sparse data sets.26 Jomsri proposes a library book recommendation system based on user profiling and association rules.27 Some researchers focus a patron-driven hybrid library recommender system by applying machine learning techniques to recommend weeding decision-making operations by extracting and analyzing users’ opinions and ratings.28

Some researchers have tried to develop models for library services, such as Yang and Hung’s proposed recommender system for book acquisition in libraries. The authors employ a basic metric that does not consider user feedback or opinions.29 Wu et al. have introduced a library book acquisition recommender system that employs a network ranking mechanism.30 Cabrerizo et al. suggest an extension to the LibQUAL+ model to address users’ perceptions and evaluate the quality of library services.31,32 Some researchers use linked information spaces for different scientific digital libraries in Digital Humanities.33 Another researcher conducted a study with the aim of developing a recommendation system model that integrates various types of supplementary information, apart from explicit ratings assigned to items. This supplementary information includes social connections between users and data on the items being recommended.34 The main aim of researching the Hybrid Recommendation model is to overcome the issue of insufficient rating data by integrating the information from Content-Based and Collaborative Filtering models. Numerous studies have been conducted in this area, including one that implemented the Bayesian Probabilistic Matrix Factorization Framework to tackle the sparsity problem by supplementing taste data with user evaluation data stored in a matrix. Another study utilized an auto-encoder to learn side information data when user preference information is inadequate. Furthermore, a study was carried out to integrate information by utilizing an automatic encoder to learn the nonlinear activity of users and items while removing stacked noise.35,36 The technique for recommender in this paper applies a hybrid approach model and creates an API for connecting content from multiple e-book databases to recommend users.

3. Methodology and framework of hybrid recommender system

This part describes the framework of the hybrid recommender system including API function for connect multiply publisher, architecture of the book recommendation system, hybrid recommender systems model. The concept of hybrid recommender system was shown in Figure 1. This is a functional overview of a hybrid recommender system for a digital library from multiple online publishers. The system collects data from various publishers by creating a retrieval API and gathers important metadata for indexing. The metadata of various e-books are stored in the database of the developed system, without storing the e-book file from the publisher to maintain the book’s copyright. Partnered publishers for this edition of the book collection include the Listing Agency, Arsom Silp Institute of the Arts, and The Secretariat of the House of Representatives, all of which are valuable books in Thailand. The next step is to develop a digital library system with a channel for accessing book information. The login will be in the form of a one-time login for users to access all book listings linked to the system. The final step is to develop a recommendation system in the form of a hybrid recommender system and present the recommendation results to the user.

016e9989-c747-4802-a2a9-2814dc2b4acc_figure1.gif

Figure 1. Framework of hybrid recommender system.

3.1 Function for connect multiply publisher

The process of collecting Mata data from other sources. The system will link the book information through the database of the service provider and crawl data to collect information on each book for a created index such as title, category details, URL, etc. Therefore, users can read the original E-book through the URL of the book provider directly to support copyright from each E-book database policy. This prototype had a wide variety of e-books from a different database of organizations. All organizations encouraged Thai people to have access to reading services research information free of charge by creating functions to connect E-book data. However, the function may be adjusted according to the connection characteristics of different database systems. Initially, the system passes parameters required by the service and return values for data use as the following API functions including in Login function provides secure user authentication. The Get books list function retrieves a comprehensive list of all books purchased by the agency, displaying essential details such as title, author, publisher, page count, and number of copies. The Get Category function and Get books by category function organize books by category, while the Get book type function and Get books by book type function categorize books based on type. The Get book detail function fetches in-depth information for a specific book, including title, author, publisher, ISBN, publication year, page count, volume count, and description. The Search book’s function enables users to locate books based on title, author, publisher, or description, enhancing search efficiency. The Read book function allows book access, and the Checkout function facilitates book borrowing.

Table 1 show algorithm provides a structured approach to creating an API that manages user authentication, retrieves book details, and facilitates book reading functionality in a digital library system with multiple publishers. The emphasis is on securing user sessions, validating access permissions, and integrating external publisher APIs for a comprehensive digital library service.

Table 1. Algorithm of API for connection Multi-Publisher.

Algorithm: Develop API for Book Reading in a Multi-Publisher Digital Library System
Input: User credentials, Book ID
Output: Access to the book content for reading
Step 1: API Endpoint for User Authentication
 FUNCTION authenticateUser (username, password)
  VERIFY user credentials
  IF credentials are valid
   GENERATE a user session token
   RETURN session token
  ELSE
   RETURN authentication error
  END IF
 END FUNCTION

Step 2: API Endpoint for Retrieving Book Details
 FUNCTION getBookDetails (bookId, sessionToken)
  VALIDATE sessionToken
  IF sessionToken is valid
   SEARCH for bookId in the local database
   IF bookId exists in the local database
    RETRIEVE book details from the local database
    RETURN book details
   ELSE
    CALL externalPublisherAPI (bookId)
    RETRIEVE book details from the publisher's API
    STORE retrieved book details in the local database
    RETURN book details
   END IF
  ELSE
   RETURN session validation error
  END IF
 END FUNCTION
Step 3: API Endpoint for Reading Book Content
 FUNCTION readBookContent (bookId, sessionToken)
  VALIDATE sessionToken
  IF sessionToken is valid
   RETRIEVE the book content URL or data for bookId
   CHECK copyright and access permissions
   IF access is granted
    PROVIDE access to the book content
    LOG the reading activity
    RETURN book content access
   ELSE
    RETURN access denied message
   END IF
  ELSE
   RETURN session validation error
  END IF
 END FUNCTION

Step 4: External Publisher API Interaction
 FUNCTION externalPublisherAPI (bookId)
  SEND a request to the publisher's API with bookId
  RECEIVE book details from the publisher's API
  RETURN book details to the calling function
 END FUNCTION
End

3.2 The architecture of the book recommendation system

The architecture for developing the book recommendation system in the digital library consists of several steps, which are illustrated in Figure 2:

  • Crawler Data is a detail within the session that connects multiple publishers. The research develops programs responsible for extracting data from online databases and storing it in a database. The system collects the following information: title, author, date, month, year of publication, and ISSN, which is useful for monitoring user interest and indexing each e-book.

  • Digital Library corpus is a database used to store details of books that Crawler retrieves from an authorized database system and is an e-book database system developed by the library itself.

  • User Profile is created by storing information about each user’s reading behaviour, such as books they have read, books they have selected for their shelf, and books they have rated or reviewed, and these data are then processed to find out which books and what categories the user likes or dislikes in order to bring information to be fed to the Recommender System to recommend other books that are similar in content or genre to the books the user has already read and enjoyed. The system can also suggest books based on the user’s reading history and preferences, such as authors or topics they have shown interest in.

  • Hybrid recommender system Combines the recommendations from Content-Based Filtering and Collaborative Filtering to generate a final list of personalized book recommendations for the user. The details are described in the next Session.

016e9989-c747-4802-a2a9-2814dc2b4acc_figure2.gif

Figure 2. The architecture of the hybrid book recommendation system development of the online library system.

User Profiles can be stored and collected in the form of implicit feedback, including which books users view details and place on their personal bookshelves. Creating user profiles is a process of building a model of user settings. Assuming that there are n users participating in the system, m is books have been read, o is books have been keep in user shelf, and p is books have been reviewed.

Let U be a set of all the users contained in the system; U = {U1, U2 …, Un}, R is a set of books read from digital library collection; R = {r1, r2 •••, rm}, K is a set of keep from digital library collection; K = {k1, k2 •••, ko}, V is a set of rating; V = {v1, v2 •••, vp}, URKVijal is a set of user read books and keep book and rating book by user Ui; URKVijkl = {urkv ilal, urkv i2al, …, urkvijal} and Let E (ui, urkvijal) indicates a relationship among user Ui, with read URij. Here is the definition of the user profile:

Definition [User Profile]:

For a user pi where i = 1, .., n;

Let Ui; be a user profile of user ui.

Ui; = {< ui, urkvijal>/urkvijalURKV^ uiU ^ E (ui, urkvijal) = 1}

When a new user signs up for the digital library system, the recommender system may not be able to generate accurate recommendations since there haven’t been any interactions between the user and the books. Additionally, if the model hasn’t been updated since the user’s registration, the system may not recognize their existence and thus cannot make any predictions for them through CF. To resolve these problems, during the registration process, users are required to select one to three preferred categories. This information is used by a customized content-based filtering algorithm to provide personalized recommendations until the CF model can generate high-quality recommendations based on the user’s interactions.

3.3 Hybrid recommender system model

A hybrid recommender system is a process that introduces e-books by analyzing data from users’ reading behavior. The system utilizes a combination of Collaborative Filtering (CF) and Content-Based Filtering (CB) to recommend e-books to users. This involves applying a weighted score to the recommendations generated by each of these methods. The process of hybrid recommending e-books to individual users is designed to suggest related e-books or e-books that users are expected to like. This is done by considering the User Profile that is collected from the user. The User Profile includes a set of user read books, the books that are kept in the shelf, and ratings given by the user to different books.

  • Collaborative Filtering (CF) is used to identify users who have similar preferences and interests based on their reading behavior. This involves analyzing the behavior of similar users to identify e-books that the user might be interested in. The User Profile is used to identify similar users who share similar interests and preferences. This method is effective in generating recommendations for users who have similar reading habits. The maximum score of user similarity is one. Collaborative Filtering score is showed in equation 1.

    (1)
    r̂ib=r¯i+ujU,jisimilarityUiUj.rjbr¯jujU,jisimilarityUiUj

    r̂ib is the predicted preference or rating for book b by user i

    r¯i is the average rating or preference of user i based on their interactions

    rjb is the rating or interaction of user j with book b

    r¯j is the average rating or preference of user j

    similarityUiUj is the similarity score between the profiles of users i and j

  • Content-Based Filtering, on the other hand, recommends e-books based on the factors that the user has liked in the past. This involves analyzing factors such as the category of books, the publisher, and the year of publication. This method is useful for recommending e-books that match the user’s specific preferences. All of three factors are combined and maximum score is one. The detail of each score as follow:

    • 1) Category of books Score: The definition of Pcategory as a set of book categories that the user likes, and C as the category of the book being considered, allows us to calculate the score for the book category. This can be utilized in a Content-Based Filtering recommendation system to suggest books that align with the user’s preferences, as shown in equation 2.

      (2)
      Category of books Score={1,ifcPcategory0,Otherwise

      This means that If the category of the book being considered (C) is within the set of categories that the user likes ( Pcategory ), the book will receive a score of 1, indicating high relevance to the user. If the book’s category is not within the user’s preferred category set, it will receive a score of 0, indicating low or no relevance to the user. Using category score in the recommendation system helps to accurately suggest books that match the user’s interests and preferences in specific book categories.

    • 2) Publisher score: The definition of Ppublisher as the set of publishers that the user prefers, and P as the publisher of the book under consideration, we can use this information to calculate the score for the publisher, which can be utilized in a Content-Based Filtering recommendation system to suggest books from publishers that the user likes, as shown in equation 3.

      (3)
      Publisher score={1,ifPPpublisher0,Otherwise

      This means that If the publisher of the book under consideration (P) is within the set of publishers that the user likes Ppublisher the book will receive a score of 1, indicating high relevance to the user. If the book’s publisher is not within the user’s preferred set of publishers, the book will receive a score of 0, indicating low or no relevance to the user. Moreover, using publisher score in the recommendation system helps to accurately suggest books from publishers that match the user’s preferences and past positive experiences, enhancing the personalized recommendation process.

    • 3) Year of publication score: Year of Publication Score emphasizes the book’s novelty. The scoring process evaluates by contrasting the publication year with the current year. Previously, there has been research that has integrated the year factor with other elements to facilitate book recommendations,37 as shown in equation 4.

      (4)
      Year of publication score=nargnmaxn

    • 4) Content-Based Filtering score: this score (CBScore) is calculated as the average from of the Category Score, and the Publisher Score and the Year of Publication Score. Each of these scores contributes to assessing the relevance of a book based on its publication date, the category it belongs to, and the publisher. By adding these scores together and dividing by three, the CB score provides a comprehensive metric that reflects the book’s overall alignment with a user’s preferences in terms of recency book, genre, and the credibility or popularity of the publisher as shown in equation 5.

      (5)
      CB-Score=Category Score+Publisher Score+Year of publication Score3

To generate a final list of personalized e-book recommendations for the user, the recommendations generated by both Content-Based Filtering and Collaborative Filtering are combined. The system uses a weighting scheme to determine the relevance of each recommendation, based on factors such as the user’s past behavior, the popularity of the e-book, and other relevant factors. This results in a list of e-books that are tailored to the user’s interests and preferences, increasing the likelihood that the user will find e-books that they enjoy reading. Here is a formula for a hybrid recommender system that merges collaborative filtering and content-based filtering techniques:

(6)
Hybrid Score=1α×CF-Score+α×CB-Score

Such as:

Hybrid Score50:50=0.5×CF-Score+0.5×CB-Score
Hybrid Score20:80=0.2×CF-Score+0.8×CB-Score
Hybrid Score80:20=0.8×CF-Score+0.2×CB-Score
where:

CF-Score = similarity between the target user and other users who have similar preferences

CB-Score = relevance score of recommended items based on their content

α = a weighting factor that determines the relative importance of the two scores

016e9989-c747-4802-a2a9-2814dc2b4acc_figure3.gif

Figure 3. The process of hybrid recommender systems model.

4. Experimental approach

The environment in which the experiment is conducted is split into three distinct parts. The first section describes the data set, the second describes the evaluation metric, and the last section describes the experimental results.

4.1 The digital library corpus

The collection of E-books comprises 3,512 items, while the number of members registered is 1,120 members from Library System for Learning in 2023. The digital library dataset includes the following information for each item: book ID, title, description, keywords, book categories, keywords, and book details, category of books, the publisher, and the year of publication, and either an e-book file in the owner’s system or a URL for accessing the full e-book in the case of books from partners.

4.2 Evaluation metric

To address the proposed experiment, this research carried out a study by inviting general users to participate in an evaluation. This aligns with the research on a hybrid approach to knowledge recommender services as documented in study.38 In the experimental setup, the research participants were assigned the task of exploring books from the digital library. The 150 subjects, specifically members of the general public interested in reading digital books and proficient in using applications, were invited and participated in the evaluation. Each participant was given six different search queries, and all queries were tested using different ranking approaches. The search engines presented the top 15 documents according to their relevance, with i representing the ranking number {i = 1, 2, 3, …, 15}. The participants were then asked to rate the relevancy of the search results using a five-point scale: Score 0 indicating “not relevant at all,” Score 1 indicating “probably not relevant,” Score 2 indicating “less relevant,” Score 3 indicating “probably relevant,” and Score 4 indicating “extremely relevant.” This paper utilized the Normalized Discounted Cumulative Gain (NDCG) metric to measure the performance of every search engine.39 This measurement is specifically designed for evaluating web search performance. The NDCG was calculated using the equation (7).

(7)
NDCGq=Mqj=1k2rj1log1+j

The parameter k represents the truncation or threshold level, while the integer r(j) denotes the relevancy score given by the research participant. The normalization constant Mq is calculated to ensure that the ideal ordering would achieve an NDCG score of 1. The NDCG metric emphasizes relevant documents that appear among the top search results while penalizing irrelevant documents by reducing their impact on the NDCG score.

In addition, this research calculates precision to assess the percentage of recommended book items which are relevant among all book items suggested by the system. Precision emphasizes the accuracy of recommendations by indicating the extent to which the recommended books align with the user’s interests or search preferences. For each model, Precision is calculated using the following in equation (8). In this case, the system provides each user with 15 recommended items, and we calculate the proportion of these that are relevant or useful for the user.

(8)
Precision=Number of Relevant Items in Book Recommendations15

4.3 Experimental results

User evaluation refers to the process of collecting feedback from users on the performance of a recommender system. NDCG average score is a metric used to evaluate the performance of the system, calculated by taking the average of the NDCG scores for all users in the dataset. A comparison of NDCG of Hybrid Score50:50, Hybrid Score20:80, Hybrid Score80:20, CF-score and CB-score are shown in Figure 4. CF-score and CB-score are standalone recommendation algorithms that use either CF or CB exclusively. The study compares the average NDCG scores of five distinct recommender approaches. The graph has the x-axis representing the top 15 ranks of the search results and the y-axis displaying the NDCG score. Based on the graph, it appears that the Hybrid Score80:20 method has the highest NDCG average score among the five different recommender approaches being compared. This suggests that the Hybrid Score80:20 algorithm is the most effective at recommending relevant items to users.

016e9989-c747-4802-a2a9-2814dc2b4acc_figure4.gif

Figure 4. Comparison of the average NDCG score.

This research applied One Way ANOVA on NDCG at top fifteen ranks (K = 1, 1-2, 1-3,…, 1-15) respectively to test whether there is a difference among the mean NDCG from three different recommender system model approaches. The researcher set up the hypothesis that the is no statistically significant difference between the CF-score and CB-score. The result indicates that the means of NDCG for the tree approaches to recommender system models were not equal with a significance level of α = 0.05. In simpler terms, there was a statistically significant difference in the search results. From Table 2 statistically significant differences were observed in the search results between the CB-score and CF-score, and between the CB-score and HybridScore 80:20.

Table 2. Result of multiple comparisons.

Rank (K)IndexingMean Difference (I-J)Std. Error Sig. (2-tailed) (I)
1-15CB-score CF-score -0.390.2600.010
Hybrid Score80:20-0.270.0750.002
1-15Hybrid Score80:20CF-score 0.420.0480.220
CB-score 0.270.0750.002

The precision result in Table 3 shows that Hybrid Score 80:20 has the highest Precision at 0.82. This means that, on average, 82% of the items this model recommends are relevant to the user. This indicates that the Hybrid Score 80:20 model is the most accurate in providing recommendations that match user interests or needs. The other hybrid models and the CF model also perform well, while the CB model has the lowest Precision, highlighting the benefit of using hybrid approaches for more accurate recommendations.

Table 3. Precision results.

Model Precision
Hybrid Score 50:500.75
Hybrid Score 20:800.73
Hybrid Score 80:200.82
CF Score0.70
CB Score0.68
016e9989-c747-4802-a2a9-2814dc2b4acc_figure5.gif

Figure 5. Precision of different recommendation models.

5. Conclusion

The main focus of this research paper is the utilization of a heuristic recommender system that utilizes a Hybrid model. 150 participants were involved in the study from general public, and each participant generated six queries to investigate the e-books obtained through the recommender system. The top 15 documents for each search engine were displayed for relevance, and the participants rated the search results on a five-point scale based on relevancy. The results of the study indicate that the Hybrid model outperforms other models with a higher NDCG score, and precision score which suggests that the Hybrid Score80:20 performs better than other recommender models. Additionally, a One Way ANOVA was used to further analyze the mean difference results of CF-score and CB-score. The statistical testing results indicate that the mean NDCG scores differ among the Hybrid model, CF-score , and CB-score at k = 1-15. However, the mean NDCG scores do not differ between the Hybrid model and CF-filtering. The study suggests that further experimentation should be conducted to explore different Hybrid models. The process of aggregating data from multiple online publishers is a formal procedure that necessitates close collaboration and coordination with various agencies and publishing entities to secure comprehensive and accurate data. Specific challenges such as disparities in data formats, access rights, and data privacy protection must be effectively managed during the system’s development. These details are crucial for the system to leverage diverse data sources to generate valuable and relevant recommendations for users. Moreover, the hybrid recommender system addresses key limitations found in existing models by combining Collaborative Filtering and Content-Based Filtering in an optimized 80:20 ratio. Unlike previous systems, this approach enhances recommendation accuracy by capturing both user preferences and item attributes, achieving a balance that many traditional methods lack. Empirical results indicate that this hybrid model outperforms standard Collaborative and Content-Based systems, particularly in contexts where data sparsity poses a challenge. This system’s design has practical applications for digital libraries, offering a scalable and efficient solution that can adapt to diverse user needs. These contributions clarify the significance of the proposed system by demonstrating its capability to improve user experience through more personalized and accurate recommendations.

From this research, the hybrid recommender system model for a digital library incorporating data from multiple online publishers has generated user profiles through utilizing reading data, keeping books for reading within the system, and individual user book ratings. This model aligns with research on library book recommendations, such as Jomsri study, which also uses book borrowing data from libraries to build user profiles for book recommendations.27 Furthermore, this approach is consistent with Wu et al research which enhances the accuracy of library book recommendations by utilizing book borrowing and return data, outperforming traditional librarian rankings.30 Additionally, this study supports the use of a hybrid model, as the Hybrid Score 80:20 model demonstrates outstanding performance. This finding aligns with previous research highlighting the advantages of hybrid recommendation systems in library settings. Porcel et al. reported that hybrid systems combining Collaborative and Content-Based techniques leverage both user preferences and item attributes, resulting in more precise recommendations, particularly in digital libraries.23 Moreover, Morawski et al. developed a hybrid model for public libraries using fuzzy logic, supporting that hybrid approaches can significantly enhance recommendation accuracy, especially in cases of limited user interaction data.26

The paper has some limitations such as the sample size of 150 participants, which may not be representative of the wider population, and may limit the generalizability of the study findings. Additionally, the participants in the study may have had different levels of familiarity with the e-books, which could have influenced their ratings of relevancy. Another limitation of this study is the complexity involved in aggregating data from multiple online publishers. Each publisher may have distinct data formats, access protocols, and privacy policies, which require significant coordination and standardization. These challenges could affect the scalability of the system if applied to a broader range of publishers, as the need for consistent and secure data handling intensifies with system expansion. Developing more robust data integration protocols could be a crucial area for future work, enabling the system to incorporate diverse data sources while maintaining data accuracy and privacy. Furthermore, the study highlights the importance of using a hybrid model to improve the effectiveness of recommender systems. Future research should delve into the capabilities of deep learning techniques to augment the personalization aspect of the hybrid model.

Comments on this article Comments (0)

Version 3
VERSION 3 PUBLISHED 12 Sep 2023
Comment
Author details Author details
Competing interests
Grant information
Copyright
Download
 
Export To
metrics
Views Downloads
F1000Research - -
PubMed Central
Data from PMC are received and updated monthly.
- -
Citations
CITE
how to cite this article
Jomsri P, Prangchumpol D, Poonsilp K and Panityakul T. Hybrid recommender system model for digital library from multiple online publishers [version 3; peer review: 1 approved, 2 approved with reservations]. F1000Research 2024, 12:1140 (https://doi.org/10.12688/f1000research.133013.3)
NOTE: If applicable, it is important to ensure the information in square brackets after the title is included in all citations of this article.
track
receive updates on this article
Track an article to receive email alerts on any updates to this article.

Open Peer Review

Current Reviewer Status: ?
Key to Reviewer Statuses VIEW
ApprovedThe paper is scientifically sound in its current form and only minor, if any, improvements are suggested
Approved with reservations A number of small changes, sometimes more significant revisions are required to address specific details and improve the papers academic merit.
Not approvedFundamental flaws in the paper seriously undermine the findings and conclusions
Version 3
VERSION 3
PUBLISHED 18 Nov 2024
Revised
Views
6
Cite
Reviewer Report 16 Jan 2025
Monika Verma, Bhilai Institute of Technology, Bhilai, Chhattisgarh, India 
Approved with Reservations
VIEWS 6
Major:

1) Data Source should be provided for better understanding of methodology
2) The methodology section lacks sufficient detail on how the sample was selected, which raises concerns about the generalizability of the results. Methodology need to be ... Continue reading
CITE
CITE
HOW TO CITE THIS REPORT
Verma M. Reviewer Report For: Hybrid recommender system model for digital library from multiple online publishers [version 3; peer review: 1 approved, 2 approved with reservations]. F1000Research 2024, 12:1140 (https://doi.org/10.5256/f1000research.174529.r347047)
NOTE: it is important to ensure the information in square brackets after the title is included in all citations of this article.
Views
4
Cite
Reviewer Report 20 Nov 2024
Asefeh Asemi, Corvinus University of Budapest, Budapest, Hungary 
Approved
VIEWS 4
Author revisions ... Continue reading
CITE
CITE
HOW TO CITE THIS REPORT
Asemi A. Reviewer Report For: Hybrid recommender system model for digital library from multiple online publishers [version 3; peer review: 1 approved, 2 approved with reservations]. F1000Research 2024, 12:1140 (https://doi.org/10.5256/f1000research.174529.r341399)
NOTE: it is important to ensure the information in square brackets after the title is included in all citations of this article.
Version 2
VERSION 2
PUBLISHED 04 Apr 2024
Revised
Views
16
Cite
Reviewer Report 23 Jul 2024
Asefeh Asemi, Corvinus University of Budapest, Budapest, Hungary 
Not Approved
VIEWS 16
As a reviewer, there are several suggestions to improve the paper. Firstly, the methodology section could be condensed to provide a succinct overview of the research approach, focusing on essential details such as data collection, system development, and experimental design. ... Continue reading
CITE
CITE
HOW TO CITE THIS REPORT
Asemi A. Reviewer Report For: Hybrid recommender system model for digital library from multiple online publishers [version 3; peer review: 1 approved, 2 approved with reservations]. F1000Research 2024, 12:1140 (https://doi.org/10.5256/f1000research.164054.r263085)
NOTE: it is important to ensure the information in square brackets after the title is included in all citations of this article.
  • Author Response 18 Nov 2024
    pijitra jomsri, Suan Sunandha Rajabhat University, Dusit, 10300, Thailand
    18 Nov 2024
    Author Response
    Thank you very much for the suggestions. I have implemented the recommended revisions as follows:
    1. The API functions section has been condensed, while other sections have been expanded,
    ... Continue reading
COMMENTS ON THIS REPORT
  • Author Response 18 Nov 2024
    pijitra jomsri, Suan Sunandha Rajabhat University, Dusit, 10300, Thailand
    18 Nov 2024
    Author Response
    Thank you very much for the suggestions. I have implemented the recommended revisions as follows:
    1. The API functions section has been condensed, while other sections have been expanded,
    ... Continue reading
Version 1
VERSION 1
PUBLISHED 12 Sep 2023
Views
23
Cite
Reviewer Report 30 Nov 2023
Muhammad Yousuf Ali, Aga Khan University, Karachi, Karachi, Sindh, Pakistan 
Approved with Reservations
VIEWS 23
I appreciated the Authors research work about the recommender system. However, following the following points are recommend to improve the readership and audience of this article. 

1. Research objective is the one of the key point to ... Continue reading
CITE
CITE
HOW TO CITE THIS REPORT
Ali MY. Reviewer Report For: Hybrid recommender system model for digital library from multiple online publishers [version 3; peer review: 1 approved, 2 approved with reservations]. F1000Research 2024, 12:1140 (https://doi.org/10.5256/f1000research.145977.r222585)
NOTE: it is important to ensure the information in square brackets after the title is included in all citations of this article.
  • Author Response 18 Nov 2024
    pijitra jomsri, Suan Sunandha Rajabhat University, Dusit, 10300, Thailand
    18 Nov 2024
    Author Response
    I am deeply grateful for your valuable guidance on improving my paper.  I am pleased to inform you that I have revised the paper (Version 2) according to all your ... Continue reading
COMMENTS ON THIS REPORT
  • Author Response 18 Nov 2024
    pijitra jomsri, Suan Sunandha Rajabhat University, Dusit, 10300, Thailand
    18 Nov 2024
    Author Response
    I am deeply grateful for your valuable guidance on improving my paper.  I am pleased to inform you that I have revised the paper (Version 2) according to all your ... Continue reading
Views
23
Cite
Reviewer Report 10 Nov 2023
Asefeh Asemi, Corvinus University of Budapest, Budapest, Hungary 
Approved with Reservations
VIEWS 23
Clarity and Accuracy: The text is well-organized and provides a clear structure for the research paper. The abstract, introduction, methodology, and results sections are clearly defined. 

Study Design and Technical Soundness: The study design seems appropriate for the research ... Continue reading
CITE
CITE
HOW TO CITE THIS REPORT
Asemi A. Reviewer Report For: Hybrid recommender system model for digital library from multiple online publishers [version 3; peer review: 1 approved, 2 approved with reservations]. F1000Research 2024, 12:1140 (https://doi.org/10.5256/f1000research.145977.r216613)
NOTE: it is important to ensure the information in square brackets after the title is included in all citations of this article.
  • Author Response 18 Nov 2024
    pijitra jomsri, Suan Sunandha Rajabhat University, Dusit, 10300, Thailand
    18 Nov 2024
    Author Response
    Thank you for your valuable guidance on improving my paper. I am pleased to inform you that I have revised the paper (Version 2) according to your recommendations. The following ... Continue reading
COMMENTS ON THIS REPORT
  • Author Response 18 Nov 2024
    pijitra jomsri, Suan Sunandha Rajabhat University, Dusit, 10300, Thailand
    18 Nov 2024
    Author Response
    Thank you for your valuable guidance on improving my paper. I am pleased to inform you that I have revised the paper (Version 2) according to your recommendations. The following ... Continue reading

Comments on this article Comments (0)

Version 3
VERSION 3 PUBLISHED 12 Sep 2023
Comment
Alongside their report, reviewers assign a status to the article:
Approved - the paper is scientifically sound in its current form and only minor, if any, improvements are suggested
Approved with reservations - A number of small changes, sometimes more significant revisions are required to address specific details and improve the papers academic merit.
Not approved - fundamental flaws in the paper seriously undermine the findings and conclusions
Sign In
If you've forgotten your password, please enter your email address below and we'll send you instructions on how to reset your password.

The email address should be the one you originally registered with F1000.

Email address not valid, please try again

You registered with F1000 via Google, so we cannot reset your password.

To sign in, please click here.

If you still need help with your Google account password, please click here.

You registered with F1000 via Facebook, so we cannot reset your password.

To sign in, please click here.

If you still need help with your Facebook account password, please click here.

Code not correct, please try again
Email us for further assistance.
Server error, please try again.