-
Data quality assessment in digital score libraries International Journal on Digital Libraries Pub Date : 2021-03-21 Francesco Foscarin, Philippe Rigaux, Virginie Thion
Sheet music scores have been the traditional way to preserve and disseminate western classical music works for centuries. Nowadays, their content can be encoded in digital formats that yield a very detailed representation of music content expressed in the language of music notation. These digital scores constitute, therefore, an invaluable asset for digital library services such as search, analysis
-
Building an archaeological data repository: a digital library and digital humanities collaboration at the University of South Florida International Journal on Digital Libraries Pub Date : 2021-01-11 Xiying Mi, Richard Bernardy, LeEtta Schmidt
Digital Humanities projects have shown to be projects of collaboration and interdisciplinary cooperation. As digital humanities researchers are creating and discovering new methods of collaboration, libraries have been reflecting on how they can best support and nurture such collaborations. This paper aims to demonstrate a practical case of what the University of South Florida Digital Collections has
-
A user-transaction-based recommendation strategy for an educational digital library International Journal on Digital Libraries Pub Date : 2021-01-18 Gerd Kortemeyer, Stefan Dröschler
The automated recommendation of content resources to learners is one of the most promising functions of educational digital libraries. Underlying strategies should take the individual progress of the learner into account to provide appropriate recommendations that are meaningful to the learner. If presented with appropriate assistance, learners will more likely engage in productive learning strategies
-
Analyzing history-related posts in twitter International Journal on Digital Libraries Pub Date : 2020-10-28 Yasunobu Sumikawa, Adam Jatowt
Microblogging platforms such as Twitter have been increasingly used nowadays to share information between users. They are also convenient means for propagating content related to history. Hence, from the research viewpoint they can offer opportunities to analyze the way in which users refer to the past, and how as well as when such references appear and what purposes they serve. Such study could allow
-
Multilabel graph-based classification for missing labels International Journal on Digital Libraries Pub Date : 2020-10-12 Yasunobu Sumikawa, Tatsurou Miyazaki
Assigning several labels to digital data is becoming easier as this can be achieved in a collaborative manner with Internet users. However, this process is still a challenge, especially in cases where several labels are assigned to each datum, as some suitable labels may be missed. The missing labels lead to inaccuracies in classification. In this study, we propose a novel graph-based multi-label classifier
-
Feature selection for classifying multi-labeled past events International Journal on Digital Libraries Pub Date : 2020-09-08 Yasunobu Sumikawa, Ryohei Ikejiri
The study and analysis of past events can provide numerous benefits. While event categorization has been previously studied, it usually assigned only one event category to an event. In this study, we focus on multi-label classification for past events, which is a more general and challenging problem than those approached in previous studies. We categorize events into thirteen different types using
-
A crowdsourcing approach to construct mono-lingual plagiarism detection corpus International Journal on Digital Libraries Pub Date : 2020-09-07 Habibollah Asghari, Omid Fatemi, Salar Mohtaj, Heshaam Faili
Plagiarism detection deals with detecting plagiarized fragments among textual documents. The availability of digital documents in online libraries makes plagiarism easier and, on the other hand, makes it easier to detect with automatic plagiarism detection systems. Large scale plagiarism corpora with a wide variety of plagiarism cases are needed to evaluate different detection methods in different languages
-
Thinking digital libraries for preservation as digital cultural heritage: by R to R 4 facet of FAIR principles International Journal on Digital Libraries Pub Date : 2020-08-27 Nicola Barbuti
The Art. 2 of the UE Council conclusions of 21 May 2014 on cultural heritage as a strategic resource for a sustainable Europe (2014/C 183/08) states: “Cultural heritage consists of the resources inherited from the past in all forms and aspects—tangible, intangible and digital (born digital and digitized), including monuments, sites, landscapes, skills, practices, knowledge and expressions of human
-
Citation recommendation: approaches and datasets International Journal on Digital Libraries Pub Date : 2020-08-11 Michael Färber, Adam Jatowt
Citation recommendation describes the task of recommending citations for a given text. Due to the overload of published scientific works in recent years on the one hand, and the need to cite the most appropriate publications when writing scientific texts on the other hand, citation recommendation has emerged as an important research topic. In recent years, several approaches and evaluation data sets
-
Representing quantitative documentation of 3D cultural heritage artefacts with CIDOC CRMdig International Journal on Digital Libraries Pub Date : 2020-08-08 Chiara Eva Catalano, Valentina Vassallo, Sorin Hermon, Michela Spagnuolo
In this paper, we will explore the theme of the documentation of 3D cultural heritage assets, not only as entire artefacts but also including the interesting features of the object from an archaeological perspective. Indeed, the goal is supporting archaeological research and curation, providing a different approach to enrich the documentation of digital resources and their components with corresponding
-
OrgBR-M: a method to assist in organizing bibliographic material based on formal concept analysis—a case study in educational data mining International Journal on Digital Libraries Pub Date : 2020-08-01 Marcos Wander Rodrigues, Luis Enrique Zárate
Conducting a literature review requires a preliminary organization of the available bibliographic material. In this article, we present a novel method called OrgBR-M (method to organize bibliographic references), based on the formal concept analysis theory, to assist in organizing bibliographic material. Our method systematizes the organization of bibliography and proposes metrics to assist
-
A fuzzy-based framework for evaluation of website design quality index International Journal on Digital Libraries Pub Date : 2020-07-29 Satinder Kaur, Sunil Kumar Gupta
An unrecognized significance of the web acts as a driving force for the massive and rapid growth of websites in each domain of social life. For making a successful website, it is necessary for developers to embrace appropriate web testing and evaluation methodology. Some valuable works in the past have striven to appraise the web applications quantitatively. Various parameters have been considered
-
PVAF: an environment for disambiguation of scientific publication venues International Journal on Digital Libraries Pub Date : 2020-07-26 Tiago Antônio Paraizo, Denilson Alves Pereira
A publication venue authority file stores variants of the names of journals and conferences that publish scientific articles. It is useful in the construction of search tools and data disambiguation, and it is of special interest to agencies funding research and evaluating graduate programs, which use the quality of publication venues as a basis for evaluating researchers’ and research groups’ publications
-
Introduction to the focused issue on the 2017 ACM/IEEE-CS Joint Conference on Digital Libraries JCDL 2017 International Journal on Digital Libraries Pub Date : 2020-05-18 Catherine C. Marshall, Ian Milligan, Adam Jatowt
This special issue of International Journal on Digital Libraries (IJDL) brings together a selection of notable papers nominated for the two best paper awards, the Vannevar Bush Best Paper Award and the Best Student Paper Award, at the 2017 ACM/IEEE-CS Joint Conference on Digital Libraries (JCDL 2017). JCDL is an annual international interdisciplinary conference that brings together researchers, developers
-
Current research on theory and practice of digital libraries: best papers from TPDL 2017 International Journal on Digital Libraries Pub Date : 2020-02-14 Giannis Tsakonas, Jaap Kamps
This volume presents a special issue on the 2017 edition of the theory and practice of digital libraries (TPDL) conference, held in Thessaloniki, Greece. We provide a brief overview of TPDL 2017 and introduce the selected papers that make up the rest of this volume. The papers cover different aspects of current digital library research, highlighting the important and multidisciplinary nature of the
-
An analysis and comparison of keyword recommendation methods for scientific data International Journal on Digital Libraries Pub Date : 2020-02-07 Youichi Ishida, Toshiyuki Shimizu, Masatoshi Yoshikawa
To classify and search various kinds of scientific data, it is useful to annotate those data with keywords from a controlled vocabulary. Data providers, such as researchers, annotate their own data with keywords from the provided vocabulary. However, for the selection of suitable keywords, extensive knowledge of both the research domain and the controlled vocabulary is required. Therefore, the annotation
-
Extending the IFLA Library Reference Model for a Brazilian popular music digital library International Journal on Digital Libraries Pub Date : 2020-01-31 Marcos Fragomeni Padron, Fernando William Cruz, Juliana Rocha de Faria Silva
Brazil is recognized as a musical country, with a diverse collection of musical resources served by many digital repositories and music libraries. Historically, those systems are supported by cataloging schemes that are insufficient because they follow standards more focused on the catalog record than on the structure of cataloged works. On the other hand, it is perceived the popularization of multi-entity
-
Towards an ontological cross-disciplinary solution for multidisciplinary data: VI-SEEM data management and the FAIR principles International Journal on Digital Libraries Pub Date : 2020-01-30 Valentina Vassallo, Achille Felicetti
Different scientific communities produce different kinds of datasets that rely on different data descriptions, approaches, and logical organisations. In such an environment, it is essential to establish a knowledge communication framework that can guarantee some fundamentals, such as an inclusive description and documentation of the interdisciplinary digital resources, their long-term preservation
-
Content selection criteria for news multi-video summarization based on human strategies International Journal on Digital Libraries Pub Date : 2020-01-23 Tamires Tessarolli de Souza Barbieri, Rudinei Goularte
In recent years, the volume of multimedia data produced and available for access, notably video content, has increased continuously and quickly. This context has also aggravated the information overload problem: finding content of interest among the huge number of available options. So, efficient schemes for content access are needed. Automatic video summarization is a research field that deals with this
-
Historical document layout analysis using anisotropic diffusion and geometric features International Journal on Digital Libraries Pub Date : 2020-01-23 Galal M. BinMakhashen, Sabri A. Mahmoud
There are several digital libraries worldwide which maintain valuable historical manuscripts. Usually, digital copies of these manuscripts are offered to researchers and readers in raster-image format. These images carry several document degradations that may hinder automatic information retrieval solutions such as manuscript indexing, categorization, retrieval by content, etc. In this paper, we propose
-
The HathiTrust Digital Library’s potential for musicology research International Journal on Digital Libraries Pub Date : 2020-01-23 J. Stephen Downie, Sayan Bhattacharyya, Francesca Giannetti, Eleanor Dickson Koehl, Peter Organisciak
The HathiTrust Digital Library (HTDL) is one of the largest digital libraries in the world, containing seventeen million volumes from the collections of major academic and research libraries. In this paper, we discuss the HTDL’s potential for musicology research by providing a bibliometric analysis of the collection as a whole, and of the music materials in particular. A series of case studies illustrates
-
FAIR data for prehistoric mining archaeology International Journal on Digital Libraries Pub Date : 2020-01-23 Gerald Hiebel, Gert Goldenberg, Caroline Grutsch, Klaus Hanke, Markus Staudt
This paper presents an approach to creating FAIR data for prehistoric mining archaeology, based on the CIDOC CRM ontology and semantic web standards. The interdisciplinary Research Centre HiMAT (History of mining activities in the Tyrol and adjacent areas, University of Innsbruck) investigates mining history from prehistoric to modern times with an interdisciplinary approach. One of the projects
-
A fuzzy approach to evaluate the attributions reliability in the archaeological sources International Journal on Digital Libraries Pub Date : 2020-01-21 Marianna Figuera
This paper presents a case study of data management and processing of archaeological information through a relational database. The unusual typology of the ‘small finds’ that were archaeologically analyzed and the specific history of the excavations at Phaistos and Ayia Triada (Crete, Greece) prompted our consideration of issues regarding data integrity. We sought to address the problem surrounding
-
Introduction to the focused issue on the 2016 ACM/IEEE-CS Joint Conference on Digital Libraries JCDL 2016 International Journal on Digital Libraries Pub Date : 2019-11-12 Richard Furuta, Michele C. Weigle
We are pleased in this issue to present extended and enhanced versions of four award-nominated papers from the 2016 ACM/IEEE-CS Joint Conference on Digital Libraries (JCDL ’16). JCDL ’16 was held on the campus of Rutgers University–Newark and is the sixteenth in the series of conferences jointly sponsored by the Association for Computing Machinery and the IEEE Computer Society. The location is particularly
-
A way to express the reliability of archaeological data: data traceability at the Laboratoire Archéologie et Territoires (Tours, France) International Journal on Digital Libraries Pub Date : 2019-08-16 Olivier Marlet, Xavier Rodier
In order to respect the good practices in archaeology disseminated by the MASA Consortium (Archaeologists and Archaeological Sites Memories), the Laboratoire Archéologie et Territoires (Tours, France) wished to evaluate the progress of ArSol database (Soil Archives), its field data management database, with regard to the FAIR principles or the Five-Star Linked Open Data. The work undertaken to achieve
-
Identification of tweets that mention books International Journal on Digital Libraries Pub Date : 2019-08-05 Shuntaro Yada, Kyo Kageura, Cecile Paris
We address the task of identifying tweets that mention books from amongst tweets that contain the same strings as book titles. Assuming the existence of a comprehensive list of book titles, this task can be defined as text classification targeting tweets that contain the same string as book titles. In carrying out the task, we need to exclude two types of tweets. The first is automatically posted,
-
Assessing the quality of answers autonomously in community question–answering International Journal on Digital Libraries Pub Date : 2019-08-05 Long T. Le, Chirag Shah, Erik Choi
Community question–answering (CQA) has become a popular method of online information seeking. Within these services, peers ask questions and create answers to those questions. For some time, content repositories created through CQA sites have widely supported general-purpose tasks; however, they can also be used as online digital libraries that satisfy specific needs related to education. Horizontal
-
Heritage Science and Cultural Heritage: standards and tools for establishing cross-domain data interoperability International Journal on Digital Libraries Pub Date : 2019-08-03 Lisa Castelli, Achille Felicetti, Fabio Proietti
This paper describes a system for documenting scientific data produced in Heritage Sciences. The system is built around a general meta-model, flexible enough to provide descriptions, in a formal language, of the datasets produced by scientific research. Resulting metadata can be re-encoded and published in multiple formats. The underlying metadata schema is inspired by CIDOC CRM principles for data
-
Choice overload and recommendation effectiveness in related-article recommendations International Journal on Digital Libraries Pub Date : 2019-05-27 Felix Beierle, Akiko Aizawa, Andrew Collins, Joeran Beel
Choice overload describes a situation in which a person has difficulty in making decisions due to too many options. We examine choice overload when displaying related-article recommendations in digital libraries, and examine the effectiveness of recommendation algorithms in this domain. We first analyzed existing digital libraries, and found that only 30% of digital libraries show related-article recommendations
-
Improving semantic change analysis by combining word embeddings and word frequencies International Journal on Digital Libraries Pub Date : 2019-05-20 Adrian Englhardt, Jens Willkomm, Martin Schäler, Klemens Böhm
Language is constantly evolving. As part of diachronic linguistics, semantic change analysis examines how the meanings of words evolve over time. Such semantic awareness is important to retrieve content from digital libraries. Recent research on semantic change analysis relying on word embeddings has yielded significant improvements over previous work. However, a recent, but somewhat neglected observation
-
Expressiveness and machine processability of Knowledge Organization Systems (KOS): an analysis of concepts and relations International Journal on Digital Libraries Pub Date : 2019-04-12 Manolis Peponakis, Anna Mastora, Sarantos Kapidakis, Martin Doerr
This study considers the expressiveness (that is, the expressive power or expressivity) of different types of Knowledge Organization Systems (KOS) and discusses its potential to be machine-processable in the context of the semantic web. For this purpose, the theoretical foundations of KOS are reviewed based on conceptualizations introduced by the Functional Requirements for Subject Authority
-
Curating and annotating a collection of traditional Irish flute recordings to facilitate stylistic analysis International Journal on Digital Libraries Pub Date : 2019-02-23 Münevver Köküer, Islah Ali-MacLachlan, Daithí Kearney, Peter Jančovič
This paper presents the curation and annotation of a collection of traditional Irish flute recordings to facilitate the analysis of stylistic characteristics. We introduce the structure of Irish tunes, types of tunes and the ornamentation, which is a decisive stylistic determinant in Irish traditional music. We identify seminal recordings of prominent flute players and provide information related to
-
Guest editors’ introduction to the special issue on digital libraries for musicology International Journal on Digital Libraries Pub Date : 2019-02-23 Kevin R. Page, J. Stephen Downie
Many digital libraries have long offered facilities to provide multimedia content, including music. However, there is now an ever more urgent need to specifically support the distinct multiple forms of music, the links between them, and the surrounding scholarly context, as required by the transformed and extended computational methods being applied to musicology and the wider digital humanities. These
-
Introduction to the focused issue on the 20th International Conference on Theory and Practice of Digital Libraries (TPDL 2016) International Journal on Digital Libraries Pub Date : 2019-02-01 Norbert Fuhr, László Kovács, Thomas Risse, Wolfgang Nejdl
Valuable and rapidly increasing volumes of data are created or transformed into digital form by all fields of scientific, educational, cultural and governmental and industry activities. For this purpose, the digital libraries community has developed long-term and interdisciplinary research agendas, providing significant results, such as development of digital libraries, solving practical problems,
-
A Wikidata-based tool for building and visualising narratives International Journal on Digital Libraries Pub Date : 2019-01-30 Daniele Metilli, Valentina Bartalesi, Carlo Meghini
In this paper we present a semi-automatic tool for constructing and visualising narratives, intended as networks of events related to each other by semantic relations. The tool obeys an ontology for narratives that we developed. It retrieves and assigns internationalised resource identifiers to the instances of the classes of the ontology using Wikidata as an external knowledge base and also facilitates
-
An MEI-based standard encoding for hierarchical music analyses International Journal on Digital Libraries Pub Date : 2018-12-08 David Rizo, Alan Marsden
We propose a standard representation for hierarchical musical analyses as an extension to the Music Encoding Initiative (MEI) representation for music. Analyses of music need to be represented in digital form for the same reasons as music: preservation, sharing of data, data linking, and digital processing. Systems exist for representing sequential information, but many music analyses are hierarchical
-
A framework for modelling and visualizing the US Constitutional Convention of 1787 International Journal on Digital Libraries Pub Date : 2018-11-26 Nicholas Cole, Alfie Abdul-Rahman, Grace Mallon
This paper describes a new approach to the presentation of records relating to formal negotiations and the texts that they create. It describes the architecture of a model, platform, and web interface ( https://www.quillproject.net ) that can be used by domain experts to convert the records typical of formal negotiations into a model of decision-making (with minimal training). This model has implications
-
Recent applications of Knowledge Organization Systems: introduction to a special issue International Journal on Digital Libraries Pub Date : 2018-11-21 Koraljka Golub, Rudi Schmiede, Douglas Tudhope
Knowledge Organization Systems (KOS), in the form of classification systems, thesauri, lexical databases, ontologies, gazetteers, and taxonomies, more than ever play a crucial role in digital information management and applications generally. Carrying semantics in a well-controlled and documented way, Knowledge Organization Systems serve a variety of important functions such as: tools for representation
-
An empirically validated, onomasiologically structured, and linguistically motivated online terminology International Journal on Digital Libraries Pub Date : 2018-11-17 Karolina Suchowolec, Christian Lang, Roman Schneider
Terminological resources play a central role in the organization and retrieval of scientific texts. Both simple keyword lists and advanced modelings of relationships between terminological concepts can make a most valuable contribution to the analysis, classification, and finding of appropriate digital documents, either on the web or within local repositories. This seems especially true for long-established
-
A pragmatic approach to hierarchical categorization of research expertise in the presence of scarce information International Journal on Digital Libraries Pub Date : 2018-11-16 Gustavo Oliveira de Siqueira, Sérgio Canuto, Marcos André Gonçalves, Alberto H. F. Laender
Throughout the history of science, different knowledge areas have collaborated to overcome major research challenges. The task of associating a researcher with such areas makes a series of tasks feasible such as the organization of digital repositories, expertise recommendation and the formation of research groups for complex problems. In this article, we propose a simple yet effective automatic classification
-
Automated identification of media bias in news articles: an interdisciplinary literature review International Journal on Digital Libraries Pub Date : 2018-11-16 Felix Hamborg, Karsten Donnay, Bela Gipp
Media bias, i.e., slanted news coverage, can strongly impact the public perception of the reported topics. In the social sciences, research over the past decades has developed comprehensive models to describe media bias and effective, yet often manual and thus cumbersome, methods for analysis. In contrast, in computer science fast, automated, and scalable methods are available, but few approaches systematically
-
Anatomy of scholarly information behavior patterns in the wake of academic social media platforms International Journal on Digital Libraries Pub Date : 2018-11-03 Hamed Alhoori, Mohammed Samaka, Richard Furuta, Edward A. Fox
As more scholarly content is born digital or converted to a digital format, digital libraries are becoming increasingly vital to researchers seeking to leverage scholarly big data for scientific discovery. Although scholarly products are available in abundance—especially in environments created by the advent of social networking services—little is known about international scholarly information needs
-
Assessing plausibility of scientific claims to support high-quality content in digital collections International Journal on Digital Libraries Pub Date : 2018-10-28 José María González Pinto, Wolf-Tilo Balke
This paper presents a formalization and extension of a novel approach to support high-quality content in digital libraries. Building on the concept of plausibility used in cognitive sciences, we aim at judging the plausibility of new scientific papers in light of prior knowledge. In particular, our work proposes a novel assessment of scientific papers to qualitatively support the work of reviewers
-
Towards extracting event-centric collections from Web archives International Journal on Digital Libraries Pub Date : 2018-10-27 Gerhard Gossen, Thomas Risse, Elena Demidova
Web archives constitute an increasingly important source of information for computer scientists, humanities researchers and journalists interested in studying past events. However, currently there are no access methods that help Web archive users to efficiently access event-centric information in large-scale archives that go beyond the retrieval of individual disconnected documents. In this article
-
Cultural heritage metadata aggregation using web technologies: IIIF, Sitemaps and Schema.org International Journal on Digital Libraries Pub Date : 2018-10-26 Nuno Freire, Glen Robson, John B. Howard, Hugo Manguinhas, Antoine Isaac
In the World Wide Web, a very large number of resources are made available through digital libraries. We (Europeana and data providers) report on case studies that tested the application of some of the most promising Web technologies, exploring several solutions based on the International Image Interoperability Framework (IIIF) and Sitemaps. We also describe an analysis of the Schema.org vocabulary
-
Tracking the history and evolution of entities: entity-centric temporal analysis of large social media archives International Journal on Digital Libraries Pub Date : 2018-10-26 Pavlos Fafalios, Vasileios Iosifidis, Kostas Stefanidis, Eirini Ntoutsi
How did the popularity of the Greek Prime Minister evolve in 2015? How did the predominant sentiment about him vary during that period? Were there any controversial sub-periods? What other entities were related to him during these periods? To answer these questions, one needs to analyze archived documents and data about the query entities, such as old news articles or social media archives. In particular
-
Designing an ontology for managing the diets of hypertensive individuals International Journal on Digital Libraries Pub Date : 2018-10-22 Julaine Clunis
This paper describes the development of an ontology which could act as a recommendation system for hypertensive individuals. The author has conceptualized and developed an ontology which describes recipes, nutrients in foods and the interactions between nutrients and prescribed drugs, disease and general health. The paper begins with a review of the literature on several ontology designs. The previous
-
From subtitles to substantial metadata: examining characteristics of named entities and their role in indexing International Journal on Digital Libraries Pub Date : 2018-10-16 Anne-Stine Ruud Husevåg
This paper explores the possible role of named entities extracted from text in subtitles in automatic indexing of TV programs. This is done by analyzing entity types, name density and name frequencies in subtitles and metadata records from different genres of TV programs. The name density in metadata records is much higher than the name density in subtitles, and named entities with high frequencies
-
Heuristic and supervised approaches to handwritten annotation extraction for musical score images International Journal on Digital Libraries Pub Date : 2018-07-11 Eamonn Bell, Laurent Pugin
Performers’ copies of musical scores are typically rich in handwritten annotations, which capture historical and institutional performance practices. The development of interactive interfaces to explore digital archives of these scores and the systematic investigation of their meaning and function will be facilitated by the automatic extraction of handwritten score annotations. We present several approaches
-
Image libraries and their scholarly use in the field of art and architectural history International Journal on Digital Libraries Pub Date : 2018-07-07 Sander Münster, Christina Kamposiori, Kristina Friedrichs, Cindy Kröber
The use of image libraries in the field of art and architectural history has been the subject of numerous research studies over the years. However, since previous investigations have focused, primarily, either on user behavior or reviewed repositories, our aim is to bring together both approaches. Against this background, this paper identifies the main characteristics of research and information
-
Characterising online museum users: a study of the National Museums Liverpool museum website International Journal on Digital Libraries Pub Date : 2018-07-05 David Walsh, Mark M. Hall, Paul Clough, Jonathan Foster
Museums are increasing access to their collections and providing richer user experiences via web-based interfaces. However, they are seeing high numbers of users looking at only one or two pages within 10 s and then leaving. To reduce this rate, a better understanding of the type of user who visits a museum website is required. Existing models for museum website users tend to focus on groups that are
-
Building and querying semantic layers for web archives (extended version) International Journal on Digital Libraries Pub Date : 2018-07-05 Pavlos Fafalios, Helge Holzmann, Vaibhav Kasturia, Wolfgang Nejdl
Web archiving is the process of collecting portions of the Web to ensure that the information is preserved for future exploitation. However, despite the increasing number of web archives worldwide, the absence of efficient and meaningful exploration methods still remains a major hurdle in the way of turning them into a usable and useful information source. In this paper, we focus on this problem and
-
Open information extraction as an intermediate semantic structure for Persian text summarization International Journal on Digital Libraries Pub Date : 2018-06-28 Mahmoud Rahat, Alireza Talebpour
Semantic applications typically exploit structures such as dependency parse trees, phrase-chunking, semantic role labeling or open information extraction. In this paper, we introduce a novel application of Open IE as an intermediate layer for text summarization. Text summarization is an important method for providing relevant information in large digital libraries. Open IE is referred to the process
-
Toward comprehensive event collections International Journal on Digital Libraries Pub Date : 2018-06-22 Federico Nanni, Simone Paolo Ponzetto, Laura Dietz
Web archives, such as the Internet Archive, preserve an unprecedented abundance of materials regarding major events and transformations in our society. In this paper, we present an approach for building event-centric sub-collections from such large archives, which includes not only the core documents related to the event itself but, even more importantly, documents describing related aspects (e.g.
-
On the effectiveness of the scientific peer-review system: a case study of the Journal of High Energy Physics International Journal on Digital Libraries Pub Date : 2018-06-15 Sandipan Sikdar, Paras Tehria, Matteo Marsili, Niloy Ganguly, Animesh Mukherjee
The importance and the need for the peer-review system is highly debated in the academic community, and recently there has been a growing consensus to completely get rid of it. This is one of the steps in the publication pipeline that usually requires the publishing house to invest a significant portion of their budget in order to ensure quality editing and reviewing of the submissions received. Therefore
-
Analyzing the network structure and gender differences among the members of the Networked Knowledge Organization Systems (NKOS) community International Journal on Digital Libraries Pub Date : 2018-06-14 Fariba Karimi, Philipp Mayr, Fakhri Momeni
In this paper, we analyze a major part of the research output of the Networked Knowledge Organization Systems (NKOS) community in the period 2000–2016 from a network analytical perspective. We focus on the papers presented at the European and US NKOS workshops and in addition four special issues on NKOS in the last 16 years. For this purpose, we have generated an open dataset, the “NKOS bibliography”
-
Promoting user engagement with digital cultural heritage collections International Journal on Digital Libraries Pub Date : 2018-06-11 Maristella Agosti, Nicola Orio, Chiara Ponchia
In the context of cooperating in a project whose central aim has been the production of a corpus agnostic research environment supporting access to and exploitation of digital cultural heritage collections, we have worked towards promoting user engagement with the collections. The aim of this paper is to present the methods and the solutions that have been envisaged and implemented to engage a diversified
-
Knowledge Organization Systems (KOS) in the Semantic Web: a multi-dimensional review International Journal on Digital Libraries Pub Date : 2018-05-25 Marcia Lei Zeng, Philipp Mayr
Since the Simple Knowledge Organization System (SKOS) specification and its SKOS eXtension for Labels (SKOS-XL) became formal W3C recommendations in 2009, a significant number of conventional Knowledge Organization Systems (KOS) (including thesauri, classification schemes, name authorities, and lists of codes and terms, produced before the arrival of the ontology-wave) have made their journeys to join
-
Neural ParsCit: a deep learning-based reference string parser International Journal on Digital Libraries Pub Date : 2018-05-19 Animesh Prasad, Manpreet Kaur, Min-Yen Kan
We present a deep learning approach for the core digital libraries task of parsing bibliographic reference strings. We deploy the state-of-the-art long short-term memory (LSTM) neural network architecture, a variant of a recurrent neural network to capture long-range dependencies in reference strings. We explore word embeddings and character-based word embeddings as an alternative to handcrafted features
-
Bias-aware news analysis using matrix-based news aggregation International Journal on Digital Libraries Pub Date : 2018-05-18 Felix Hamborg, Norman Meuschke, Bela Gipp
Media bias describes differences in the content or presentation of news. It is a ubiquitous phenomenon in news coverage that can have severely negative effects on individuals and society. Identifying media bias is a challenging problem, for which current information systems offer little support. News aggregators are the most important class of systems to support users in coping with the large amount
Contents have been reproduced by permission of the publishers.