publications: - title: "Libraries and the Long Tail. Some Thoughts about Libraries in a Network Age" author: - name: "Lorcan Dempsey" link: "http://www.oclc.org/research/people/dempsey.htm" year: "2006" month: "April" doi: "http://www.dlib.org/dlib/april06/dempsey/04dempsey.html" links: doi: "http://www.dlib.org/dlib/april06/dempsey/04dempsey.html" tags: - "digital library" - "digital libraries" researchr: "https://researchr.org/publication/Dempsey%3Adlib%3A2006" cites: 0 citedby: 0 journal: "dlib" volume: "12" number: "4" kind: "article" key: "Dempsey:dlib:2006" - title: "Regret-based online ranking for a growing digital library" author: - name: "Erick Delage" link: "https://researchr.org/alias/erick-delage" year: "2009" doi: "http://doi.acm.org/10.1145/1557019.1557050" abstract: "The most common environment in which ranking is used takes a very specific form. Users sequentially generate queries in a digital library. For each query, ranking is applied to order a set of relevant items from which the user selects his favorite. This is the case when ranking search results for pages on the World Wide Web or for merchandize on an e-commerce site. In this work, we present a new online ranking algorithm, called NoRegret KLRank. Our algorithm is designed to use \"clickthrough\" information as it is provided by the users to improve future ranking decisions. More importantly, we show that its long term average performance will converge to the best rate achievable by any competing fixed ranking policy selected with the benefit of hindsight. We show how to ensure that this property continues to hold as new items are added to the set thus requiring a richer class of ranking policies. Finally, our empirical results show that, while in some context NoRegret KLRank might be considered conservative, a greedy variant of this algorithm actually outperforms many popular ranking algorithms. " links: doi: "http://doi.acm.org/10.1145/1557019.1557050" tags: - "empirical" - "ranking algorithm" - "design science" - "rule-based" - "digital library" - "web science" - "recommendation" - "online ranking" - "digital libraries" - "e-science" - "context-aware" - "Meta-Environment" - "search" - "recommendation algorithm" researchr: "https://researchr.org/publication/Delage09" cites: 0 citedby: 0 pages: "229-238" booktitle: "kdd" kind: "inproceedings" key: "Delage09" - title: "Collaborative Systems: Solving the Vocabulary Problem" author: - name: "Hsinchun Chen" link: "https://researchr.org/alias/hsinchun-chen" year: "1994" researchr: "https://researchr.org/publication/Chen94%3A18" cites: 0 citedby: 0 journal: "Computer" volume: "27" number: "5" pages: "58-66" kind: "article" key: "Chen94:18" - title: "A framework for describing web repositories" author: - name: "Frank McCown" link: "https://researchr.org/alias/frank-mccown" - name: "Michael L. Nelson" link: "https://researchr.org/alias/michael-l.-nelson" year: "2009" doi: "http://doi.acm.org/10.1145/1555400.1555456" abstract: "In prior work we have demonstrated that search engine caches and archiving projects like the Internet Archive's Wayback Machine can be used to \"lazily preserve\" website and reconstruct them when they are lost. We use the term \"web repositories\" for collections of automatically refreshed and migrated content, and collectively we refer to these repositories as the \"web infrastructure\". In this paper we present a framework for describing web repositories and the status of web resources in them. This includes an abstract API for web repository interaction, the concepts of deep vs. flat and light/dark/grey repositories and terminology of describing the recoverability of a web resource. Our API may serve as a foundation for future web repository interfaces." links: doi: "http://doi.acm.org/10.1145/1555400.1555456" tags: - "laziness" - "caching" - "web caching" - "search" - "abstract machine" researchr: "https://researchr.org/publication/McCownN09a" cites: 0 citedby: 0 pages: "341-344" booktitle: "JCDL" kind: "inproceedings" key: "McCownN09a" - title: "Whetting the appetite of scientists: producing summaries tailored to the citation context" author: - name: "Stephen Wan" link: "https://researchr.org/alias/stephen-wan" - name: "Cécile Paris" link: "https://researchr.org/alias/c%C3%A9cile-paris" - name: "Robert Dale" link: "https://researchr.org/alias/robert-dale" year: "2009" doi: "http://doi.acm.org/10.1145/1555400.1555410" abstract: "The amount of scientific material available electronically is forever increasing. This makes reading the published literature, whether to stay up-to-date on a topic or to get up to speed on a new topic, a difficult task. Yet, this is an activity in which all researchers must be engaged on a regular basis. Based on a user requirements analysis, we developed a new research tool, called the Citation-Sensitive In-Browser Summariser (CSIBS), which supports researchers in this browsing task. CSIBS enables readers to obtain information about a citation at the point at which they encounter it. This information is aimed at enabling the reader to determine whether or not to invest the time in exploring the cited article further, thus alleviating information overload. CSIBS builds a summary of the cited document, bringing together meta-data about the document and a citation-sensitive preview that exploits the citation context to retrieve the sentences from the cited document that are relevant at this point. This paper briefly presents our user requirements analysis, then describes the system and, finally, discusses the observations from an initial pilot study. We found that CSIBS facilitates the relevancy judgment task, by increasing the users' self-reported confidence in making such judgements." links: doi: "http://doi.acm.org/10.1145/1555400.1555410" tags: - "rule-based" - "meta-model" - "analysis" - "data-flow" - "context-aware" - "Meta-Environment" - "data-flow analysis" - "meta-objects" researchr: "https://researchr.org/publication/WanPD09" cites: 0 citedby: 0 pages: "59-68" booktitle: "JCDL" kind: "inproceedings" key: "WanPD09" - title: "The Arrowsmith Project: 2005 Status Report" author: - name: "Neil R. Smalheiser" link: "https://researchr.org/alias/neil-r.-smalheiser" year: "2005" doi: "http://dx.doi.org/10.1007/11563983_5" abstract: "In the 1980s, Don Swanson proposed the concept of “undiscovered public knowledge,” and published several examples in which two disparate literatures (i.e., sets of articles having no papers in common, no authors in common, and few cross-citations) nevertheless held complementary pieces of knowledge that, when brought together, made compelling and testable predictions about potential therapies for human disorders. In the 1990s, Don and I published more predictions together and created a computer-assisted search strategy (“Arrowsmith”). At first, the so-called one-node search was emphasized, in which one begins with a single literature (e.g., that dealing with a disease) and searches for a second unknown literature having complementary knowledge (e.g. that dealing with potential therapies). However, we soon realized that the two-node search is better aligned to the information practices of most biomedical investigators: in this case, the user chooses two literatures and then seeks to identify meaningful links between them. Could typical biomedical investigators learn to carry out Arrowsmith analyses? Would they find routine occasions for using such a sophisticated tool? Would they uncover significant links that affect their experiments? Four years ago, we initiated a project to answer these questions, working with several neuroscience field testers. Initially we expected that investigators would spend several days learning how to carry out searches, and would spend several days analyzing each search. Instead, we completely re-designed the user interface, the back-end databases, and the methods of processing linking terms, so that investigators could use Arrowsmith without any tutorial at all, and requiring only minutes to carry out a search. The Arrowsmith Project now hosts a suite of free, public tools. It has launched new research spanning medical informatics, genomics and social informatics, and has, indeed, assisted investigators in formulating new experiments, with direct impact on basic science and neurological diseases. " links: doi: "http://dx.doi.org/10.1007/11563983_5" "arrowsmith": "http://arrowsmith.psych.uic.edu/arrowsmith_uic/index.html" tags: - "design science" - "design research" - "testing" - "e-science" - "social" - "search" researchr: "https://researchr.org/publication/Smalheiser05" cites: 0 citedby: 0 pages: "26-43" booktitle: "dis" kind: "inproceedings" key: "Smalheiser05" - title: "Equity for Open-Access Journal Publishing" author: - name: "Stuart M. Shieber" link: "http://www.eecs.harvard.edu/shieber/" year: "2009" month: "08" doi: "10.1371/journal.pbio.1000165" abstract: "Open-access journals, which provide access to their scholarly articles freely and without limitations, are at a systematic disadvantage relative to traditional closed-access journal publishing and its subscription-based business model. A simple, cost-effective remedy to this inequity could put open-access publishing on a path to become a sustainable, efficient system." links: "url": "http://dx.doi.org/10.1371%2Fjournal.pbio.1000165" tags: - "rule-based" - "open-access publishing" - "meta-model" - "source-to-source" - "Meta-Environment" - "systematic-approach" - "open-source" researchr: "https://researchr.org/publication/Shieber%3A2009" cites: 0 citedby: 0 journal: "PLoS Biol" volume: "7" number: "8" kind: "article" key: "Shieber:2009" - title: "HT06, tagging paper, taxonomy, Flickr, academic article, to read" author: - name: "Cameron Marlow" link: "https://researchr.org/alias/cameron-marlow" - name: "Mor Naaman" link: "https://researchr.org/alias/mor-naaman" - name: "Danah Boyd" link: "http://www.danah.org/" - name: "Marc Davis" link: "https://researchr.org/alias/marc-davis" year: "2006" doi: "http://doi.acm.org/10.1145/1149941.1149949" links: doi: "http://doi.acm.org/10.1145/1149941.1149949" tags: - "tagging" - "flickr" - "taxonomy" researchr: "https://researchr.org/publication/MarlowNBD06" cites: 0 citedby: 0 pages: "31-40" booktitle: "ht" kind: "inproceedings" key: "MarlowNBD06" - title: "TagFusion: A System for Integration and Leveraging of Collaborative Tags " author: - name: "Milan Stankovic" link: "http://www.milanstankovic.org/index.html" - name: "Jelena Jovanović" link: "https://researchr.org/alias/jelena-jovanovi%C4%87" year: "2010" month: "January" doi: "http://dx.doi.org/10.1007/978-1-4419-1219-0_1" abstract: "An ever increasing number of Web sites that allow users to easily collaborate, produce, and share content and interact have turned the Web into a more dynamic social place often referred to as Social Web. One form of Social Web sites is collaborative tagging systems that allow their users to annotate Web resources using tags, thus intuitively organizing them and making them easily findable. Tags would have been much more useful if collaborative tagging systems had been collaborating and allowing for integration of their tagging metadata. In this paper we address this issue of lack of collaboration by suggesting an approach for achieving that collaboration. We also present a concrete system called TagFusion that we developed to test the feasibility of the suggested approach. Special attention has been given to the strategies for attracting collaborative tagging systems to integrate using TagFusion, as well as to the possibility of involving artificial entities in the annotation process in accordance with the Semantic Web vision. " links: doi: "http://dx.doi.org/10.1007/978-1-4419-1219-0_1" "tagfusion": "http://www.milanstankovic.org/tagfusion/" tags: - "folksonomy" - "tagging" - "social web" - "testing" - "collaborative tagging" - "social" - "systematic-approach" - "semantic web" researchr: "https://researchr.org/publication/Stankovic-2010" cites: 0 citedby: 0 journal: "Annals of Information Systems" volume: "6" kind: "article" key: "Stankovic-2010" - title: "Research Library: A New Look of Academic Digital Libraries" author: - name: "Aditya M. Bhide" link: "https://researchr.org/alias/aditya-m.-bhide" - name: "Yoo Jae Heung" link: "https://researchr.org/alias/yoo-jae-heung" - name: "Choi Mun Kee" link: "https://researchr.org/alias/choi-mun-kee" year: "2007" doi: "http://doi.ieeecomputersociety.org/10.1109/ICIW.2007.54" abstract: "Information technology has leverage the efficiency of research activities in many aspects. There is a dramatic increment in the numbers of academic digital libraries these days; but it is observed that most of them are not exploited to their full usage. We propose a web application in form of research library where one can customize research resources to fit their needs. Research library is a specialized academic digital library which also enables researchers to create a virtual space where they can integrate their own research resources. This paper introduces user interface prototype which mainly differentiates such research library from any other current web applications or libraries. The light is also thrown on its key features. The research library can be viewed as a new look of academic digital library which is treated as noisy environment most of the time by researchers. " links: doi: "http://doi.ieeecomputersociety.org/10.1109/ICIW.2007.54" tags: - "academic digital library" - "digital library" - "digital libraries" - "web applications" - "Meta-Environment" - "incremental" researchr: "https://researchr.org/publication/BhideHK07" cites: 0 citedby: 0 pages: "57" booktitle: "aict" kind: "inproceedings" key: "BhideHK07" - title: "Defrosting the Digital Library: Bibliographic Tools for the Next Generation Web" author: - name: "Duncan Hull" link: "http://www.cs.manchester.ac.uk/~hulld/" - name: "Steve R. Pettifer" link: "http://aig.cs.man.ac.uk/people/srp/" - name: "Douglas B. Kell" link: "http://dbkgroup.org/dbk.htm" year: "2008" month: "10" doi: "http://dx.doi.org/10.1371/journal.pcbi.1000204" abstract: "Many scientists now manage the bulk of their bibliographic information electronically, thereby organizing their publications and citation material from digital libraries. However, a library has been described as `thought in cold storage,' and unfortunately many digital libraries can be cold, impersonal, isolated, and inaccessible places. In this Review, we discuss the current chilly state of digital libraries for the computational biologist, including PubMed, IEEE Xplore, the ACM digital library, ISI Web of Knowledge, Scopus, Citeseer, arXiv, DBLP, and Google Scholar. We illustrate the current process of using these libraries with a typical workflow, and highlight problems with managing data and metadata using URIs. We then examine a range of new applications such as Zotero, Mendeley, Mekentosj Papers, MyNCBI, CiteULike, Connotea, and HubMed that exploit the Web to make these digital libraries more personal, sociable, integrated, and accessible places. We conclude with how these applications may begin to help achieve a digital defrost, and discuss some of the issues that will help or hinder this in terms of making libraries on the Web warmer places in the future, becoming resources that are considerably more useful to both humans and machines." links: doi: "http://dx.doi.org/10.1371/journal.pcbi.1000204" "url": "http://dx.doi.org/10.1371%2Fjournal.pcbi.1000204" tags: - "DBLP" - "bibliography" - "digital library" - "social web" - "bibliographic tools" - "data-flow" - "digital libraries" - "reviewing" - "web applications" - "state machines" - "workflow" - "Google" - "social" researchr: "https://researchr.org/publication/HullPK%3A2008" cites: 0 citedby: 0 journal: "PLoS Comput Biol" volume: "4" number: "10" kind: "article" key: "HullPK:2008" - title: "Collaborative Tagging in Recommender Systems" author: - name: "Ae-Ttie Ji" link: "https://researchr.org/alias/ae-ttie-ji" - name: "Cheol Yeon" link: "https://researchr.org/alias/cheol-yeon" - name: "Heung-Nam Kim" link: "https://researchr.org/alias/heung-nam-kim" - name: "GeunSik Jo" link: "https://researchr.org/alias/geunsik-jo" year: "2007" doi: "http://dx.doi.org/10.1007/978-3-540-76928-6_39" links: doi: "http://dx.doi.org/10.1007/978-3-540-76928-6_39" tags: - "recommender systems" - "tagging" researchr: "https://researchr.org/publication/JiYKJ07%3A0" cites: 0 citedby: 0 pages: "377-386" booktitle: "ausai" kind: "inproceedings" key: "JiYKJ07:0" - title: "Why can’t I manage academic papers like MP3s? The evolution and intent of metadata standards" author: - name: "James Howison" link: "http://james.howison.name/" - name: "Abby Goodrum" link: "https://researchr.org/alias/abby-goodrum" year: "2004" doi: "http://freelancepropaganda.com/archives/MP3vPDF.pdf" abstract: "This paper considers the deceptively simple question: Why can’t downloaded academic papers be managed in the simple and effective manner in which digital music files are managed? We make the case that the answer is different treatments of metadata. Two key differences are identified: Firstly, digital music metadata is standardized and moves with the content file, while academic metadata is not and does not. Secondly digital music metadata lookup services are collaborative and automate the movement from a digital file to the appropriate metadata, while academic metadata services do not. To understand why these differences exist we examine the divergent evolution of metadata standards for digital music and academic papers. It is observed that the processes differ in interesting ways according to their intent. Specifically music metadata was developed primarily for personal file management, while the focus of academic metadata has been on information retrieval. We argue that lessons from MP3 metadata can assist individual academics facing their growing personal document management challenges. Our focus therefore is not on metadata for the academic publishing industry or institutional resource sharing, it is limited to the personal libraries growing on our hard-drives. This bottom-up approach to document management combined with p2p distribution radically altered the music landscape. Might such an approach have a similar impact on academic publishing? This paper outlines plans for improving the personal management of academic papers—doing academic metadata and file management the MP3 way—and considers the likelihood of success." links: doi: "http://freelancepropaganda.com/archives/MP3vPDF.pdf" tags: - "p2p" - "academic digital library" - "information retrieval" - "digital library" - "digital libraries" - "systematic-approach" researchr: "https://researchr.org/publication/HowisonG%3A2004" cites: 0 citedby: 0 booktitle: "Proceedings of the 2004 Colleges, Code and Intellectual Property Conference" kind: "inproceedings" key: "HowisonG:2004" - title: "Amazon.com Recommendations: Item-to-Item Collaborative Filtering" author: - name: "Greg Linden" link: "http://glinden.blogspot.com/" - name: "Brent Smith" link: "http://www.chezsmith.net/misc/resume.php" - name: "Jeremy York" link: "https://researchr.org/alias/jeremy-york" year: "2003" doi: "http://doi.ieeecomputersociety.org/10.1109/MIC.2003.1167344" abstract: "By comparing similar items rather than similar customers, item-to-item collaborative filtering scales to very large data sets and produces high-quality recommendations." links: doi: "http://doi.ieeecomputersociety.org/10.1109/MIC.2003.1167344" tags: - "recommendation algorithms" - "Amazon" - "collaborative filtering" - "data-flow" researchr: "https://researchr.org/publication/LindenSY03%3A0" cites: 0 citedby: 0 journal: "internet" volume: "7" number: "1" pages: "76-80" kind: "article" key: "LindenSY03:0" - title: "Research profiling: Improving the literature review" author: - name: "Alan L. Porter" link: "https://researchr.org/alias/alan-l.-porter" - name: "Alisa Kongthon" link: "https://researchr.org/alias/alisa-kongthon" - name: "Jye-Chyi Lu" link: "https://researchr.org/alias/jye-chyi-lu" year: "2002" doi: "http://dx.doi.org/10.1023/A:1014873029258" abstract: "We propose enhancing the traditional literature review through “research profiling”. This broad scan of contextual literature can extend the span of science by better linking efforts across research domains. Topical relationships, research trends, and complementary capabilities can be discovered, thereby facilitating research projects. Modern search engine and text mining tools enable research profiling by exploiting the wealth of accessible information in electronic abstract databases such as MEDLINE and Science Citation Index. We illustrate the potential by showing sixteen ways that “research profiling” can augment a traditional literature review on the topic of data mining. " links: doi: "http://dx.doi.org/10.1023/A:1014873029258" tags: - "research profiling" - "literature review" - "data-flow" - "reviewing" - "e-science" - "search" researchr: "https://researchr.org/publication/PorterKL%3A2002" cites: 0 citedby: 0 journal: "scientometrics" volume: "53" number: "3" kind: "article" key: "PorterKL:2002" - title: "Some(what) grand challenges for information retrieval" author: - name: "Nicholas J. Belkin" link: "https://researchr.org/alias/nicholas-j.-belkin" year: "2008" doi: "http://doi.acm.org/10.1145/1394251.1394261" abstract: "Although we see the positive results of information retrieval research embodied throughout the Internet, on our computer desktops, and in many other aspects of daily life, at the same time we notice that people still have a wide variety of difficulties in finding information that is useful in resolving their problematic situations. This suggests that there still remain substantial challenges for research in IR. Already in 1988, on the occasion of receiving the ACM SIGIR Gerard Salton Award, Karen Spärck Jones suggested that substantial progress in information retrieval was likely only to come through addressing issues associated with users (actual or potential) of IR systems, rather than continuing IR research's almost exclusive focus on document representation and matching and ranking techniques. In recent years it appears that her message has begun to be heard, yet we still have relatively few substantive results that respond to it. In this paper, I identify and discuss a few challenges for IR research which fall within the scope of association with users, and which I believe, if properly addressed, are likely to lead to substantial increases in the usefulness, usability and pleasurability of information retrieval." links: doi: "http://doi.acm.org/10.1145/1394251.1394261" tags: - "information retrieval" researchr: "https://researchr.org/publication/Belkin08%3A0" cites: 0 citedby: 0 journal: "sigir" volume: "42" number: "1" pages: "47-54" kind: "article" key: "Belkin08:0" - title: "Modeling and Building Personalized Digital Libraries with PIPE and 5SL" author: - name: "Marcos André Gonçalves" link: "http://buscatextual.cnpq.br/buscatextual/visualizacv.jsp?id=K4763169A6" - name: "Ali A. Zafer" link: "https://researchr.org/alias/ali-a.-zafer" - name: "Naren Ramakrishnan" link: "https://researchr.org/alias/naren-ramakrishnan" - name: "Edward A. Fox" link: "http://fox.cs.vt.edu/" year: "2001" doi: "http://www.ercim.org/publication/ws-proceedings/DelNoe02/Goncalves.pdf" links: doi: "http://www.ercim.org/publication/ws-proceedings/DelNoe02/Goncalves.pdf" tags: - "modeling" - "digital library" - "digital libraries" researchr: "https://researchr.org/publication/GoncalvesZRF01" cites: 0 citedby: 0 booktitle: "delos" kind: "inproceedings" key: "GoncalvesZRF01" - title: "An Extensible Virtual Digital Libraries Generator" author: - name: "Massimiliano Assante" link: "https://researchr.org/alias/massimiliano-assante" - name: "Leonardo Candela" link: "http://www.nmis.isti.cnr.it/candela/Leonardo_Candela_Website/Welcome.html" - name: "Donatella Castelli" link: "http://www.isti.cnr.it/php-pers/iselpers.php?Castelli+Donatella" - name: "Luca Frosini" link: "https://researchr.org/alias/luca-frosini" - name: "Lucio Lelii" link: "https://researchr.org/alias/lucio-lelii" - name: "Paolo Manghi" link: "https://researchr.org/alias/paolo-manghi" - name: "Andrea Manzi" link: "https://researchr.org/alias/andrea-manzi" - name: "Pasquale Pagano" link: "https://researchr.org/alias/pasquale-pagano" - name: "Manuele Simi" link: "https://researchr.org/alias/manuele-simi" year: "2008" doi: "http://dx.doi.org/10.1007/978-3-540-87599-4_14" abstract: "In this paper we describe the design and implementation of the VDL Generator, a tool to simplify and automatise the Digital Library development process. In particular, we discuss how our approach to the realisation of this tool simplifies the task of implementing, extending and modifying such a fundamental component. This tool models its issue as a generic search problem that can easily be adapted to different application scenarios. In particular, to guarantee its extensibility we carefully identify, isolate and organise the VDL Generator constituents, i.e. (i) the set of logical components that can be used when designing a Digital Library, (ii) the set of physical components that by implementing the logical components contribute to implement the Digital Library and (iii) the search strategy exploited to accomplish the generation task. Furthermore, we report on the experiences matured in implementing and exploiting such an innovative service in the context of the Diligent EU funded project and discuss future plans for its consolidation. " links: doi: "http://dx.doi.org/10.1007/978-3-540-87599-4_14" tags: - "meta-model" - "digital library" - "model-driven development" - "digital libraries" - "context-aware" - "Meta-Environment" - "search" - "design" - "process modeling" - "systematic-approach" researchr: "https://researchr.org/publication/Assante%3AECDL%3A2008" cites: 0 citedby: 0 pages: "122-134" booktitle: "ercimdl" kind: "inproceedings" key: "Assante:ECDL:2008" - title: "Disambiguating authors in academic publications using random forests" author: - name: "Pucktada Treeratpituk" link: "https://researchr.org/alias/pucktada-treeratpituk" - name: "C. Lee Giles" link: "https://researchr.org/alias/c.-lee-giles" year: "2009" doi: "http://doi.acm.org/10.1145/1555400.1555408" links: doi: "http://doi.acm.org/10.1145/1555400.1555408" tags: - "C++" researchr: "https://researchr.org/publication/TreeratpitukG09" cites: 0 citedby: 0 pages: "39-48" booktitle: "JCDL" kind: "inproceedings" key: "TreeratpitukG09" - title: "Towards a digital library theory: a formal digital library ontology" author: - name: "Marcos André Gonçalves" link: "http://buscatextual.cnpq.br/buscatextual/visualizacv.jsp?id=K4763169A6" - name: "Edward A. Fox" link: "http://fox.cs.vt.edu/" - name: "Layne T. Watson" link: "https://researchr.org/alias/layne-t.-watson" year: "2008" doi: "http://dx.doi.org/10.1007/s00799-008-0033-1" abstract: "Digital libraries (DLs) have eluded definitional consensus and lack agreement on common theories and frameworks. This makes comparison of DLs extremely difficult, promotes ad-hoc development, and impedes interoperability. In this paper we propose a formal ontology for DLs that defines the fundamental concepts, relationships, and axiomatic rules that govern the DL domain, therefore providing a frame of reference for the discussion of essential concepts of DL design and construction. The ontology is an axiomatic, formal treatment of DLs, which distinguishes it from other approaches that informally define a number of architectural variants. The process of construction of the ontology was guided by 5S, a formal framework for digital libraries. To test its expressibility we have used the ontology to create a taxonomy of DL services and to reason about issues of reusability, extensibility, and composability. Some practical applications of the ontology are also described including: the definition of a digital library services taxonomy, the proposal of a modeling language for digital libraries, and the specification of quality metrics to evaluate digital libraries. We also demonstrate how to use the ontology to formally describe DL architectures and to prove some properties about them, thus helping to further validate the ontology. " links: doi: "http://dx.doi.org/10.1007/s00799-008-0033-1" tags: - "ontologies" - "rule-based" - "application framework" - "ontology" - "meta-model" - "modeling language" - "modeling" - "digital library" - "architecture" - "language modeling" - "testing" - "language design" - "reuse" - "model-driven development" - "rules" - "digital libraries" - "Meta-Environment" - "taxonomy" - "design" - "process modeling" - "extensible language" - "systematic-approach" - "domain-specific language" researchr: "https://researchr.org/publication/GoncalvesFW08" cites: 0 citedby: 0 journal: "jodl" volume: "8" number: "2" pages: "91-114" kind: "article" key: "GoncalvesFW08" - title: "Collaborative Filtering by Personality Diagnosis: A Hybrid Memory and Model-Based Approach" author: - name: "David M. Pennock" link: "https://researchr.org/alias/david-m.-pennock" - name: "Eric Horvitz" link: "https://researchr.org/alias/eric-horvitz" - name: "Steve Lawrence" link: "http://research.google.com/pubs/author103.html" - name: "C. Lee Giles" link: "https://researchr.org/alias/c.-lee-giles" year: "2000" doi: "http://rome.exp.sis.pitt.edu/UAI/Abstract.asp?articleID=55&proceedingID=16" abstract: "The growth of Internet commerce has stimulated the use of collaborative filtering (CF) algorithms as recommender systems. Such systems leverage knowledge about the known preferences of multiple users to recommend items of interest to other users. CF methods have been harnessed to make recommendations about such items as web pages, movies, books, and toys. Researchers have proposed and evaluated many approaches for generating recommendations. We describe and evaluate a new method called personality diagnosis (PD). Given a user’s preferences for some items, we compute the probability that he or she is of the same “personality type” as other users, and, in turn, the probability that he or she will like new items. PD retains some of the advantages of traditional similarity-weighting techniques in that all data is brought to bear on each prediction and new data can be added easily and incrementally. Additionally, PD has a meaningful probabilistic interpretation, which may be leveraged to justify, explain, and augment results. We report empirical results on the EachMovie database of movie ratings, and on user profile data collected from the CiteSeer digital library of Computer Science research papers. The probabilistic framework naturally supports a variety of descriptive measurements—in particular, we consider the applicability of a value of information (VOI) computation." links: doi: "http://rome.exp.sis.pitt.edu/UAI/Abstract.asp?articleID=55&proceedingID=16" tags: - "empirical" - "rule-based" - "recommender systems" - "collaborative filtering" - "digital library" - "web science" - "user profiling" - "type system" - "data-flow" - "C++" - "digital libraries" - "e-science" - "information models" - "database" - "incremental" - "systematic-approach" - "recommendation algorithm" researchr: "https://researchr.org/publication/PennockHLG00" cites: 0 citedby: 0 pages: "473-480" booktitle: "uai" kind: "inproceedings" key: "PennockHLG00" - title: "Networked Digital Library of Theses and Dissertations: Bridging the Gaps for Global Access - Part 1: Mission and Progress" author: - name: "Hussein Suleman" link: "https://researchr.org/alias/hussein-suleman" - name: "Anthony Atkins" link: "https://researchr.org/alias/anthony-atkins" - name: "Marcos André Gonçalves" link: "http://buscatextual.cnpq.br/buscatextual/visualizacv.jsp?id=K4763169A6" - name: "Robert K. France" link: "https://researchr.org/alias/robert-k.-france" - name: "Edward A. Fox" link: "http://fox.cs.vt.edu/" - name: "Vinod Chachra" link: "https://researchr.org/alias/vinod-chachra" - name: "Murray Crowder" link: "https://researchr.org/alias/murray-crowder" - name: "Jeffrey Young" link: "https://researchr.org/alias/jeffrey-young" year: "2001" doi: "http://www.dlib.org/dlib/september01/suleman/09suleman-pt1.html" abstract: "The Networked Digital Library of Theses and Dissertations (NDLTD) is a collaborative effort of universities around the world to promote creating, archiving, distributing and accessing Electronic Theses and Dissertations (ETDs). Since its inception in 1996, over a hundred universities have joined the initiative, underscoring the importance institutions place on training their graduates in the emerging forms of digital publishing and information access. The outreach and training mission of NDLTD is an ongoing project so in this article we report on the current status of membership and support activities. Recent research has focused on creating a union database that will provide a means to search and retrieve ETDs from the combined collections of NDLTD member institutions. The Virtua system developed by VTLS will serve as the heart of this union database. In order to bridge the gap between the existing distributed institutional archives and a unified collection of ETDs, we have developed a metadata standard especially suited to ETDs - this is then used by partner sites to export their freely-available metadata using the Metadata Harvesting Protocol of the Open Archives Initiative. We also link name authority information into the metadata records to support unique identification of authors and others associated with the works. Additional research efforts include advanced search mechanisms, semantic interoperability, the design and development of multi- and cross-lingual search systems, and software modules that support the development of higher-level services to aid researchers in seeking relevant ETDs. " links: doi: "http://www.dlib.org/dlib/september01/suleman/09suleman-pt1.html" tags: - "protocol" - "digital library" - "design research" - "source-to-source" - "digital libraries" - "database" - "search" - "design" - "open-source" researchr: "https://researchr.org/publication/SulemanAGFFCCY01" cites: 0 citedby: 0 journal: "dlib" volume: "7" number: "9" kind: "article" key: "SulemanAGFFCCY01" - title: "Annotations in an Academic Digital Library: The Case of Conference Note-Taking and Annotation" author: - name: "Sally Jo Cunningham" link: "https://researchr.org/alias/sally-jo-cunningham" - name: "Chris Knowles" link: "https://researchr.org/alias/chris-knowles" year: "2005" doi: "http://dx.doi.org/10.1007/11599517_8" abstract: "This paper explores the potential usefulness and acceptability of annotation facilities by prospective users of an IT research digital library. We studied current annotation and note-taking behavior of IT researchers (academic and commercial), as exhibited at IT conferences. Here, we examine the implications of this information behavior for the design of annotation tools in a research-oriented digital library. " links: doi: "http://dx.doi.org/10.1007/11599517_8" tags: - "academic digital library" - "digital library" - "design research" - "digital libraries" - "design" researchr: "https://researchr.org/publication/CunninghamK05%3A0" cites: 0 citedby: 0 pages: "62-71" booktitle: "ICADL" kind: "inproceedings" key: "CunninghamK05:0" - title: "DBLP - Some Lessons Learned" author: - name: "Michael Ley" link: "http://www.informatik.uni-trier.de/~ley/" year: "2009" doi: "http://www.vldb.org/pvldb/2/vldb09-98.pdf" abstract: "The DBLP Computer Science Bibliography evolved from an early small experimental Web server to a popular service for the computer science community. Many design decisions and details of the public XML-records behind DBLP never were documented. This paper is a review of the evolution of DBLP. The main perspective is data modeling. In DBLP persons play a central role, our discussion of person names may be applicable to many other data bases. All DBLP data are available for your own experiments. You may either download the complete set, or use a simple XML-based API described in an online appendix." links: doi: "http://www.vldb.org/pvldb/2/vldb09-98.pdf" tags: - "DBLP" - "design science" - "rule-based" - "completeness" - "bibliography" - "meta-model" - "XML" - "XML Schema" - "modeling" - "web service" - "web science" - "data-flow" - "object-role modeling" - "reviewing" - "e-science" - "Meta-Environment" - "design" researchr: "https://researchr.org/publication/Ley%3A2009" cites: 0 citedby: 0 journal: "pvldb" volume: "2" number: "2" kind: "article" key: "Ley:2009" - title: "DelosDLMS. From the DELOS vision to the implementation of a future digital library management system" author: - name: "Yannis E. Ioannidis" link: "https://researchr.org/alias/yannis-e.-ioannidis" - name: "Diego Milano" link: "https://researchr.org/alias/diego-milano" - name: "Hans-Jörg Schek" link: "https://researchr.org/alias/hans-j%C3%B6rg-schek" - name: "Heiko Schuldt" link: "http://dbis.cs.unibas.ch/team/heiko-schuldt/dbis_staff_view" year: "2008" doi: "http://dx.doi.org/10.1007/s00799-008-0044-y" abstract: "DelosDLMS is a novel digital library management system (DLMS) that has been developed as an integration effort within the DELOS Network of Excellence, a European Commission initiative funded under its fifth and sixth framework programs. In this paper, we describe DelosDLMS that takes into account the recommendations of several activities that were initiated by DELOS including the DELOS vision for digital libraries (DLs). A key aspect of DelosDLMS is its novel generic infrastructure that allows the generation of digital library systems out of a set of basic system services and DL services in a modular and extensible way. DL services like feature extraction, visualization, intelligent browsing, media-type-specific indexing, support for multilinguality, relevance feedback and many others can easily be incorporated or replaced. A further key aspect of DelosDLMS is its robustness against failures and its scalability for large collections and many parallel user requests. We discuss the current status of an effort to build DelosDLMS, a Digital Library Management System that integrates in various ways several components developed by DELOS members and showcases a great variety of functionality that is outlined as part of the DELOS vision. " links: doi: "http://dx.doi.org/10.1007/s00799-008-0044-y" tags: - "object-oriented programming" - "generic programming" - "digital library" - "functional programming" - "parallel programming" - "type system" - "digital libraries" - "e-science" - "aspect oriented programming" - "feature-oriented programming" researchr: "https://researchr.org/publication/IoannidisMSS08" cites: 0 citedby: 0 journal: "jodl" volume: "9" number: "2" pages: "101-114" kind: "article" key: "IoannidisMSS08" - title: "5SL: a language for declarative specification and generation of digital libraries" author: - name: "Marcos André Gonçalves" link: "http://buscatextual.cnpq.br/buscatextual/visualizacv.jsp?id=K4763169A6" - name: "Edward A. Fox" link: "http://fox.cs.vt.edu/" year: "2002" doi: "http://doi.acm.org/10.1145/544220.544276" abstract: "Digital libraries (DLs) are among the most complex kinds of information systems, due in part to their intrinsic multi disciplinary nature. Nowadays DLs are built within monolithic, tightly integrated, and generally inflexible systems -- or by assembling disparate components together in an ad-hoc way, with resulting problems in interoperability and adaptability. More importantly, conceptual modeling, requirements analysis, and software engineering approaches are rarely supported, making it extremely difficult to tailor DL content and behavior to the interests, needs, and preferences of particular communities. In this paper, we address these problems. In particular, we present 5SL, a declarative language for specifying and generating domain-specific digital libraries. 5SL is based on the 5S formal theory for digital libraries and enables high-level specification of DLs in five complementary dimensions, including: the kinds of multimedia information the DL supports (Stream Model); how that information is structured and organized (Structural Model); different logical and presentational properties and operations of DL components (Spatial Model); the behavior of the DL (Scenario Model); and the different societies of actors and managers of services that act together to carry out the DL behavior (Societal Model). The practical feasibility of the approach is demonstrated by the presentation of a 5SL digital library generator for the MARIAN digital library system. " links: doi: "http://doi.acm.org/10.1145/544220.544276" tags: - "rule-based" - "software components" - "meta-model" - "modeling language" - "modeling" - "language engineering" - "digital library" - "software language engineering" - "language modeling" - "software component" - "domain analysis" - "analysis" - "requirements engineering" - "software engineering" - "model-driven engineering" - "digital libraries" - "information models" - "Meta-Environment" - "multimedia" - "systematic-approach" - "domain-specific language" researchr: "https://researchr.org/publication/GoncalvesF02" cites: 0 citedby: 0 pages: "263-272" booktitle: "JCDL" kind: "inproceedings" key: "GoncalvesF02" - title: "CiteSeer: An Automatic Citation Indexing System" author: - name: "C. Lee Giles" link: "https://researchr.org/alias/c.-lee-giles" - name: "Kurt D. Bollacker" link: "https://researchr.org/alias/kurt-d.-bollacker" - name: "Steve Lawrence" link: "http://research.google.com/pubs/author103.html" year: "1998" doi: "db/conf/dl/GilesBL98.html" tags: - "automatic citation indexing" - "C++" - "citation indexing" researchr: "https://researchr.org/publication/GilesBL98" cites: 0 citedby: 0 pages: "89-98" booktitle: "DL" kind: "inproceedings" key: "GilesBL98" - title: "The (Digital) Library Environment: Ten Years After" author: - name: "Lorcan Dempsey" link: "http://www.oclc.org/research/people/dempsey.htm" year: "2006" month: "February" doi: "http://www.ariadne.ac.uk/issue46/dempsey/" abstract: "We have recently come through several decennial celebrations: the W3C, the Dublin Core Metadata Initiative, D-Lib Magazine, and now Ariadne. What happened clearly in the mid-nineties was the convergence of the Web with more pervasive network connectivity, and this made our sense of the network as a shared space for research and learning, work and play, a more real and apparently achievable goal. What also emerged - at least in the library and research domains - was a sense that it was also a propitious time for digital libraries to move from niche to central role as part of the information infrastructure of this new shared space. However, the story did not quite develop this way. We have built digital libraries and distributed information systems, but they are not necessarily central. A new information infrastructure has been built, supported by technical development and new business models. The world caught up and moved on. What does this mean for the library and the digital library? In this article I will spend a little time looking at the environment in the early and mid-nineties, but this is really a prelude to thinking about where we are today, and saying something about libraries, digital libraries and related issues in the context of current changes. " links: doi: "http://www.ariadne.ac.uk/issue46/dempsey/" tags: - "meta-model" - "digital library" - "model-driven development" - "object-role modeling" - "digital libraries" - "information models" - "context-aware" - "Meta-Environment" researchr: "https://researchr.org/publication/Dempsey%3AAriadne%3A2006" cites: 0 citedby: 0 journal: "Ariadne" volume: "46" kind: "article" key: "Dempsey:Ariadne:2006" - title: "REFEREE: An Open Framework for Practical Testing of Recommender Systems using ResearchIndex" author: - name: "Dan Cosley" link: "https://researchr.org/alias/dan-cosley" - name: "Steve Lawrence" link: "http://research.google.com/pubs/author103.html" - name: "David M. Pennock" link: "https://researchr.org/alias/david-m.-pennock" year: "2002" doi: "http://www.vldb.org/conf/2002/S03P01.pdf" abstract: "Automated recommendation (e.g., personalized product recommendation on an ecommerce web site) is an increasingly valuable service associated with many databases—typically online retail catalogs and web logs. Currently, a major obstacle for evaluating recommendation algorithms is the lack of any standard, public, real-world testbed appropriate for the task. In an attempt to fill this gap, we have created REFEREE, a framework for building recommender systems using ResearchIndex—a huge online digital library of computer science research papers—so that anyone in the research community can develop, deploy, and evaluate recommender systems relatively easily and quickly. ResearchIndex is in many ways ideal for evaluating recommender systems, especially so-called hybrid recommenders that combine information filtering and collaborative filtering techniques. The documents in the database are associated with a wealth of content information (author, title, abstract, full text) and collaborative information (user behaviors), as well as linkage information via the citation structure. Our framework supports more realistic evaluation metrics that assess user buy-in directly, rather than resorting to offline metrics like prediction accuracy that may have little to do with end user utility. The sheer scale of ResearchIndex (over 500,000 documents with thousands of user accesses per hour) will force algorithm designers to make real-world tradeoffs that consider performance, not just accuracy. We present our own tradeoff decisions in building an example hybrid recommender called PD-Live. The algorithm uses contentbased similarity information to select a set of documents from which to recommend, and collaborative information to rank the documents. PD-Live performs reasonably well compared to other recommenders in ResearchIndex. " links: doi: "http://www.vldb.org/conf/2002/S03P01.pdf" tags: - "deployment" - "recommender systems" - "collaborative filtering" - "digital library" - "web service" - "testing" - "web science" - "source-to-source" - "digital libraries" - "e-science" - "database" - "recommendation algorithm" - "open-source" researchr: "https://researchr.org/publication/CosleyLP02" cites: 0 citedby: 0 pages: "35-46" booktitle: "VLDB" kind: "inproceedings" key: "CosleyLP02" - title: "No bull, no spin: a comparison of tags with other forms of user metadata" author: - name: "Catherine C. Marshall" link: "https://researchr.org/alias/catherine-c.-marshall" year: "2009" doi: "http://doi.acm.org/10.1145/1555400.1555438" abstract: "User-contributed tags have shown promise as a means of indexing multimedia collections by harnessing the combined efforts and enthusiasm of online communities. But tags are only one way of describing multimedia items. In this study, I compare the characteristics of public tags with other forms of descriptive metadata'titles and narrative captions'that users have assigned to a collection of very similar images gathered from the photo-sharing service Flickr. The study shows that tags converge on different descriptions than the other forms of metadata do, and that narrative metadata may be more effective than tags for capturing certain aspects of images that may influence their subsequent retrieval and use. The study also examines how photographers use peoples' names to personalize the different types of metadata and how they tell stories across short sequences of images. The study results are then brought to bear on design recommendations for user tagging tools and automated tagging algorithms and on using photo sharing sites as de facto art and architecture resources." links: doi: "http://doi.acm.org/10.1145/1555400.1555438" tags: - "tagging" - "architecture" - "C++" - "design" - "multimedia" - "recommendation algorithm" researchr: "https://researchr.org/publication/Marshall09" cites: 0 citedby: 0 pages: "241-250" booktitle: "JCDL" kind: "inproceedings" key: "Marshall09" - title: "Service-Oriented Science" author: - name: "Ian Foster" link: "http://www.mcs.anl.gov/about/people_detail.php?id=285" year: "2005" month: "May" doi: "http://dx.doi.org/10.1126/science.1110411" abstract: "New information architectures enable new approaches to publishing and accessing valuable data and programs. So-called service-oriented architectures define standard interfaces and protocols that allow developers to encapsulate information tools as services that clients can access without knowledge of, or control over, their internal workings. Thus, tools formerly accessible only to the specialist can be made available to all; previously manual data-processing and analysis tasks can be automated by having services access services. Such service-oriented approaches to science are already being applied successfully, in some cases at substantial scales, but much more effort is required before these approaches are applied routinely across many disciplines. Grid technologies can accelerate the development and adoption of service-oriented science by enabling a separation of concerns between discipline-specific content and domain-independent software and hardware infrastructure. " links: doi: "http://dx.doi.org/10.1126/science.1110411" tags: - "object-oriented programming" - "service-oriented science" - "program analysis" - "software architecture" - "service-oriented architecture" - "separation of concerns" - "protocol" - "architecture" - "domain analysis" - "analysis" - "data-flow programming" - "data-flow" - "e-science" - "subject-oriented programming" - "access control" - "data-flow analysis" - "data encapsulation" - "systematic-approach" - "feature-oriented programming" researchr: "https://researchr.org/publication/Foster%3Ascience%3A2005" cites: 0 citedby: 0 journal: "science" volume: "308" number: "5723" kind: "article" key: "Foster:science:2005" - title: "researchr.org" author: - name: "Eelco Visser" link: "https://researchr.org/alias/eelco-visser" year: "2009" doi: "http://researchr.org" abstract: "Researchr is a web service for indexing, managing, and sharing bibliographic information of scientific publications for researchers by researchers." links: doi: "http://researchr.org" tags: - "bibliography" - "software" - "digital library" - "web service" - "web services" - "researchr" researchr: "https://researchr.org/publication/Visser%3A2009" cites: 0 citedby: 0 howpublished: "http://researchr.org" kind: "misc" key: "Visser:2009" - title: "Extending the DelosDLMS by the FAST Annotation Service" author: - name: "Maristella Agosti" link: "https://researchr.org/alias/maristella-agosti" - name: "Gert Brettlecker" link: "https://researchr.org/alias/gert-brettlecker" - name: "Nicola Ferro" link: "https://researchr.org/alias/nicola-ferro" - name: "Paola Ranaldi" link: "https://researchr.org/alias/paola-ranaldi" - name: "Heiko Schuldt" link: "http://dbis.cs.unibas.ch/team/heiko-schuldt/dbis_staff_view" year: "2007" tags: - "DELOS" researchr: "https://researchr.org/publication/AgostiBFRS07" cites: 0 citedby: 0 pages: "7-12" booktitle: "ircdl" kind: "inproceedings" key: "AgostiBFRS07" - title: "Managing the Quality of Person Names in DBLP" author: - name: "Patrick Reuther" link: "https://researchr.org/alias/patrick-reuther" - name: "Bernd Walter" link: "https://researchr.org/alias/bernd-walter" - name: "Michael Ley" link: "http://www.informatik.uni-trier.de/~ley/" - name: "Alexander Weber" link: "https://researchr.org/alias/alexander-weber" - name: "Stefan Klink" link: "https://researchr.org/alias/stefan-klink" year: "2006" doi: "http://dx.doi.org/10.1007/11863878_55" links: doi: "http://dx.doi.org/10.1007/11863878_55" tags: - "DBLP" researchr: "https://researchr.org/publication/ReutherWLWK06" cites: 0 citedby: 0 pages: "508-511" booktitle: "ercimdl" kind: "inproceedings" key: "ReutherWLWK06" - title: "Digital Libraries and Autonomous Citation Indexing" author: - name: "Steve Lawrence" link: "http://research.google.com/pubs/author103.html" - name: "C. Lee Giles" link: "https://researchr.org/alias/c.-lee-giles" - name: "Kurt D. Bollacker" link: "https://researchr.org/alias/kurt-d.-bollacker" year: "1999" abstract: "The Web is revolutionizing the way researchers access scientific literature, however scientific literature on the Web is largely disorganized. Autonomous citation indexing can help organize the literature by automating the construction of citation indices. Autonomous citation indexing aims to improve the dissemination and retrieval of scientific literature, and provides improvements in cost, availability, comprehensiveness, efficiency, and timeliness." tags: - "digital library" - "C++" - "digital libraries" - "autonomous citation indexing" - "citation indexing" researchr: "https://researchr.org/publication/LawrenceGB%3Acomputer%3A1999" cites: 0 citedby: 0 journal: "Computer" volume: "32" number: "6" pages: "67-71" kind: "article" key: "LawrenceGB:computer:1999" - title: "Setting the foundations of digital libraries. The DELOS Manifesto" author: - name: "Leonardo Candela" link: "http://www.nmis.isti.cnr.it/candela/Leonardo_Candela_Website/Welcome.html" - name: "Donatella Castelli" link: "http://www.isti.cnr.it/php-pers/iselpers.php?Castelli+Donatella" - name: "Pasquale Pagano" link: "https://researchr.org/alias/pasquale-pagano" - name: "Constantion Thanos" link: "https://researchr.org/alias/constantion-thanos" - name: "Yannis Ioannidis" link: "https://researchr.org/alias/yannis-ioannidis" - name: "Georgia Koutrika" link: "https://researchr.org/alias/georgia-koutrika" - name: "Seamus Ross" link: "https://researchr.org/alias/seamus-ross" - name: "Hans-Joerg Scheck" link: "https://researchr.org/alias/hans-joerg-scheck" - name: "Heiko Schuldt" link: "http://dbis.cs.unibas.ch/team/heiko-schuldt/dbis_staff_view" year: "2007" month: "March/April" doi: "http://dlib.org/dlib/march07/castelli/03castelli.html" abstract: "The term \"Digital Libraries\" corresponds to a very complex notion with several diverse aspects and cannot be captured by a simple definition. A robust model of Digital Libraries encapsulating the richness of these perspectives is required. This need has led to the drafting of The Digital Library Manifesto, the aim of which is to set the foundations and identify the cornerstone concepts within the universe of Digital Libraries, facilitating the integration of research results and proposing better ways of developing appropriate systems. The Manifesto is a result of the collaborative work of members of the European Union co-funded DELOS Network of Excellence on Digital Libraries.1 It exploits the collective understanding that has been acquired, over more than a decade, on Digital Libraries by European research groups active in the Digital Library field, both within DELOS and outside, as well as by other groups around the world. This article presents the core parts of the Manifesto that introduce the entities of discourse of the Digital Library universe. " links: doi: "http://dlib.org/dlib/march07/castelli/03castelli.html" tags: - "meta-model" - "digital library" - "digital libraries" - "Meta-Environment" researchr: "https://researchr.org/publication/DELOS%3AManifesto%3A2007" cites: 0 citedby: 0 journal: "D-Lib Magazine" volume: "13" number: "3/4" kind: "article" key: "DELOS:Manifesto:2007" - title: "Requirements Gathering and Modeling of Domain-Specific Digital Libraries with the 5S Framework: An Archaeological Case Study with ETANA" author: - name: "Rao Shen" link: "https://researchr.org/alias/rao-shen" - name: "Marcos André Gonçalves" link: "http://buscatextual.cnpq.br/buscatextual/visualizacv.jsp?id=K4763169A6" - name: "Weiguo Fan" link: "https://researchr.org/alias/weiguo-fan" - name: "Edward A. Fox" link: "http://fox.cs.vt.edu/" year: "2005" doi: "http://dx.doi.org/10.1007/11551362_1" abstract: "Requirements gathering and conceptual modeling are essential for the customization of digital libraries (DLs), to help attend the needs of target communities. In this paper, we show how to apply the 5S (Streams, Structures, Spaces, Scenarios, and Societies) formal framework to support both tasks. The intuitive nature of the framework allows for easy and systematic requirements analysis, while its formal nature ensures the precision and correctness required for semi-automatic DL generation. Further, we show how 5S can help us define a domain-specific DL metamodel in the field of archaeology. Finally, an archaeological DL case study (from the ETANA project) yields informal and formal descriptions of two DL models (instances of the metamodel). " links: doi: "http://dx.doi.org/10.1007/11551362_1" tags: - "case study" - "meta-model" - "modeling" - "digital library" - "domain analysis" - "analysis" - "digital libraries" - "Meta-Environment" - "systematic-approach" researchr: "https://researchr.org/publication/ShenGFF05" cites: 0 citedby: 0 pages: "1-12" booktitle: "ercimdl" kind: "inproceedings" key: "ShenGFF05" - title: "Visual overviews for discovering key papers and influences across research fronts" author: - name: "Aleks Aris" link: "https://researchr.org/alias/aleks-aris" - name: "Ben Shneiderman" link: "http://www.cs.umd.edu/~ben/" - name: "Vahed Qazvinian" link: "https://researchr.org/alias/vahed-qazvinian" - name: "Dragomir R. Radev" link: "https://researchr.org/alias/dragomir-r.-radev" year: "2009" doi: "http://dx.doi.org/10.1002/asi.21160" abstract: "Gaining a rapid overview of an emerging scientific topic, sometimes called research fronts, is an increasingly common task due to the growing amount of interdisciplinary collaboration. Visual overviews that show temporal patterns of paper publication and citation links among papers can help researchers and analysts to see the rate of growth of topics, identify key papers, and understand influences across subdisciplines. This article applies a novel network-visualization tool based on meaningful layouts of nodes to present research fronts and show citation links that indicate influences across research fronts. To demonstrate the value of two-dimensional layouts with multiple regions and user control of link visibility, we conducted a design-oriented, preliminary case study with 6 domain experts over a 4-month period. The main benefits were being able (a) to easily identify key papers and see the increasing number of papers within a research front, and (b) to quickly see the strength and direction of influence across related research fronts." links: doi: "http://dx.doi.org/10.1002/asi.21160" tags: - "rule-based" - "layout" - "case study" - "design research" - "design" researchr: "https://researchr.org/publication/ArisSQR09" cites: 40 citedby: 0 journal: "jasis" volume: "60" number: "11" pages: "2219-2228" kind: "article" key: "ArisSQR09" - title: "DelosDLMS - The Integrated DELOS Digital Library Management System" author: - name: "Maristella Agosti" link: "https://researchr.org/alias/maristella-agosti" - name: "Stefano Berretti" link: "https://researchr.org/alias/stefano-berretti" - name: "Gert Brettlecker" link: "https://researchr.org/alias/gert-brettlecker" - name: "Alberto Del Bimbo" link: "https://researchr.org/alias/alberto-del-bimbo" - name: "Nicola Ferro" link: "https://researchr.org/alias/nicola-ferro" - name: "Norbert Fuhr" link: "https://researchr.org/alias/norbert-fuhr" - name: "Daniel A. Keim" link: "https://researchr.org/alias/daniel-a.-keim" - name: "Claus-Peter Klas" link: "https://researchr.org/alias/claus-peter-klas" - name: "Thomas Lidy" link: "https://researchr.org/alias/thomas-lidy" - name: "Diego Milano" link: "https://researchr.org/alias/diego-milano" - name: "Moira C. Norrie" link: "https://researchr.org/alias/moira-c.-norrie" - name: "Paola Ranaldi" link: "https://researchr.org/alias/paola-ranaldi" - name: "Andreas Rauber" link: "https://researchr.org/alias/andreas-rauber" - name: "Hans-Jörg Schek" link: "https://researchr.org/alias/hans-j%C3%B6rg-schek" - name: "Tobias Schreck" link: "https://researchr.org/alias/tobias-schreck" - name: "Heiko Schuldt" link: "http://dbis.cs.unibas.ch/team/heiko-schuldt/dbis_staff_view" - name: "Beat Signer" link: "https://researchr.org/alias/beat-signer" - name: "Michael Springmann" link: "https://researchr.org/alias/michael-springmann" year: "2007" doi: "http://dx.doi.org/10.1007/978-3-540-77088-6_4" abstract: "DelosDLMS is a prototype of a next-generation Digital Library (DL) management system. It is realized by combining various specialized DL functionalities provided by partners of the DELOS network of excellence. Currently, DelosDLMS combines text and audio-visual searching, offers new information visualization and relevance feedback tools, provides novel interfaces, allows retrieved information to be annotated and processed, integrates and processes sensor data streams, and finally, from a systems engineering point of view, is easily configured and adapted while being reliable and scalable. The prototype is based on the OSIRIS/ISIS platform, a middleware environment developed by ETH Zürich and now being extended at the University of Basel. " links: doi: "http://dx.doi.org/10.1007/978-3-540-77088-6_4" tags: - "rule-based" - "digital library" - "DLMS" - "data-flow" - "C++" - "digital libraries" - "Meta-Environment" - "stream processing" researchr: "https://researchr.org/publication/DelosDLMS%3A2007" cites: 0 citedby: 0 pages: "36-45" booktitle: "delos" kind: "inproceedings" key: "DelosDLMS:2007" - title: "Semantic Digital Libraries" year: "2009" tags: - "digital library" - "digital libraries" researchr: "https://researchr.org/publication/springer%3A2009semDL" cites: 0 citedby: 0 editor: - name: "Sebastian Ryszard Kruk" link: "https://researchr.org/alias/sebastian-ryszard-kruk" - name: "Bill McDaniel" link: "https://researchr.org/alias/bill-mcdaniel" publisher: "Springer" isbn: "978-3-540-85433-3" kind: "book" key: "springer:2009semDL" - title: "Arnetminer: expertise oriented search using social networks" author: - name: "Juanzi Li" link: "https://researchr.org/alias/juanzi-li" - name: "Jie Tang" link: "https://researchr.org/alias/jie-tang" - name: "Jing Zhang" link: "https://researchr.org/alias/jing-zhang" - name: "Qiong Luo" link: "https://researchr.org/alias/qiong-luo" - name: "Yunhao Liu" link: "https://researchr.org/alias/yunhao-liu" - name: "MingCai Hong" link: "https://researchr.org/alias/mingcai-hong" year: "2008" doi: "http://dx.doi.org/10.1007/s11704-008-0008-9" abstract: "Expertise Oriented Search (EOS) aims at providing comprehensive expertise analysis on data from distributed sources. It is useful in many application domains, for example, finding experts on a given topic, detecting the confliction of interest between researchers, and assigning reviewers to proposals. In this paper, we present the design and implementation of our expertise oriented search system, Arnetminer (http://www.arnetminer.net). Arnetminer has gathered and integrated information about a half-million computer science researchers from the Web, including their profiles and publications. Moreover, Arnetminer constructs a social network among these researchers through their co-authorship, and utilizes this network information as well as the individual profiles to facilitate expertise oriented search tasks. In particular, the co-authorship information is used both in ranking the expertise of individual researchers for a given topic and in searching for associations between researchers. We have conducted initial experiments on Arnetminer. Our results demonstrate that the proposed relevancy propagation expert finding method outperforms the method that only uses person local information, and the proposed two-stage association search on a large-scale social network is order of magnitude faster than the baseline method. " links: doi: "http://dx.doi.org/10.1007/s11704-008-0008-9" tags: - "design science" - "social web" - "design research" - "points-to analysis" - "domain analysis" - "web science" - "analysis" - "data-flow" - "source-to-source" - "e-science" - "web applications" - "social" - "search" - "data-flow analysis" - "design" - "open-source" researchr: "https://researchr.org/publication/LiTZLLH08" cites: 0 citedby: 0 journal: "fcsc" volume: "2" number: "1" pages: "94-105" kind: "article" key: "LiTZLLH08" - title: "Autonomous Citation Matching" author: - name: "Steve Lawrence" link: "http://research.google.com/pubs/author103.html" - name: "C. Lee Giles" link: "https://researchr.org/alias/c.-lee-giles" - name: "Kurt D. Bollacker" link: "https://researchr.org/alias/kurt-d.-bollacker" year: "1999" doi: "http://doi.acm.org/10.1145/301136.301255" abstract: "Advances in computational resources and the communications infrastructure, and the rapid rise of the World Wide Web, have led to the increasingly widespread availability of scientific papers in electronic form. Scientific papers usually contain citations to previous work, and indices of these citations are valuable for literature search, analysis, and evaluation. Current citation indices of the scientific literature are constructed using manual effort and are typically expensive. Part of the reason for using manual effort is the great variability of citation syntax – it can be difficult to autonomously determine if two citations refer to the same article because citations can be written in many different formats. We present machine learning techniques that identify variant forms of citations to the same paper. A number of algorithms are presented. An algorithm based on word and phrase matching is found to perform best, and is sufficiently accurate for unassisted use in an autonomous citation indexing system. An algorithm based on a string edit distance performs poorly in comparison. A computationally efficient subfield algorithm is also presented. The accuracy and efficiency of all algorithms is quantitatively compared on a number of datasets." links: doi: "http://doi.acm.org/10.1145/301136.301255" tags: - "rule-based" - "machine learning" - "analysis" - "C++" - "edit distance" - "search" researchr: "https://researchr.org/publication/LawrenceGB99" cites: 0 citedby: 0 pages: "392-393" booktitle: "agents" kind: "inproceedings" key: "LawrenceGB99" - title: "e-Science: The Added Value for Modern Discovery" author: - name: "Vladimir Getov" link: "https://researchr.org/alias/vladimir-getov" year: "2008" doi: "http://dx.doi.org/10.1109/MC.2008.460" links: doi: "http://dx.doi.org/10.1109/MC.2008.460" tags: - "discovery" - "e-science" researchr: "https://researchr.org/publication/Getov08" cites: 0 citedby: 0 journal: "Computer" volume: "41" number: "11" pages: "30-31" kind: "article" key: "Getov08" - title: "Interaction Design in Digital Libraries" author: - name: "Constantine Stephanidis" link: "https://researchr.org/alias/constantine-stephanidis" year: "1998" doi: "http://link.springer.de/link/service/series/0558/bibs/1513/15130703.htm" links: doi: "http://link.springer.de/link/service/series/0558/bibs/1513/15130703.htm" tags: - "interaction design" - "digital library" - "digital libraries" - "design" researchr: "https://researchr.org/publication/Stephanidis98" cites: 0 citedby: 0 pages: "703" booktitle: "ercimdl" kind: "inproceedings" key: "Stephanidis98" - title: "RESTful Web Services" author: - name: "Leonard Richardson" link: "http://www.crummy.com/" - name: "Sam Ruby" link: "http://intertwingly.net/blog/" year: "2007" month: "May" doi: "http://oreilly.com/catalog/9780596529260" abstract: "You've built web sites that can be used by humans. But can you also build web sites that are usable by machines? That's where the future lies, and that's what this book shows you how to do. Today's web service technologies have lost sight of the simplicity that made the Web successful. This book explains how to put the \"Web\" back into web services with REST, the architectural style that drives the Web." links: doi: "http://oreilly.com/catalog/9780596529260" tags: - "web service" - "architecture" - "REST" - "web services" - "Ruby" researchr: "https://researchr.org/publication/RichardsonRuby-2007" cites: 0 citedby: 0 publisher: "O'Reilly" kind: "book" key: "RichardsonRuby-2007" - title: "ORE Specifications and User Guides - Table of Contents" year: "2008" month: "October" doi: "http://www.openarchives.org/ore/1.0/toc.html" links: doi: "http://www.openarchives.org/ore/1.0/toc.html" researchr: "https://researchr.org/publication/OAI-ORE%3A2008" cites: 0 citedby: 0 howpublished: "http://www.openarchives.org/ore/1.0/toc.html" kind: "misc" key: "OAI-ORE:2008" - title: "Duplicate record identification in bibliographic databases" author: - name: "Pankaj Goyal" link: "https://researchr.org/alias/pankaj-goyal" year: "1987" tags: - "bibliography" - "bibliographic databases" researchr: "https://researchr.org/publication/Goyal87" cites: 0 citedby: 0 journal: "is" volume: "12" number: "3" pages: "239-242" kind: "article" key: "Goyal87" - title: "From the WWW and Minimal Digital Libraries, to Powerful Digital Libraries: Why and How" author: - name: "Edward A. Fox" link: "https://researchr.org/alias/edward-a.-fox" year: "2005" doi: "http://dx.doi.org/10.1007/11599517_74" abstract: "Digital libraries have emerged since the early 1990s, distinguished in part by their emphasis on useful content, helpful organization, and a range of services that include at least indexing, searching, and browsing. In the 5S (Streams, Structures, Spaces, Scenarios, and Societies) formal model for digital libraries we precisely define key concepts and terms, so the field can move beyond the stage of continually explaining basic ideas and debating definitions. Thus, we define a minimal digital library in terms of clear definitions for repository, metadata catalog, services, and societies, which in turn build upon characterizations of digital object, collection, hypertext, etc. " links: doi: "http://dx.doi.org/10.1007/11599517_74" tags: - "hypertext" - "meta-model" - "digital library" - "object-role modeling" - "digital libraries" - "Meta-Environment" - "meta-objects" researchr: "https://researchr.org/publication/Fox05" cites: 0 citedby: 0 pages: "525" booktitle: "ICADL" kind: "inproceedings" key: "Fox05" - title: "Building Digital Libraries Made Easy: Toward Open Digital Libraries" author: - name: "Edward A. Fox" link: "http://fox.cs.vt.edu/" - name: "Hussein Suleman" link: "https://researchr.org/alias/hussein-suleman" - name: "Ming Luo" link: "https://researchr.org/alias/ming-luo" year: "2002" doi: "http://link.springer.de/link/service/series/0558/bibs/2555/25550014.htm" abstract: "Digital libraries (DLs) promote a sharing culture among those who contribute and those who use resources. This same approach works when building Open Digital Libraries (ODLs). Leveraging the intellectual and practical investment made in the Open Archives Initiative through an eXtended Protocol for Metadata Harvesting (XPMH), one can build lightweight protocols to tie together key components that together make up the core of a DL. DL developers in various settings have learned how to apply this framework in a few hours. The ODL approach has been effective with the Computer Science Teaching Center (www.cstc.org), the Networked Digital Library of Theses and Dissertations (www.ndltd.org), and AmericanSouth.org. Hence, to support our Computing and Information Technology Interactive Digital Educational Library (www.citidel.org) and to provide a generic capability for other parts of the US National Science, technology, engineering, and mathematics education Digital Library (www.nsdl.org), we are developing a “DL-in-a-box” toolkit. When lightweight protocols, pools of components, and open standard reference models are combined carefully, as suggested in the OCKHAM discussions, both the DL user and developer communities can benefit from the principle of sharing. " links: doi: "http://link.springer.de/link/service/series/0558/bibs/2555/25550014.htm" tags: - "meta-model" - "protocol" - "digital library" - "source-to-source" - "model-driven engineering" - "digital libraries" - "e-science" - "teaching" - "information models" - "Meta-Environment" - "systematic-approach" - "open-source" researchr: "https://researchr.org/publication/FoxSL02" cites: 0 citedby: 0 pages: "14-24" booktitle: "ICADL" kind: "inproceedings" key: "FoxSL02" - title: "Consolidation of References to Persons in Bibliographic Databases" author: - name: "Nuno Freire" link: "https://researchr.org/alias/nuno-freire" - name: "José Luis Borbinha" link: "https://researchr.org/alias/jos%C3%A9-luis-borbinha" - name: "Bruno Martins" link: "https://researchr.org/alias/bruno-martins" year: "2008" doi: "http://dx.doi.org/10.1007/978-3-540-89533-6_26" abstract: "Entity resolution is the process of determining if, in a specific context, two or more references correspond to the same entity. In this work, we address this problem in the context of references to persons as they are found in bibliographic data, specifically in the case of consolidating multiple datasets. Or solution follows the extraction, transformation and loading (ETL) process, typical in data warehouses. It computes the similarities of the attribute values for the references, and employs a decision tree to decide when the references match. We describe the characteristics of these references within bibliographic datasets, and how we explored those characteristics by developing new similarity metrics to improve the quality of the consolidation process. We evaluated our work by designing an experiment with data from four national libraries. The results show that the proposed similarity metrics contribute significantly to the consolidation process. " links: doi: "http://dx.doi.org/10.1007/978-3-540-89533-6_26" tags: - "bibliography" - "data-flow" - "bibliographic databases" - "context-aware" - "reference resolving" - "transformation" researchr: "https://researchr.org/publication/FreireBM08" cites: 0 citedby: 0 pages: "256-265" booktitle: "ICADL" kind: "inproceedings" key: "FreireBM08" - title: "Using web information for author name disambiguation" author: - name: "Denilson Alves Pereira" link: "https://researchr.org/alias/denilson-alves-pereira" - name: "Berthier A. Ribeiro-Neto" link: "https://researchr.org/alias/berthier-a.-ribeiro-neto" - name: "Nivio Ziviani" link: "https://researchr.org/alias/nivio-ziviani" - name: "Alberto H. F. Laender" link: "https://researchr.org/alias/alberto-h.-f.-laender" - name: "Marcos André Gonçalves" link: "https://researchr.org/alias/marcos-andr%C3%A9-gon%C3%A7alves" - name: "Anderson A. Ferreira" link: "https://researchr.org/alias/anderson-a.-ferreira" year: "2009" doi: "http://doi.acm.org/10.1145/1555400.1555409" abstract: "In digital libraries, ambiguous author names may occur due to the existence of multiple authors with the same name (polysemes) or different name variations for the same author (synonyms). We proposed here a new method that uses information available on the Web to deal with both problems at the same time. Our idea consists of gathering information from input citations and submitting queries to a Web search engine, aiming at finding curricula vitae and Web pages containing publications of the ambiguous authors. From the content of documents in the answer sets returned by the Web search engine, useful information that can help in the disambiguation process is extracted. Using this information, author names are disambiguated by leveraging a hierarchical clustering method that groups citations in the same document together in a bottom-up fashion. Experimental results show that the our method yields results that outperform those of two state-of-the-art unsupervised methods and are statistically comparable with those of a supervised one, but requiring no training. We observe gains of up to 65.2% in the pairwise F1 metric when compared with our best unsupervised baseline method." links: doi: "http://doi.acm.org/10.1145/1555400.1555409" tags: - "digital library" - "disambiguation" - "digital libraries" - "search" researchr: "https://researchr.org/publication/PereiraRZLGF09" cites: 0 citedby: 0 pages: "49-58" booktitle: "JCDL" kind: "inproceedings" key: "PereiraRZLGF09" - title: "Self-Organization and Identification of Web Communities" author: - name: "Gary William Flake" link: "https://researchr.org/alias/gary-william-flake" - name: "Steve Lawrence" link: "http://research.google.com/pubs/author103.html" - name: "C. Lee Giles" link: "https://researchr.org/alias/c.-lee-giles" - name: "Frans Coetzee" link: "https://researchr.org/alias/frans-coetzee" year: "2002" doi: "http://computer.org/computer/co2002/r3066abs.htm" abstract: "Despite the decentralized and unorganized nature of the web, we show that the web self-organizes such that communities of highly related pages can be efficiently identified based purely on connectivity. This discovery allows the identification of communities independent of, and unbiased by, the specific words used by authors. Applications include improved search engines, content filtering, and objective analysis of relationships within and between communities on the web." links: doi: "http://computer.org/computer/co2002/r3066abs.htm" tags: - "community engineering" - "rule-based" - "discovery" - "web engineering" - "analysis" - "C++" - "web applications" - "web communities" - "search" researchr: "https://researchr.org/publication/FlakeLGC02" cites: 0 citedby: 0 journal: "Computer" volume: "35" number: "3" pages: "66-71" kind: "article" key: "FlakeLGC02" - title: "Metacrap: Putting the torch to seven straw-men of the meta-utopia" author: - name: "Cory Doctorow" link: "http://craphound.com/" year: "2001" month: "August" doi: "http://www.well.com/~doctorow/metacrap.htm" abstract: "Metadata is \"data about data\" -- information like keywords, page-length, title, word-count, abstract, location, SKU, ISBN, and so on. Explicit, human-generated metadata has enjoyed recent trendiness, especially in the world of XML. A typical scenario goes like this: a number of suppliers get together and agree on a metadata standard -- a Document Type Definition or scheme -- for a given subject area, say washing machines. They agree to a common vocabulary for describing washing machines: size, capacity, energy consumption, water consumption, price. They create machine-readable databases of their inventory, which are available in whole or part to search agents and other databases, so that a consumer can enter the parameters of the washing machine he's seeking and query multiple sites simultaneously for an exhaustive list of the available washing machines that meet his criteria. If everyone would subscribe to such a system and create good metadata for the purposes of describing their goods, services and information, it would be a trivial matter to search the Internet for highly qualified, context-sensitive results: a fan could find all the downloadable music in a given genre, a manufacturer could efficiently discover suppliers, travelers could easily choose a hotel room for an upcoming trip. A world of exhaustive, reliable metadata would be a utopia. It's also a pipe-dream, founded on self-delusion, nerd hubris and hysterically inflated market opportunities." links: doi: "http://www.well.com/~doctorow/metacrap.htm" tags: - "meta-model" - "XML" - "XML Schema" - "type system" - "data-flow" - "context-aware" - "Meta-Environment" - "search" - "abstract machine" - "metacrap" - "meta-objects" - "meta-data" researchr: "https://researchr.org/publication/doctorow%3A2001" cites: 0 citedby: 0 howpublished: "http://www.well.com/~doctorow/metacrap.htm" kind: "misc" key: "doctorow:2001" - title: "Streams, structures, spaces, scenarios, societies (5s): A formal model for digital libraries" author: - name: "Marcos André Gonçalves" link: "http://buscatextual.cnpq.br/buscatextual/visualizacv.jsp?id=K4763169A6" - name: "Edward A. Fox" link: "http://fox.cs.vt.edu/" - name: "Layne T. Watson" link: "https://researchr.org/alias/layne-t.-watson" - name: "Neill A. Kipp" link: "https://researchr.org/alias/neill-a.-kipp" year: "2004" doi: "http://doi.acm.org/10.1145/984321.984325" abstract: "Digital libraries (DLs) are complex information systems and therefore demand formal foundations lest development efforts diverge and interoperability suffers. In this article, we propose the fundamental abstractions of Streams, Structures, Spaces, Scenarios, and Societies (5S), which allow us to define digital libraries rigorously and usefully. Streams are sequences of arbitrary items used to describe both static and dynamic (e.g., video) content. Structures can be viewed as labeled directed graphs, which impose organization. Spaces are sets with operations on those sets that obey certain constraints. Scenarios consist of sequences of events or actions that modify states of a computation in order to accomplish a functional requirement. Societies are sets of entities and activities and the relationships among them. Together these abstractions provide a formal foundation to define, relate, and unify concepts---among others, of digital objects, metadata, collections, and services---required to formalize and elucidate \"digital libraries\". The applicability, versatility, and unifying power of the 5S model are demonstrated through its use in three distinct applications: building and interpretation of a DL taxonomy, informal and formal analysis of case studies of digital libraries (NDLTD and OAI), and utilization as a formal basis for a DL description language. " links: doi: "http://doi.acm.org/10.1145/984321.984325" tags: - "case study" - "meta-model" - "modeling language" - "digital library" - "language modeling" - "analysis" - "static analysis" - "constraints" - "model-driven development" - "graph-rewriting" - "object-role modeling" - "digital libraries" - "information models" - "abstraction" - "Meta-Environment" - "rewriting" - "taxonomy" - "meta-objects" researchr: "https://researchr.org/publication/GoncalvesFWK04" cites: 0 citedby: 0 journal: "tois" volume: "22" number: "2" pages: "270-312" kind: "article" key: "GoncalvesFWK04" - title: "What do exploratory searchers look at in a faceted search interface?" author: - name: "Bill Kules" link: "https://researchr.org/alias/bill-kules" - name: "Robert Capra" link: "https://researchr.org/alias/robert-capra" - name: "Matthew Banta" link: "https://researchr.org/alias/matthew-banta" - name: "Tito Sierra" link: "https://researchr.org/alias/tito-sierra" year: "2009" doi: "http://doi.acm.org/10.1145/1555400.1555452" abstract: "This study examined how searchers interacted with a web-based, faceted library catalog when conducting exploratory searches. It applied eye tracking, stimulated recall interviews, and direct observation to investigate important aspects of gaze behavior in a faceted search interface: what components of the interface searchers looked at, for how long, and in what order. It yielded empirical data that will be useful for both practitioners (e.g., for improving search interface designs), and researchers (e.g., to inform models of search behavior). Results of the study show that participants spent about 50 seconds per task looking at (fixating on) the results, about 25 seconds looking at the facets, and only about 6 seconds looking at the query itself. These findings suggest that facets played an important role in the exploratory search process." links: doi: "http://doi.acm.org/10.1145/1555400.1555452" tags: - "empirical" - "rule-based" - "meta-model" - "data-flow" - "object-role modeling" - "Meta-Environment" - "search" - "process modeling" researchr: "https://researchr.org/publication/KulesCBS09" cites: 0 citedby: 0 pages: "313-322" booktitle: "JCDL" kind: "inproceedings" key: "KulesCBS09" - title: "Implicit Text Linkages between Medline Records: Using Arrowsmith as an Aid to Scientific Discovery" author: - name: "Don R. Swanson" link: "https://researchr.org/alias/don-r.-swanson" - name: "Neil R. Smalheiser" link: "https://researchr.org/alias/neil-r.-smalheiser" year: "1999" doi: "http://alexia.lis.uiuc.edu/puboff/catalog/trends/48_1abs.html#swanson" links: doi: "http://alexia.lis.uiuc.edu/puboff/catalog/trends/48_1abs.html#swanson" tags: - "discovery" researchr: "https://researchr.org/publication/SwansonS99" cites: 0 citedby: 0 journal: "libt" volume: "48" number: "1" kind: "article" key: "SwansonS99" - title: "Specification and Generation of Digital Libraries into DSpace Using the 5S Framework" author: - name: "Douglas Gorton" link: "https://researchr.org/alias/douglas-gorton" - name: "Weiguo Fan" link: "https://researchr.org/alias/weiguo-fan" - name: "Edward A. Fox" link: "http://fox.cs.vt.edu/" year: "2007" doi: "http://dx.doi.org/10.1007/978-3-540-74851-9_71" abstract: "While digital library (DL) systems continue to become more powerful and usable, a certain amount of inherent complexity remains in the installation, configuration, and customization of out-of-the-box solutions like DSpace and Greenstone. In this work, we build upon past work in the 5S Framework for Digital Libraries and 5SL DL specification language to devise an XML-based model for the specification of DLs for DSpace. We pair this way of specifying DLs with a generator tool which takes a DL specification that adheres to the model and generates a working DSpace instance that matches the specification. " links: doi: "http://dx.doi.org/10.1007/978-3-540-74851-9_71" tags: - "rule-based" - "meta-model" - "XML" - "modeling language" - "XML Schema" - "modeling" - "digital library" - "language modeling" - "digital libraries" - "Meta-Environment" researchr: "https://researchr.org/publication/GortonFF07" cites: 0 citedby: 0 pages: "567-569" booktitle: "ercimdl" kind: "inproceedings" key: "GortonFF07" - title: "eBizSearch: An OAI-Compliant Digital Library for eBusiness" author: - name: "Yves Petinot" link: "https://researchr.org/alias/yves-petinot" - name: "Pradeep B. Teregowda" link: "https://researchr.org/alias/pradeep-b.-teregowda" - name: "Hui Han" link: "https://researchr.org/alias/hui-han" - name: "C. Lee Giles" link: "https://researchr.org/alias/c.-lee-giles" - name: "Steve Lawrence" link: "http://research.google.com/pubs/author103.html" - name: "Arvind Rangaswamy" link: "https://researchr.org/alias/arvind-rangaswamy" - name: "Nirmal Pal" link: "https://researchr.org/alias/nirmal-pal" year: "2003" doi: "http://csdl.computer.org/comp/proceedings/jcdl/2003/1939/00/19390199abs.htm" abstract: "Niche Search Engines offer an efficient alternative to traditional search engines when the results returned by general-purpose search engines do not provide a sufficient degree of relevance and when nontraditional search features are required. Niche search engines can take advantage of their domain of concentration to achieve higher relevance and offer enhanced features. We discuss a new digital library niche search engine, eBizSearch, dedicated to e-business and e-business documents. The ground technology for eBizSearch is CiteSeer, a specialpurpose automatic indexing document digital library and search engine developed at NEC Research Institute. We present here the integration of CiteSeer in the framework of eBizSearch and the process necessary to tune the whole system towards the specific area of e-business. We show how using machine learning algorithms we generate metadata to make eBizSearch Open Archives compliant. eBizSearch is a publicly available service and can be reached at [13]." links: doi: "http://csdl.computer.org/comp/proceedings/jcdl/2003/1939/00/19390199abs.htm" tags: - "machine learning" - "digital library" - "source-to-source" - "C++" - "digital libraries" - "e-science" - "search" - "open-source" researchr: "https://researchr.org/publication/PetinotTHGLRP03" cites: 0 citedby: 0 pages: "199-209" booktitle: "JCDL" kind: "inproceedings" key: "PetinotTHGLRP03"