Search results for: clustering-and-information-retrieval

Clustering and Information Retrieval

Author : Weili Wu
File Size : 21.23 MB
Format : PDF, Mobi
Download : 938
Read : 1043
Clustering is an important technique for discovering relatively dense sub-regions or sub-spaces of a multi-dimension data distribution. Clustering has been used in information retrieval for many different purposes, such as query expansion, document grouping, document indexing, and visualization of search results. In this book, we address issues of clustering algorithms, evaluation methodologies, applications, and architectures for information retrieval. The first two chapters discuss clustering algorithms. The chapter from Baeza-Yates et al. describes a clustering method for a general metric space which is a common model of data relevant to information retrieval. The chapter by Guha, Rastogi, and Shim presents a survey as well as detailed discussion of two clustering algorithms: CURE and ROCK for numeric data and categorical data respectively. Evaluation methodologies are addressed in the next two chapters. Ertoz et al. demonstrate the use of text retrieval benchmarks, such as TRECS, to evaluate clustering algorithms. He et al. provide objective measures of clustering quality in their chapter. Applications of clustering methods to information retrieval are addressed in the next four chapters. Chu et al. and Noel et al. explore feature selection using word stems, phrases, and link associations for document clustering and indexing. Wen et al. and Sung et al. discuss applications of clustering to user queries and data cleansing. Finally, we consider the problem of designing architectures for information retrieval. Crichton, Hughes, and Kelly elaborate on the development of a scientific data system architecture for information retrieval.

Use of Vector Space Model with Hierarchical Clustering in Information Retrieval

Author : Stephen Breeding
File Size : 78.26 MB
Format : PDF, Mobi
Download : 854
Read : 555

Information Retrieval Technology

Author : Azizah Jaafar
File Size : 59.97 MB
Format : PDF
Download : 501
Read : 400
This book constitutes the refereed proceedings of the 10th Asia Information Retrieval Societies Conference, AIRS 2014, held in Kuching, Malaysia, in December 2014. The 42 full papers were carefully reviewed and selected from 110 submissions. Seven tracks were the focus of AIRS 2014: IR models and theories; IR evaluation, user study and interactive IR; web IR, scalability and IR in social media; multimedia IR; natural language processing for IR; machine learning and data mining for IR; and IR applications.

Clustering and Ranking for Web Information Retrieval

Author : Antonio Gullì
File Size : 79.37 MB
Format : PDF, ePub
Download : 466
Read : 1258
This book investigates several research problems which arise in modern Web Information Retrieval. First, we consider the fact that there are many situations where a flat list of ten search results is not enough, and that users might desire to have a larger number of results grouped on-the-fly in folders of similar topics. In this book, we describe Snaket, a hierarchical clustering meta-search engine which personalizes searches according to the clusters selected on-the-fly by users. Second, we consider those situations where users might desire to access fresh information such as news articles. We present a new ranking algorithm suitable for ranking this fresh type of information. Third, we discuss numerical methodologies for accelerating the ranking methodologies used in Web Search. An important achievement of this book is that we show how to address the above predominant issues of Web Information Retrieval by using clustering and ranking methodologies. We demonstrate that clustering and ranking have a mutual reinforcement property that has not yet been studied intensively.

Next Generation Search Engines Advanced Models for Information Retrieval

Author : Jouis, Christophe
File Size : 71.89 MB
Format : PDF, ePub
Download : 492
Read : 788
Recent technological progress in computer science, Web technologies, and the constantly evolving information available on the Internet has drastically changed the landscape of search and access to information. Current search engines employ advanced techniques involving machine learning, social networks, and semantic analysis. Next Generation Search Engines: Advanced Models for Information Retrieval is intended for scientists and decision-makers who wish to gain working knowledge about search in order to evaluate available solutions and to dialogue with software and data providers. The book aims to provide readers with a better idea of the new trends in applied research.

String Processing and Information Retrieval

Author : Fabio Crestani
File Size : 31.19 MB
Format : PDF, ePub
Download : 202
Read : 815
This book constitutes the refereed proceedings of the 13th International Conference on String Processing and Information Retrieval, SPIRE 2006, held in Glasgow, UK in October 2006. The 26 revised full papers and 5 revised short papers presented together with 2 invited talks were carefully reviewed and selected from 102 submissions. The papers are organized in topical sections on Web clustering and text categorisation, strings, user behaviour, Web search algorithms, compression, correction, information retrieval applications, bio-informatics, and Web search engines.

Information Retrieval Systems

Author : Gerald J. Kowalski
File Size : 75.30 MB
Format : PDF, ePub, Mobi
Download : 383
Read : 1079
The growth of the Internet and the availability of enormous volumes of data in digital form have necessitated intense interest in techniques to assist the user in locating data of interest. The Internet has over 350 million pages of data and is expected to reach over one billion pages by the year 2000. Buried on the Internet are both valuable nuggets to answer questions as well as a large quantity of information the average person does not care about. The Digital Library effort is also progressing, with the goal of migrating from the traditional book environment to a digital library environment. The challenge to both authors of new publications that will reside on this information domain and developers of systems to locate information is to provide the information and capabilities to sort out the non-relevant items from those desired by the consumer. In effect, as we proceed down this path, it will be the computer that determines what we see versus the human being. The days of going to a library and browsing the new book shelf are being replaced by electronically searching the Internet or the library catalogs. Whatever the search engines return will constrain our knowledge of what information is available. An understanding of Information Retrieval Systems puts this new environment into perspective for both the creator of documents and the consumer trying to locate information.

Information Retrieval Technology

Author : Rafael Banchs
File Size : 67.17 MB
Format : PDF, Docs
Download : 389
Read : 300
This book constitutes the refereed proceedings of the 9th Asia Information Retrieval Societies Conference, AIRS 2013, held in Singapore, in December 2013. The 27 full papers and 18 poster presentations included in this volume were carefully reviewed and selected from 109 submissions. They are organized in the following topical sections: IR theory, modeling and query processing; clustering, classification and detection; natural language processing for IR; social networks, user-centered studies and personalization; and applications.

Natural Language Information Retrieval

Author : T. Strzalkowski
File Size : 54.31 MB
Format : PDF, Kindle
Download : 903
Read : 307
The last decade has been one of dramatic progress in the field of Natural Language Processing (NLP). This hitherto largely academic discipline has found itself at the center of an information revolution ushered in by the Internet age, as demand for human-computer communication and information access has exploded. Emerging applications in computer-assisted information production and dissemination, automated understanding of news, understanding of spoken language, and processing of foreign languages have given impetus to research that resulted in a new generation of robust tools, systems, and commercial products. Well-positioned government research funding, particularly in the U.S., has helped to advance the state of the art at an unprecedented pace, in no small measure thanks to the rigorous evaluations. This volume focuses on the use of Natural Language Processing in Information Retrieval (IR), an area of science and technology that deals with cataloging, categorization, classification, and search of large amounts of information, particularly in textual form. An outcome of an information retrieval process is usually a set of documents containing information on a given topic, and may consist of newspaper-like articles, memos, reports of any kind, entire books, as well as annotated image and sound files. Since we assume that the information is primarily encoded as text, IR is also a natural language processing problem: in order to decide if a document is relevant to a given information need, one needs to be able to understand its content.

Adaptive Page Clustering for Information Retrieval

Author : 林宜嫻
File Size : 29.96 MB
Format : PDF, ePub, Docs
Download : 511
Read : 301

Information Retrieval Architecture and Algorithms

Author : Gerald Kowalski
File Size : 20.87 MB
Format : PDF, Docs
Download : 204
Read : 541
This text presents a theoretical and practical examination of the latest developments in Information Retrieval and their application to existing systems. By starting with a functional discussion of what is needed for an information system, the reader can grasp the scope of information retrieval problems and discover the tools to resolve them. The book takes a system approach to explore every functional processing step in a system from ingest of an item to be indexed to displaying results, showing how implementation decisions add to the information retrieval goal, and thus providing the user with the needed outcome, while minimizing their resources to obtain those results. The text stresses the current migration of information retrieval from just textual to multimedia, expounding upon multimedia search, retrieval and display, as well as classic and new textual techniques. It also introduces developments in hardware, and more importantly, search architectures, such as those introduced by Google, in order to approach scalability issues. About this textbook: A first course text for advanced level courses, providing a survey of information retrieval system theory and architecture, complete with challenging exercises. Approaches information retrieval from a practical systems view in order for the reader to grasp both scope and solutions. Features what is achievable using existing technologies and investigates what deficiencies warrant additional exploration.

Advances in Information Retrieval

Author : Ricardo Baeza-Yates
File Size : 50.53 MB
Format : PDF, Docs
Download : 615
Read : 828
This book constitutes the proceedings of the 34th European Conference on IR Research, ECIR 2012, held in Barcelona, Spain, in April 2012. The 37 full papers, 28 poster papers and 7 demonstrations presented in this volume were carefully reviewed and selected from 167 submissions. The contributions are organized in sections named: query representation; blogs and online-community search; semi-structured retrieval; evaluation; applications; retrieval models; image and video retrieval; text and content classification, categorisation, clustering; systems efficiency; industry track; and posters.

Advances in Information Retrieval Theory

Author : Leif Azzopardi
File Size : 33.79 MB
Format : PDF, ePub, Docs
Download : 569
Read : 853
These proceedings contain the refereed papers and posters presented at the Second International Conference on the Theory of Information Retrieval (ICTIR 2009), held at Microsoft Research in Cambridge, UK, September 10-11, 2009. This biennial international conference provides an opportunity for the presentation of the latest work describing theoretical advances in the field of information retrieval (IR). The first ICTIR was held in Budapest in October 2007, organized by Keith van Rijsbergen, Sándor Dominich, Sándor Darányi, and Ferenc Kiss. ICTIR was brought about by the growing interest in the consecutive workshops run at ACM SIGIR each year from 2000 until 2005 on Mathematical and Formal Methods in IR (Athens, Greece, 2000; New Orleans, USA, 2001; Tampere, Finland, 2002; Toronto, Canada, 2003; Sheffield, UK, 2004; Salvador, Brazil, 2005). This sustained initiative was in a large part down to the determination of Sándor Dominich and his passion for all things good, formal and mathematical. The foundation and the success of ICTIR is a direct result of his commitment and dedication to fostering research and development into the theoretical underpinnings of IR. His dedication is epitomized by his two books on the subject: Mathematical Foundations in Information Retrieval published in 2001, and The Modern Algebra of Information Retrieval published in 2008.

Ontology Based Information Retrieval for Healthcare Systems

Author : Vishal Jain
File Size : 75.8 MB
Format : PDF, Kindle
Download : 281
Read : 379
With the advancements of the semantic web, ontology has become the crucial mechanism for representing concepts in various domains. For research and dispersal of customized healthcare services, a major challenge is to efficiently retrieve and analyze individual patient data from a large volume of heterogeneous data over a long time span. This requirement demands effective ontology-based information retrieval approaches for clinical information systems so that the pertinent information can be mined from a large amount of distributed data. This unique and groundbreaking book highlights the key advances in ontology-based information retrieval techniques being applied in the healthcare domain and covers the following areas: Semantic data integration in e-health care systems; Keyword-based medical information retrieval; Ontology-based query retrieval support for e-health implementation; Ontologies as a database management system technology for medical information retrieval; Information integration using contextual knowledge and ontology merging; Collaborative ontology-based information indexing and retrieval in health informatics; An ontology-based text mining framework for vulnerability assessment in health and social care; An ontology-based multi-agent system for matchmaking patient healthcare monitoring; A multi-agent system for querying heterogeneous data sources with ontologies for reducing cost of customized healthcare systems; A methodology for ontology based multi agent systems development; Ontology based systems for clinical systems: validity, ethics and regulation.

Information Retrieval Technology

Author : Asia Information Retrieval Symposium
File Size : 76.81 MB
Format : PDF, Kindle
Download : 390
Read : 1019
This book constitutes the refereed proceedings of the Second Asia Information Retrieval Symposium, AIRS 2005, held in Jeju Island, Korea, in October 2005. The 32 revised full papers and 36 revised poster papers presented were carefully reviewed and selected from 136 papers submitted. All current issues in information retrieval are addressed: applications, systems, technologies and theoretical aspects of information retrieval in text, audio, image, video and multi-media data. The papers are organized in topical sections on relevance/retrieval models, multimedia IR, natural language processing in IR, enabling technology, Web IR, question answering, document/query models, a special session: digital photo album, TDT/clustering, multimedia/classification, and two poster and demo sessions.

Visualization for Information Retrieval

Author : Jin Zhang
File Size : 46.66 MB
Format : PDF, ePub
Download : 735
Read : 238
Information visualization offers a way to reveal hidden patterns in a visual presentation and allows users to seek information from a visual perspective. Readers of this book will gain an in-depth understanding of the current state of information retrieval visualization. They will be introduced to existing problems along with technical and theoretical findings. The book also provides practical details for the implementation of an information retrieval visualization system.

Information Retrieval Technology

Author : Hang Li
File Size : 87.40 MB
Format : PDF, ePub, Mobi
Download : 892
Read : 1030
This book constitutes the thoroughly refereed post-conference proceedings of the 4th Asia Information Retrieval Symposium, AIRS 2008, held in Harbin, China, in May 2008. The 39 revised full papers and 43 revised poster papers presented were carefully reviewed and selected from 144 submissions. All current issues in information retrieval are addressed: applications, systems, technologies and theoretical aspects of information retrieval in text, audio, image, video and multi-media data. The papers are organized in topical sections on IR models, image retrieval, text classification, Chinese language processing, text processing, application of IR, machine learning, taxonomy, IR methods, information extraction, summarization, multimedia, Web IR, and text clustering.

Author :
File Size : 28.98 MB
Format : PDF
Download : 793
Read : 991

Advances in Information Retrieval

Author : Sharon McDonald
File Size : 25.51 MB
Format : PDF, Docs
Download : 123
Read : 542
These proceedings contain the refereed full technical papers presented at the 26th Annual European Conference on Information Retrieval (ECIR 2004). ECIR is the annual conference of the British Computer Society’s specialist group in Information Retrieval. This year the conference was held at the School of Computing and Technology at the University of Sunderland. ECIR began life as the Annual Colloquium on Information Retrieval Research. The colloquium was held in the UK each year until 1998 when the event was held in Grenoble, France. Since then the conference venue has alternated between the United Kingdom and Continental Europe, and the event was renamed the European Conference on Information Retrieval. In recent years, ECIR has continued to grow and has become the major European forum for the discussion of research in the field of Information Retrieval. To mark this metamorphosis from a small informal colloquium to a major event in the IR research calendar, the BCS-IRSG decided to rename the event to the European Conference on Information Retrieval. ECIR 2004 received 88 full paper submissions, from across Europe and further afield including North America, China and Australia, a testament to the growing popularity and reputation of the conference. Out of the 88 submitted papers, 28 were accepted for presentation. All papers were reviewed by at least three reviewers. Among the accepted papers 11 have a student as the primary author, illustrating that the traditional student focus of the original colloquium is alive today.

A Comparative Study of the Hierarchical Document Clustering in Information Retrieval System

Author : Mary J. Sheen
File Size : 54.10 MB
Format : PDF, Kindle
Download : 204
Read : 416