Visualization for Information Retrieval based on Fast Search Technology
Indonesian Journal of Electrical Engineering and Informatics (IJEEI)
The core of search engine is information retrieval technique. Using information retrieval system backs more retrieval results, some of them more relevant than other, and some is not relevant. While using search engine to retrieve information has grown very substantially, there remain problems with the information retrieval systems. The interface of the systems does not help them to perceive the precision of these results. It is therefore not surprising that graphical visualizations have been
... loyed in search engines to assist users. The main objective of Internet users is to find the required information with high efficiency and effectiveness. In this paper we present brief sides of information visualization's role in enhancing web information retrieval system as in some of its techniques such as tree view, title view, map view, bubble view and cloud view and its tools such as highlighting and Colored Query Result. Introduction The typical generic scenario for searching, retrieving, and displaying information is the following. A user has an information need about a certain topic. With a user interface he/she formulates a query to the system . The query starts an action in the system (search engine, information retrieval (IR) system, digital library, or other) . The system will retrieve (or not) objects and will display them with appropriate messages and layouts in the same graphical user interface (GUI) where the user entered the query (3). Finally, the user decides if the documents are relevant or not. He/she can either exit the system because the information was found or refine the query and start again . Information retrieval (IR) is the task of representing, storing, organizing, and offering access to information items . The problem for search engines is not only to find topic relevant results, but results consistent with the user's information need. How to retrieve desired information from the Internet with high efficiency and good effectiveness is become the main concern of internet user-based  . Search engines interfaces are intuitive and in some cases restricted by the nature of the WWW. There is a limited use of color, no pull-down menus, and limited user interaction. The typical input interface is a simple box where the user fills the terms to search plus button to submit the query. The visualization process of the answers can be text only or more rich and complex with the use of a graphical metaphor. In the text only approach, the user gets a list of the top 10 or 20 best documents that potentially contains the information. The list usually contains the title, its URL, size, date, and an abstract of no more than 4 lines of the document. The user opens each document until the desired information is finally located. This is not a problem when the target document is located in the first 20 answers. It becomes a problem when the output of a query is a list of hundreds or thousands of documents. A graphical metaphor presents a rich interface in which the user can browse, filter, process, and reformulate the query . User behavior, performance and attitude were recorded as well as usability problems. The system had few usability problems and users liked the visualizations, but recall performance was poor. The reasons for poor/good performance were investigated by examining user behavior and search strategies. Better searchers used the visualizations more electively and spent longer on the task, whereas poorer performances were attributable to poor motivation, difficulty in assessing article relevance and poor use of system visualizations  . Hence, visualization is an effective tool to partially solve data overload problems in WWW retrieval when answers contain hundreds of documents. The visualization of quantitative information consist of principles to help achieve the main goal: communicate complex ideas with clarity, precision, and efficiency  . 28 This survey paper contains problems that faced web information retrieval system whether because of the web nature or user activity or searching process itself. Then, how the search engine works and models of information retrieval. Next, the meaning of visualization, information visualization as one of its application and how it enhances web information retrieval system. Finally, real systems used information visualization tool in reducing and solving some of web information retrieval system's problems. Problem Definition The World Wide Web is a huge, widely distributed, global source for information services, hyper-link information, access and usage information and web site content and organization  . There is a huge quantity of text, audio, video, and other documents available on the Internet, on about any subject. Users need to be able to find relevant information to satisfy their particular information needs. There are two ways of searching for information: to use a search engines or to browse directories organized by categories (such as Yahoo Directories). There is still a large part of the Internet that is not accessible (for example private databases and intranets)  . By all measures, the web is enormous and growing at a staggering rate, which has made it increasingly intricate and crucial for both people and programs to have quick and accurate access to web information and services  . It is not surprising that about 85% of internet users surveyed claim to be using search engines and search services to find specific information of interest [5, 6] . The same surveys show, however, those users are not satisfied with the performance of the current generation of search engines; the slow speed of retrieval, communication delays, and poor quality of retrieved results (e.g., noise and broken links) are commonly cited problems  . Search engines have played a key role in the World Wide Web's infrastructure as its scale and impact have escalated. Although search engines are important tools for knowledge discovery on the web, they are far from perfect. The poor quality of retrieved results, handling a huge quantity of information, addressing subjective and timevarying search needs, finding fresh information and dealing with poor quality queries are commonly cited glitches  . There are many problems with different reasons which it can be by the web nature, users, search engine and hardware.