{"id":1870,"date":"2018-12-05T09:47:55","date_gmt":"2018-12-05T09:47:55","guid":{"rendered":"http:\/\/www.gyanvihar.org\/journals\/?p=1870"},"modified":"2019-08-13T10:39:15","modified_gmt":"2019-08-13T10:39:15","slug":"an-analysis-of-index-based-information-retrieval-algorithms","status":"publish","type":"post","link":"https:\/\/www.gyanvihar.org\/journals\/an-analysis-of-index-based-information-retrieval-algorithms\/","title":{"rendered":"An Analysis of Index Based Information Retrieval Algorithms"},"content":{"rendered":"<p style=\"text-align: justify\"><strong><sup>1<\/sup>Deepika Sharma, <sup>2<\/sup>Sandeepkumar Poonia<\/strong><\/p>\n<p style=\"text-align: justify\">1,2-Jagannath University, Jaipur, Rajasthan, India<\/p>\n<p style=\"text-align: justify\"><strong>Abstract: For thousands of years, humans have stored information in different ways. From the Stone Age to today\u2019s computer era, there has been a need both to store information and to retrieve it later for use. Retrieval alone is not enough: the information retrieved must also be relevant to the user. With the advent of computers, large amounts of information can be stored easily, so finding relevant information in such large volumes of data has become a necessity. This calls for a system that returns relevant and accurate information in response to a user\u2019s query. Such a system is known as an Information Retrieval (IR) system, and it relies on textual keywords to index and retrieve documents. In an IR system, indexing is a technique that makes search more accurate, faster and more relevant. An index can be generated from the keywords present in the stored documents. Sometimes the results are also stored in order of the frequency of the keywords present in the documents. 
This paper analyzes and compares different indexing techniques, which use different concepts and algorithms, on various parameters to identify their <\/strong><strong>advantages and limitations in searching\u00a0<\/strong><strong>for relevant information.<\/strong><\/p>\n<p style=\"text-align: justify\"><strong>I INTRODUCTION<\/strong><\/p>\n<p style=\"text-align: justify\">As the volume of documents, and thus of information, in repositories increases day by day, it becomes a challenge to provide proper and relevant information to the user. Figure 1 shows the working of a typical information retrieval system. The motive behind this paper is to analyze the currently important algorithms for searching for relevant information.<\/p>\n<p style=\"text-align: justify\"><img loading=\"lazy\" decoding=\"async\" class=\"alignnone size-full wp-image-1871 aligncenter\" src=\"http:\/\/www.gyanvihar.org\/journals\/wp-content\/uploads\/2018\/12\/45.png\" alt=\"\" width=\"433\" height=\"306\" srcset=\"https:\/\/www.gyanvihar.org\/journals\/uploads\/2018\/12\/45.png 433w, https:\/\/www.gyanvihar.org\/journals\/uploads\/2018\/12\/45-100x70.png 100w\" sizes=\"auto, (max-width: 433px) 100vw, 433px\" \/><\/p>\n<p style=\"text-align: justify\"><strong>II RELATED WORK<\/strong><\/p>\n<p style=\"text-align: justify\"><strong>ONTOLOGY BASED DOCUMENT INDEXING<\/strong><\/p>\n<p style=\"text-align: justify\">Knowledge-intensive organizations, such as consultancy services, consultancy and supply services, and data processing services, depend on information gathered from various sources, which may be inside the organization and\/or external, like the internet. It is very important for such an organization to make the best use of the information gathered from these sources. 
Knowledge-intensive organizations can practice knowledge management, which covers the activities of the knowledge life cycle: identification, acquisition, development, sharing, use and preservation of organizational knowledge. A system was therefore designed to gather information for such tasks: the web-based Webocrat system. Webocrat can interact at the knowledge level with the help of a language, which is provided to the system by an ontology together with syntactic and semantic rules. Using an ontology makes it possible to define concepts and relations that represent knowledge about a particular document in domain-specific terms.<\/p>\n<p style=\"text-align: justify\"><strong>Scheme Of Document Retrieval<\/strong><\/p>\n<p style=\"text-align: justify\">The developed package offers three different approaches to document retrieval: vector representation, latent semantic indexing (LSI), and the ontology-based method used in the Webocrat system.<\/p>\n<p style=\"text-align: justify\"><strong>Vector Representation Approach<\/strong><\/p>\n<p style=\"text-align: justify\">This well-known approach is based on a vector representation of the document collection. First, every document is passed through a set of pre-processing tools (lower-casing, stop-word filtering, document-frequency filtering). Then a vector of index term weights is calculated as the document\u2019s internal representation. 
These weights are calculated with the most commonly used tf-idf scheme:<\/p>\n<p style=\"text-align: justify\">w<sub>ij<\/sub> = tf<sub>ij<\/sub> \u00b7 idf<sub>i<\/sub><\/p>\n<p style=\"text-align: justify\">where tf<sub>ij<\/sub> = freq<sub>ij<\/sub> \/ max<sub>l<\/sub> freq<sub>lj<\/sub> and idf<sub>i<\/sub> = log (N \/ n<sub>i<\/sub>)<\/p>\n<p style=\"text-align: justify\"><em>freq<\/em><sub><em>ij<\/em><\/sub> is the number of occurrences of term <em>t<\/em><sub><em>i<\/em><\/sub> in document <em>d<\/em><sub><em>j<\/em><\/sub>, max<sub>l<\/sub> freq<sub>lj<\/sub> is the highest term frequency in <em>d<\/em><sub><em>j<\/em><\/sub>, <em>N<\/em> is the number of documents in the collection, and <em>n<\/em><sub><em>i<\/em><\/sub> is the document frequency of term <em>t<\/em><sub><em>i<\/em><\/sub> in the whole document collection.<\/p>\n<p style=\"text-align: justify\">Such a vector is then normalized to unit length and stored in the term-document matrix <em>A<\/em>, which is the internal representation of the whole document collection. To find documents relevant to a specific query <em>Q<\/em>, the query must be represented in the same way as a document <em>D<\/em><sub><em>i<\/em><\/sub> (i.e. as a vector of index term weights). The similarity between a query <em>Q<\/em> and a document <em>D<\/em><sub><em>i<\/em><\/sub> is computed as the cosine of the two normalized vectors (document and query vectors).<\/p>\n<p style=\"text-align: justify\">Sim<sub>TF-IDF<\/sub>(Q, D<sub>i<\/sub>) = (D<sub>i<\/sub> \u00b7 Q) \/ (|D<sub>i<\/sub>| |Q|)<\/p>\n<p style=\"text-align: justify\"><strong>Latent Semantic Indexing Approach<\/strong><\/p>\n<p style=\"text-align: justify\">The LSI approach is based on the singular value decomposition of the tf-idf matrix <em>A<\/em>. This decomposition yields three matrices:<\/p>\n<p style=\"text-align: justify\"><em>A <\/em>= <em>USV<\/em><em><sup>T<\/sup><\/em><\/p>\n<p style=\"text-align: justify\">where <em>S<\/em> is the diagonal matrix of <em>singular values<\/em> and <em>U<\/em>, <em>V<\/em> are the matrices of left and right<em> singular vectors. 
<\/em>If the singular values in <em>S<\/em> are ordered by size, the first <em>k<\/em> largest values may be kept and the remaining smaller ones set to zero. The product of the resulting matrices is a matrix approximately equal to <em>A<\/em>, and is closest to <em>A<\/em> in the least-squares sense.<\/p>\n<p style=\"text-align: justify\"><strong>Ontology Based Approach<\/strong><\/p>\n<p style=\"text-align: justify\">Here the Webocrat system uses an ontology for document retrieval. Figure 2 shows the whole process of query processing with this approach. When a user submits a query, the appropriate concepts are retrieved first (this step can be done manually). Then the set of concepts associated with each document is retrieved from the database. The two sets are then compared using a simple metric, which expresses the similarity between a document Di and the given query Q. Simonto(Q,D) =k<\/p>\n<p style=\"text-align: justify\">where <em>Q<\/em><sub>con<\/sub> is the set of concepts assigned to query <em>Q<\/em>, <em>D<\/em><sub>con<\/sub> is the set of concepts assigned to document <em>D<\/em>, and <em>k<\/em> is a small constant, e.g. 0.1. The resulting number is the ontology-based similarity measure.<\/p>\n<p style=\"text-align: justify\"><img loading=\"lazy\" decoding=\"async\" class=\"alignnone size-full wp-image-1879\" src=\"http:\/\/www.gyanvihar.org\/journals\/wp-content\/uploads\/2018\/12\/47.png\" alt=\"\" width=\"436\" height=\"473\" \/><\/p>\n<p style=\"text-align: justify\"><strong>KEYWORD BASED RETRIEVAL SYSTEM<\/strong><\/p>\n<p style=\"text-align: justify\">This work introduces a distributed information retrieval framework based on the Okapi probabilistic model; the framework is able to achieve the same level of effectiveness as a single centralized index system. The Okapi model was proposed by Robertson and Sparck Jones. It is an enhanced probabilistic retrieval model based on the binary independence model. 
A simplified Okapi weighting function is used, in which a weight w1 is assigned to a given term t1 in a document d and is computed according to the following formula:<\/p>\n<p style=\"text-align: justify\"><img loading=\"lazy\" decoding=\"async\" class=\"alignnone size-full wp-image-1882\" src=\"http:\/\/www.gyanvihar.org\/journals\/wp-content\/uploads\/2018\/12\/48.png\" alt=\"\" width=\"404\" height=\"419\" \/><\/p>\n<p style=\"text-align: justify\"><strong>INDEX BASED INFORMATION RETRIEVAL SYSTEM<\/strong><\/p>\n<p style=\"text-align: justify\">This is a three-step process:<\/p>\n<p style=\"text-align: justify\"><em>Indexing<\/em>: In this step the documents are preprocessed. First, a file is created for each document containing its words other than stop words (at, the, is, an, etc.). The words are also stemmed to obtain their root forms (for example, use is the root form of using, used and usable). Next, the frequency of each word is counted, and the words whose frequency exceeds a threshold value (based on a formula) are taken as index terms. Together, these terms form the index table for that document.<\/p>\n<p style=\"text-align: justify\"><em>Formulation: <\/em>The first step is to expand the query based on domain knowledge stored as an ontological tree structure.<\/p>\n<p style=\"text-align: justify\">The second step is to apply the preprocessing approach.<\/p>\n<p style=\"text-align: justify\"><em>Comparison: <\/em>The system compares the user query to the stored document representatives and decides which documents to retrieve and in what order. Documents or parts of documents are then displayed. Before searching, the user can select whether or not to expand the query using the tree. 
The comparison uses a matrix multiplication approach: the document representatives are converted into a document-id-by-term matrix, and a matrix is generated for the query terms. Multiplying the two gives the result needed to identify which documents are more relevant to the query. Mathematically, it can be shown as:<\/p>\n<p style=\"text-align: justify\"><img loading=\"lazy\" decoding=\"async\" class=\"alignnone size-full wp-image-1884\" src=\"http:\/\/www.gyanvihar.org\/journals\/wp-content\/uploads\/2018\/12\/49.png\" alt=\"\" width=\"424\" height=\"331\" \/><\/p>\n<p style=\"text-align: justify\">Here the comparison is based on weighted values and the inverse document frequency (IDF):<\/p>\n<p style=\"text-align: justify\">Weight is the number of times a term appears in a document, so it indicates how relevant the term is to that particular document.<\/p>\n<p style=\"text-align: justify\">IDF is the inverse of the document frequency; it is included as a measure that favors terms which occur in fewer documents. The fewer documents a term occurs in, the higher this weight.<\/p>\n<p style=\"text-align: justify\">Thus the weight*IDF factor takes a greater value for terms that are important to the document. The system was tested on a sample computer science domain containing chapters of books, and it is able to reduce the number of words to be searched in a file, thereby minimizing the search space. This effectively reduces the search time as well. The search space is reduced by more than 90%.<\/p>\n<p style=\"text-align: justify\"><strong>ATTENTION BASED INFORMATION RETRIEVAL<\/strong><\/p>\n<p style=\"text-align: justify\">This work examines how attention data from the user can be exploited to enhance and personalize information retrieval. 
Up to now, nearly all implicit feedback sources used for information retrieval have been based on mouse and keyboard input, such as click-through, scrolling and annotation behavior. In this work, an unobtrusive eye tracker is used as a source of attention evidence, able to detect precisely which document passages were read or skimmed. This information is stored in attention-annotated documents (e.g., containing read, skimmed, highlighted and commented passages). Based on such annotated documents, the user\u2019s current thematic context is estimated. This context is increasingly taken into account in information retrieval systems.<\/p>\n<p style=\"text-align: justify\">For instance, context can be generated implicitly or explicitly. One of the current challenges is to elicit the context of a user. This can be done explicitly, for example by asking the user whether a document is currently relevant or not (i.e., explicit relevance feedback). Using such explicitly generated context is an obvious choice and yields better results in IR than ignoring user context. However, asking the user for explicit feedback requires more effort on the user\u2019s side and should therefore be avoided. Thus, implicit feedback has recently gained in importance, i.e., observing the user\u2019s actions and environment and trying to infer what might be relevant for him. A very interesting new evidence source for implicit feedback is the user\u2019s eye movements, because they mostly reflect the user\u2019s visual attention directly. Today\u2019s eye trackers are unobtrusive and able to identify the user\u2019s gaze with high precision. Therefore, applying an eye tracker as a new evidence source for the user\u2019s attention introduces a potentially very valuable new dimension of contextual information in information retrieval. It is clear that eye trackers will not be widespread in the near future because of their cost. 
However, if they become less expensive, they might well be of interest to knowledge workers in medium-sized or large enterprises. The focus lies on local desktop and enterprise-wide search. As an eye tracker is not the only source of attention evidence, a model is to be developed that integrates different attention evidence sources so that a standardized overall level of attention can be derived for any piece of text in a document.<\/p>\n<p style=\"text-align: justify\"><img loading=\"lazy\" decoding=\"async\" class=\"alignnone size-full wp-image-1886\" src=\"http:\/\/www.gyanvihar.org\/journals\/wp-content\/uploads\/2018\/12\/50.png\" alt=\"\" width=\"444\" height=\"387\" \/><\/p>\n<p style=\"text-align: justify\"><strong>Attention-Based Index<\/strong><\/p>\n<p style=\"text-align: justify\">The specific composition of a document reflects the mental models of its authors, which might be very different from those of its readers. A reader might regard only some parts of the document, and with different intensities, namely those parts that are of interest to him in his current thematic context (e.g., for his current task). Therefore, a document index for local desktop search, which supports retrieval of already used documents, should consider the user\u2019s degree of attention on the different parts of those documents. Secondly, there is usually more than one document dealing with a specific topic of interest to the user. Often, if a knowledge worker is making a document-based inquiry about some topic, he will consult not just one but several documents. 
To find these used documents at a later time (e.g., if a user remembers something interesting about a topic read some months ago), some connection between them should be maintained in the index.<\/p>\n<p style=\"text-align: justify\"><strong>RETRIEVAL EFFECTIVENESS OF AN ONTOLOGY-BASED MODEL FOR CONCEPTUAL INDEXING<\/strong><\/p>\n<p style=\"text-align: justify\">Traditional approaches in information retrieval employ keyword-based techniques to look for relevant data. However, keyword-based searching is not always sufficient for retrieving the most relevant data, essentially because documents may convey the desired semantic information even if they do not contain the exact keywords of the query. The work reviewed here introduces a concept-based retrieval model that tackles vocabulary mismatches through the use of domain-dependent ontologies. One way to alleviate the problem of retrieving relevant documents indexed with terms that are superficially distinct from, but semantically equivalent to, the query terms is to index documents according to their meaning rather than their keywords (Woods, 1999). Indexing documents based on their semantics rather than their morphological content is known as <em>conceptual indexing<\/em>. The work proposes a concept-based model for the index structure, which uses domain-dependent topical ontologies. With regard to converting words to meanings, the key element is to identify the concepts that characterize the thematic content of both documents and user-issued queries. It describes a mechanism for the automatic identification of documents\u2019 thematic terms, which are employed by the conceptual indexing module to represent the document\u2019s semantics at the index level. 
A critical feature of this technique is a disambiguation formula, which ensures that all of a document\u2019s thematic keywords represent concepts of a single thematic category. The work also proposes an automatic query semantics detection formula that helps the engine\u2019s search mechanisms retrieve documents that correlate highly with the users\u2019 search intentions.<\/p>\n<p style=\"text-align: justify\">Ontologies define sets of representational terms, referred to as concepts, which are interrelated to describe a target world; the links between them are labeled to denote the type of relation that holds among the concepts. There are two predominant approaches to building an ontology: domain-dependent and generic. Here a domain-dependent ontology is used for popular web topics. To that end, the top-level categories of a popular Web directory, namely Dmoz, were picked, and for each of these topics a small ontology of concepts referring to the topic at hand was developed.<img loading=\"lazy\" decoding=\"async\" class=\"alignnone size-full wp-image-1887\" src=\"http:\/\/www.gyanvihar.org\/journals\/wp-content\/uploads\/2018\/12\/51-1.png\" alt=\"\" width=\"410\" height=\"272\" \/><\/p>\n<p style=\"text-align: justify\"><strong>Finding Document Thematic Words<\/strong><\/p>\n<p style=\"text-align: justify\">This section describes how the domain ontologies are explored to convert the words found in a document into meanings. The key issue in converting words into meanings is to identify the appropriate concepts within a document that describe the document\u2019s thematic content. The computational model adopted here for finding the thematic words in a document relies on the lexical chaining technique (Barzilay and Elhadad, 1997), which represents the content of a document as a sequence of semantically related terms. When a document\u2019s term matches more than one concept in the ontology, a disambiguation formula is applied to ensure that the term contributes its correct meaning to the generated chain. 
Concepts that are strongly associated with each other are given high similarity scores based on their minimal distance from each other in the ontology\u2019s graph, and their own correlation scores based on the number of common concepts that subsume them in the ontology. Formally, to determine the appropriate sense of a term matching multiple ontology nodes, the method relies on the matching nodes\u2019 correlation in the ontology graph, where the correlation score of a word pair <em>w<\/em><sub>1<\/sub> and <em>w<\/em><sub>2<\/sub> is determined as the product of the words\u2019 depth score (<em>DepthScore<\/em>) (Song et al., 2004) and their conceptual similarity (<em>Sim<\/em>) (Resnik, 1995) in the enriched Greek WordNet. The <em>DepthScore<\/em> of a word pair (<em>w<\/em><sub>1<\/sub>, <em>w<\/em><sub>2<\/sub>) is defined as:<\/p>\n<p style=\"text-align: justify\">DepthScore(w<sub>1<\/sub>, w<sub>2<\/sub>) = Depth(w<sub>1<\/sub>)<sup>2<\/sup> \u00b7 Depth(w<sub>2<\/sub>)<sup>2<\/sup><\/p>\n<p style=\"text-align: justify\">and the words\u2019 similarity Sim(<em>w<\/em><sub>1<\/sub>, <em>w<\/em><sub>2<\/sub>) is determined by the set of WordNet concepts that subsume both <em>w<\/em><sub>1<\/sub> and <em>w<\/em><sub>2<\/sub> in any sense of either word, each with a probability Pr:<\/p>\n<p style=\"text-align: justify\">Sim(w<sub>1<\/sub>, w<sub>2<\/sub>) = max<sub>c \u2208 subsumers(w1, w2)<\/sub> [ \u2212log Pr(c) ]<\/p>\n<p style=\"text-align: justify\">Finally, the above metrics are combined to compute the correlation score between <em>w<\/em><sub>1<\/sub> and <em>w<\/em><sub>2<\/sub>, which is formally defined as:<\/p>\n<p style=\"text-align: justify\">Correlation(w<sub>1<\/sub>, w<sub>2<\/sub>) = DepthScore(w<sub>1<\/sub>, w<sub>2<\/sub>) \u00b7 Sim(w<sub>1<\/sub>, w<sub>2<\/sub>)<\/p>\n<p style=\"text-align: justify\">Using the above formulas, document terms are disambiguated and participate in the document\u2019s lexical chain with the sense that has the highest correlation score with the senses of the other terms in that chain. Every chain element is associated with a particular meaning borrowed from the ontology. 
In this way, the document\u2019s content is reduced to a sequence of semantically related terms.<\/p>\n<p style=\"text-align: justify\"><strong>Ontology-Based Document Indexing<\/strong><\/p>\n<p style=\"text-align: justify\">Intuitively, conceptual indexing is enabled through the organization of the collected documents into structures of thematic clusters. In this ontology-based model, a thematic cluster corresponds to the root node concept (i.e. the topic) of a domain ontology, and the structure of the cluster is represented by the ontology\u2019s hierarchies, whose elements are specialized concepts of the root node concept. Based on the above, one way to address the conceptual indexing challenge is to view it from a hierarchical classification perspective. In other words, if every collected page is assigned an appropriate ontology topic, then the elements of the page\u2019s lexical chain that match the nodes of the topic\u2019s hierarchies can be used as indexing keywords representing the page\u2019s semantic content. To pick an appropriate category from the ontology\u2019s topics for representing a page\u2019s semantic content, one essentially needs to identify the topical category (or categories) of the page\u2019s chain elements. To decide upon the topical category of a page\u2019s thematic words, the ontology nodes that match the elements in the page\u2019s chain are used to compute a score based on the number of words from the page\u2019s chain that are associated with the ontology\u2019s topics (i.e. top-level concepts). The calculation of that score, which represents the correlation between the page\u2019s thematic words and the ontology\u2019s topics, is formally defined as follows. 
Topic-Correlation score (Tscore): The Tscore of a lexical chain LC<sub>i<\/sub> for a particular ontology topic T<sub>i<\/sub> is the number of elements of LC<sub>i<\/sub> matched with subordinate concepts in T<sub>i<\/sub>, divided by the total number of elements in LC<sub>i<\/sub>:<\/p>\n<p style=\"text-align: justify\">Tscore = (# of elements of LC<sub>i<\/sub> matched) \/ (# of elements in LC<sub>i<\/sub>)<\/p>\n<p style=\"text-align: justify\">Based on the above formula, the topic that has the highest correlation with the elements in a page\u2019s lexical chain is picked to represent the semantic content of that page. Following the process described above, every page can be assigned an appropriate ontology topic, and the page\u2019s lexical chain elements that match the given topic hierarchies can then be used as the indexing keywords for the page.<\/p>\n<p style=\"text-align: justify\"><strong>III CONCLUSION<\/strong><\/p>\n<p style=\"text-align: justify\">Depending on the algorithm used, an indexing algorithm provides a definite index to an information retrieval system. A typical IR system should use indexing techniques based on the specific needs of its users. After an exhaustive analysis of indexing algorithms, it is concluded that existing techniques have limitations, particularly in terms of response time, accuracy of results, importance of the results and relevance of results. An efficient indexing algorithm should meet these challenges efficiently while remaining compatible with global standards.<\/p>\n<p style=\"text-align: justify\"><strong>IV REFERENCES<\/strong><\/p>\n<p style=\"text-align: justify\">[1] \u201cIndex based Information Retrieval System\u201d, Ambesh Negi, Mayur Bhirud, Dr. Suresh Jain, Mr. 
Amit Mittal. PG Scholar, IET DAVV, Indore; Director, KCBTA, Indore; Assistant Professor, IET, Indore. International Journal of Modern Engineering Research (IJMER), www.ijmer.com, Vol. 2, Issue 3, May-June 2012, pp. 945-948, ISSN: 2249-6645.<\/p>\n<p style=\"text-align: justify\">[2] \u201cTerm Proximity Scoring for Keyword-Based Retrieval Systems\u201d, Yves Rasolofo and Jacques Savoy, Universit\u00e9 de Neuch\u00e2tel, Neuch\u00e2tel, Switzerland. Published in Lecture Notes in Computer Science 2633, 1611-3349, 2003.<\/p>\n<p style=\"text-align: justify\">[3] \u201cOntology-based Information Retrieval\u201d, Jan Paralic and Ivan Kostial, Department of Cybernetics and AI, Technical University of Kosice, Letna 9, 04011 Kosice, Slovakia.<\/p>\n<p style=\"text-align: justify\">[4] \u201cAttention-Based Information Retrieval\u201d, Georg Buscher, German Research Center for Artificial Intelligence (DFKI).<\/p>\n<p style=\"text-align: justify\">[5] \u201cRetrieval Effectiveness of an Ontology Based Model for Conceptual Indexing\u201d, Sofia Stamou, Computer Engineering and Informatics Department, Patras University, 26500 Greece.<\/p>\n","protected":false},"excerpt":{"rendered":"<p>Deepika Sharma, Sandeepkumar Poonia, Jagannath University, Jaipur, Rajasthan, India. Abstract: For thousands of years, humans have stored information in different ways. From the Stone Age to today\u2019s computer era, there has been a need both to store information and to retrieve it later for use. 
Thus along with the storage there is a new requirement arises to retrieve the information also, [&hellip;]<\/p>\n","protected":false},"author":5,"featured_media":0,"comment_status":"open","ping_status":"open","sticky":false,"template":"","format":"standard","meta":{"footnotes":""},"categories":[26,30],"tags":[],"class_list":["post-1870","post","type-post","status-publish","format-standard","hentry","category-international-journal-of-converging-technologies-management","category-volume-2-issue-2-2016"]}
journal","isPartOf":{"@id":"https:\/\/www.gyanvihar.org\/journals\/#website"},"primaryImageOfPage":{"@id":"https:\/\/www.gyanvihar.org\/journals\/an-analysis-of-index-based-information-retrieval-algorithms\/#primaryimage"},"image":{"@id":"https:\/\/www.gyanvihar.org\/journals\/an-analysis-of-index-based-information-retrieval-algorithms\/#primaryimage"},"thumbnailUrl":"http:\/\/www.gyanvihar.org\/journals\/wp-content\/uploads\/2018\/12\/45.png","datePublished":"2018-12-05T09:47:55+00:00","dateModified":"2019-08-13T10:39:15+00:00","author":{"@id":"https:\/\/www.gyanvihar.org\/journals\/#\/schema\/person\/0fee890b071b4083d5422be043bb99e9"},"breadcrumb":{"@id":"https:\/\/www.gyanvihar.org\/journals\/an-analysis-of-index-based-information-retrieval-algorithms\/#breadcrumb"},"inLanguage":"en-US","potentialAction":[{"@type":"ReadAction","target":["https:\/\/www.gyanvihar.org\/journals\/an-analysis-of-index-based-information-retrieval-algorithms\/"]}]},{"@type":"ImageObject","inLanguage":"en-US","@id":"https:\/\/www.gyanvihar.org\/journals\/an-analysis-of-index-based-information-retrieval-algorithms\/#primaryimage","url":"https:\/\/www.gyanvihar.org\/journals\/uploads\/2018\/12\/45.png","contentUrl":"https:\/\/www.gyanvihar.org\/journals\/uploads\/2018\/12\/45.png","width":433,"height":306},{"@type":"BreadcrumbList","@id":"https:\/\/www.gyanvihar.org\/journals\/an-analysis-of-index-based-information-retrieval-algorithms\/#breadcrumb","itemListElement":[{"@type":"ListItem","position":1,"name":"Home","item":"https:\/\/www.gyanvihar.org\/journals\/"},{"@type":"ListItem","position":2,"name":"An Analysis of Index Based Information Retrieval Algorithms"}]},{"@type":"WebSite","@id":"https:\/\/www.gyanvihar.org\/journals\/#website","url":"https:\/\/www.gyanvihar.org\/journals\/","name":"research journal","description":"Research 
Journal","potentialAction":[{"@type":"SearchAction","target":{"@type":"EntryPoint","urlTemplate":"https:\/\/www.gyanvihar.org\/journals\/?s={search_term_string}"},"query-input":{"@type":"PropertyValueSpecification","valueRequired":true,"valueName":"search_term_string"}}],"inLanguage":"en-US"},{"@type":"Person","@id":"https:\/\/www.gyanvihar.org\/journals\/#\/schema\/person\/0fee890b071b4083d5422be043bb99e9","name":"gyanvihar3","image":{"@type":"ImageObject","inLanguage":"en-US","@id":"https:\/\/www.gyanvihar.org\/journals\/#\/schema\/person\/image\/","url":"https:\/\/secure.gravatar.com\/avatar\/ea51e7e852346f1b6b7715e7b9b893df?s=96&d=mm&r=g","contentUrl":"https:\/\/secure.gravatar.com\/avatar\/ea51e7e852346f1b6b7715e7b9b893df?s=96&d=mm&r=g","caption":"gyanvihar3"},"url":"https:\/\/www.gyanvihar.org\/journals\/author\/gyanvihar3\/"}]}},"_links":{"self":[{"href":"https:\/\/www.gyanvihar.org\/journals\/wp-json\/wp\/v2\/posts\/1870","targetHints":{"allow":["GET"]}}],"collection":[{"href":"https:\/\/www.gyanvihar.org\/journals\/wp-json\/wp\/v2\/posts"}],"about":[{"href":"https:\/\/www.gyanvihar.org\/journals\/wp-json\/wp\/v2\/types\/post"}],"author":[{"embeddable":true,"href":"https:\/\/www.gyanvihar.org\/journals\/wp-json\/wp\/v2\/users\/5"}],"replies":[{"embeddable":true,"href":"https:\/\/www.gyanvihar.org\/journals\/wp-json\/wp\/v2\/comments?post=1870"}],"version-history":[{"count":5,"href":"https:\/\/www.gyanvihar.org\/journals\/wp-json\/wp\/v2\/posts\/1870\/revisions"}],"predecessor-version":[{"id":3038,"href":"https:\/\/www.gyanvihar.org\/journals\/wp-json\/wp\/v2\/posts\/1870\/revisions\/3038"}],"wp:attachment":[{"href":"https:\/\/www.gyanvihar.org\/journals\/wp-json\/wp\/v2\/media?parent=1870"}],"wp:term":[{"taxonomy":"category","embeddable":true,"href":"https:\/\/www.gyanvihar.org\/journals\/wp-json\/wp\/v2\/categories?post=1870"},{"taxonomy":"post_tag","embeddable":true,"href":"https:\/\/www.gyanvihar.org\/journals\/wp-json\/wp\/v2\/tags?post=1870"}],"curies":
[{"name":"wp","href":"https:\/\/api.w.org\/{rel}","templated":true}]}}