To search, Click below search items.

 

All Published Papers Search Service

Title

An Efficient Ranking Module for an Arabic Search Engine

Author

Esraa Abd Elraouf, Nagwa Lotfy Badr, Mohamed Fahmy Tolba

Citation

Vol. 10  No. 2  pp. 218-225

Abstract

Searching online provides you with a wealth of information, but not all of it will be useful or of the highest quality. Search engines are distributed programs that dive into the World Wide Web to find relevant information for a given search query. Their fundamental components are: the crawlers, the indexer module, the collection analysis module, the query engine, and the ranking module. Many of today¡¯s search engines use a traditional text process to retrieve pages related to a user¡¯s query. Traditional text processing is done to find all documents using the query terms, or related to the query terms by semantic meaning. With the massive size of the web, this result in thousands of retrieved pages may or may not related to the query. The main function of the ranking module is to sort the search results by relevance or importance using information retrieval (IR) algorithms. There were two kinds of methods in information retrieval, based on content and based on hyper-link. The quantity of computation in systems based on content was very large and the precision in systems based on hyper-link only was not ideal. It was necessary to develop a technique combining the advantages of two systems [3]. Many web users are interested in Arabic web browsing whether the reason is academic or commercial¡¦ suffer to find their search and request over the Arabic search engine etc. As the existing web search engines are designed to perform English web searches. They don¡¯t generate morphological variations of Arabic words but they just match the word as it is. Therefore their results contain only the pages that exactly match the user query. They also don¡¯t consider the different meanings of a word so search results contain unrelated pages to user query. In this research, we focus on implementing an enhanced ranking algorithm by combining both the page content and the Hyper-Link with the focus on Arabic search engines by taking into account the stem and the context of the Arabic word by combining both the count of words related to query in the page and the count of words related to query in outlinks pages of that page to calculate its rank, using external database having the morphological meanings of the most Arabic words. Then sort the pages according its rank. If there is more than one meaning to an input query word in case the user does a query in using only one word, the user may choose the meaning he/she wishes to search for. The search results will largely contain the inflected forms of the word that belong to that meaning. This helps reduce the redundancy that results from morphological search only.

Keywords

Morphological Variations, Information Retrieval, Ranking, Arabic Search Engine

URL

http://paper.ijcsns.org/07_book/201002/20100233.pdf