Impact of Introducing Natural Language Processing Techniques In Information Retrieval

Mohammed Mostafa Mohammed Hamail;

Abstract


Since most Information Retrieval models are mathematically based, the question posed here is: is this sufficient for efficient retrieval of Arabic documents or the model still needs elaboration. To answer these questions, it was necessary to choose and test one of these IR models. Being a one vector­ space approach, and a conceptual indexing technique, the Latent Semantic Indexing (LSI) model was chosen. This ts because it overcomes the deficiencies of the other models. It achieved up to 30% better retrieval performance than the other techniques.
This thesis went through two phases. The first was designing and implementing an experimental system based on this model. The second was measuring the retrieval performance of this system applied to the Arabic language, trying to improve its performance. •This improvement of the performance involved determining the problems faced and trying to handle them using the computational linguistics techniques.
An experimental IR system (ARS) was designed and implemented based

on the LSI model. It was the first time to apply the LSI retrieval model to Arabic. In order to measure the impact of adding linguistic techniques to the LSI model, three experiments were conducted. The Indexing size was calculated and the retrieval performance was measured using precision, recall and Van Rijsbergen combined measure.
The first experiment was the core-system (i.e. LSI model only without any linguistic features). In this experiment, the size of indexing was a total of
7.69 MB of the disk space. The retrieval performance resulted in a high precision but a low recall. This means that, only small numbers of relevant documents were retrieved. Two problems aroused which are inflection and synonymy. The system achieved poor retrieval results concerning these two problems. Regarding the query-length, the retrieval performance of the system degraded gracefully as the query length increased.


Other data

Title Impact of Introducing Natural Language Processing Techniques In Information Retrieval
Other Titles تأثير إدخال أساليب المعالجة الالية للغات الطبيعية فى استرجاع المعلومات
Authors Mohammed Mostafa Mohammed Hamail
Issue Date 2002

Attached Files

File SizeFormat
B11750.pdf997.14 kBAdobe PDFView/Open
Recommend this item

Similar Items from Core Recommender Database

Google ScholarTM

Check

views 2 in Shams Scholar


Items in Ain Shams Scholar are protected by copyright, with all rights reserved, unless otherwise indicated.