PARTITION-BASED PATTERN MATCHING APPROACH FOR EFFICIENT RETRIEVAL OF ARABIC TEXT

Saqib Hakak; Amirrudin Kamsin; Palaiahnakote Shivakumara; Mohd. Yamani Idna Idris

doi:10.22452/mjcs.vol31no3.3

Authors

Saqib Hakak Faculty of Computer Science and Information Technology, University Malaya, Kuala Lumpur, Malaysia
Amirrudin Kamsin Faculty of Computer Science and Information Technology, University Malaya, Kuala Lumpur, Malaysia
Palaiahnakote Shivakumara Faculty of Computer Science and Information Technology, University Malaya, Kuala Lumpur, Malaysia
Mohd. Yamani Idna Idris Faculty of Computer Science and Information Technology, University Malaya, Kuala Lumpur, Malaysia

DOI:

https://doi.org/10.22452/mjcs.vol31no3.3

Keywords:

Partition-based pattern matching, exact matching, Arabic texts, short patterns, digital Quran, information retrieval

Abstract

Encoding for Arabic based on the Unicode Transformation Format (UTF) differs from encoding for English based on the American Standard Code for Information Interchange (ASCII) since the Arabic usage of diacritics, symbols and elongated characters makes searching more challenging in the field of information retrieval. In this paper, we propose a new partition-based pattern matching approach that divides the query words into two equal parts (sub-parts). The proposed approach treats the two divided sub-parts as independent query words and uses a parallel search to match the content in the database. In addition, the proposed approach modifies the conventional brute force pattern matching to speed up the searching process which results in efficient text retrieval from any database. The experimental results are used to evaluate the proposed approach in terms of processing time. The comparative analysis of the existing approaches and the proposed approach reveals that the proposed approach outperforms all other existing approaches in terms of computational time.

PARTITION-BASED PATTERN MATCHING APPROACH FOR EFFICIENT RETRIEVAL OF ARABIC TEXT

Authors

DOI:

Keywords:

Abstract

Downloads

Published

Issue

Section

Most read articles by the same author(s)

Editorial Information

Scope

Submission Guidelines

Indexing

Article Publication Charge

Journal Template

Special Issue

In Press Publication

Awards

Information

Conference

Articles

Top Cited Articles

Most View Articles

Publishing Timeline