The Assessment of the Effect of Query Expansion on Improving the Performance of Scientific Texts Retrieval in Persian

Document Type : Original Research Manuscripts

Authors

1 Ph.D. student in Information Science and Knowledge, Faculty of Management, University of Tehran, Tehran, Iran

2 Assistant Professor, Department in Information Science and Knowledge, Faculty of Management, University of Tehran, Tehran, Iran.

3 Graduate of Industrial Engineering, Saveh Azad University

Abstract
Purpose: This study aims to determine the effect of query expansion on scientific texts retrieval in Persian.
Method: The present study was conducted using a quasi-experimental method. The results are obtained by analyzing 40 initial and expanded queries of postgraduate students in the Faculty of Management, University of Tehran. Query expansion was performed manually using primary research results.
Findings: Query expansion of Persian scientific texts leads to an increase in the number of related retrieved documents, as well as the comprehensiveness and accuracy of retrieving scientific data in Elmnet search engine, which as a result, improves the overall performance of information retrieval.
Results: Nowadays, automatic query expansion is on the agenda of databases. Given that Persian databases are not fully developed, and the existence of specific problems with writing in the Persian language, information literacy training and the method of defining and expressing information requirements and providing them to the information retrieval systems, can have a significant impact on postgraduate students and researchers, to retrieve the required information and save them time and money.

Keywords

Subjects

Abdelmgeid Amin, A. (2008). Using a query expansion technique to improve document retrieval. Information Technologies and Knowledge, 7(2), 343-345.
Amolochitis, E., Christou, I. T., Tan, Z. H., & Prasad, R. (2013). A heuristic hierarchical scheme for academic search and retrieval. Information Processing & Management, 49(6), 1326-1343.
Azad, H. K., & Deepak, A. (2019). Query expansion techniques for information retrieval: a survey. Information Processing & Management, 56(5), 1698-1735. arXiv preprint arXiv:1708.00247.
Baeza-Yates, R., & Ribeiro-Neto, B. (1999). Modern information retrieval: Addison-Wesley Harlow, England.
Bendersky, M., & Croft, W. B. (2008, July). Discovering key concepts in verbose queries. In Proceedings of the 31st annual international ACM SIGIR conference on Research and development in information retrieval (pp. 491-498). ACM
Efthimiadis, E. N. (1996). Query Expansion. Annual review of information science and technology (ARIST), 31, 121-87.
Ellis, D. and Haugan, M. (1997). Modeling the Information Seeking Patterns of Engineers and Research Scientists in an Industrial Environment. Journal of Documentation, 53(4), pp. 38-403.
Ellis, D. (1989). A Behavioral Model for Information Retrieval System Design. Journal of Information Science 45(3), pp. 171-212.
            https://doi.org/10.1108/eb026843
Lavrenko, V., & Croft, W. B. (2017, August). Relevance-based language models. In ACM SIGIR Forum (Vol. 51, No. 2, pp. 260-267). ACM.
https://doi.org/10.1145/3130348.3130376
Lee, K. S., Croft, W. B., & Allan, J. (2008, July). A cluster-based resampling method for pseudo-relevance feedback. In Proceedings of the 31st annual international ACM SIGIR conference on Research and development in information retrieval (pp. 235-242). ACM.
Li, X., Schijvenaars, B. J., & de Rijke, M. (2017). Investigating queries and search failures in academic search. Information processing & management, 53(3), 666-683.
Li, X., & de Rijke, M. (2019). Characterizing and predicting downloads in academic search. Information Processing & Management, 56(3), 394-407. https://doi.org/10.1016/j.ipm.2018.10.019.
Makri, Blandford and Cox (2008). Investigating the Information-Seeking Behavior of Academic Lawyers: From Ellis's Model to Design. Information Processing & Management 44(2), pp. 613-634.
Mehdi, M., Okoli, C., Mesgari, M., Nielsen, F. Å., & Lanamäki, A. (2017). Excavating the mother lode of human-generated text: A systematic review of research that uses the wikipedia corpus. Information Processing & Management, 53(2), 505-529.
Robertson, S. E., & Jones, K. S. (1976). Relevance weighting of search terms. Journal of the American Society for Information science, 27(3),129-146.
https://doi.org/10.1002/asi.4630270302
Russell-Rose, T., Gooch, P., & Kruschwitz, U. (2021). Interactive query expansion for professional search applications. Business Information Review,
Sharma, D. K., Pamula, R., & Chauhan, D. S. (2021). Semantic approaches for query expansion. Evolutionary Intelligence, 1-16.
Wollersheim, D. (2005). Dynamic query expansion for information retrieval of imprecise medical queries (Doctoral dissertation, La Trobe University).
Zhang, H. (2013). Query enhancement with topic detection and disambiguation for robust retrieval (Doctoral dissertation, Indiana University).
 
Volume 1, Issue 1
Autumn 2021
Pages 38-51

  • Receive Date 24 September 2021
  • Revise Date 27 September 2021
  • Accept Date 27 September 2021