Autosoft Journal

Online Manuscript Access


Evaluation of Search Engine Weight by Considering Repeated Web Page Contents


Authors



Abstract

The ranking of search results largely determines the quality of service (QoS) of a meta-search engine (MSE). To address the demand of big data applications, this paper proposes a new method considering factors such as network bandwidth, client and limit server resources. In this method, Web pages with the same contents (but with different URLs) are identified by calculating similarity among contents of pages traversed by the user and those of pages not yet traversed. Hence, deviation of statistics about the user2019s intent for traversing caused by factors such as ranking differences in the orders of traversing and repeated contents of Web pages can be eliminated. While a search service is being provided, each component search engine (CSE) weight can be given dynamically before returned results receive a second rotary ranking in combination with initial ranking information. Experimental results and statistics show that (1) the numbers of traversals and downloads can be decreased; (2) the ratio of the number of pages clicked by the user to that of pages navigated can also be decreased; (3) the matching degree between searches/traversals and returned results can be increased; and (4) the stability of a search engine can be improved by taking into account the factor of repeated contents of Web pages.


Keywords


Pages

Total Pages: 9
Pages: 589-597

DOI
10.1080/10798587.2017.1316083


Manuscript ViewPdf Subscription required to access this document

Obtain access this manuscript in one of the following ways


Already subscribed?

Need information on obtaining a subscription? Personal and institutional subscriptions are available.

Already an author? Have access via email address?


Published

Volume: 23
Issue: 4
Year: 2017

Cite this document


References

Amento B. Proceedings of the ACM SIGIR

ACM SIGIR Forum 36.2 (2002): n. pag. Crossref. Web. https://doi.org/10.1145/792550

Bun, Khoo Khyou, and Mitsuru Ishizuka. "Emerging Topic Tracking System." Lecture Notes in Computer Science (2001): 125-130. Crossref. Web. https://doi.org/10.1007/3-540-45490-X_13

Cao L. Application Research of Computers

Cetintas, Suleyman, and Luo Si. "Exploration of the Tradeoff Between Effectiveness and Efficiency for Results Merging in Federated Search." Proceedings of the 30th annual international ACM SIGIR conference on Research and development in information retrieval - SIGIR ”07 (2007): n. pag. Crossref. Web. https://doi.org/10.1145/1277741.1277869

Chapelle O. The Journal of Machine Learning Research

Costa, Rogério Luís de Carvalho, and Pedro Furtado. "Quality of Experience in Distributed Databases." Distributed and Parallel Databases 29.5-6 (2011): 361-396. Crossref. Web. https://doi.org/10.1007/s10619-011-7083-x

Fetterly D. Journal of Web Engineering

Hassan, Ahmed, Rosie Jones, and Kristina Lisa Klinkner. "Beyond DCG." Proceedings of the third ACM international conference on Web search and data mining - WSDM ”10 (2010): n. pag. Crossref. Web. https://doi.org/10.1145/1718487.1718515

Henzinger, M.R. "Hyperlink Analysis for the Web." IEEE Internet Computing 5.1 (2001): 45-50. Crossref. Web. https://doi.org/10.1109/4236.895141

Howe A. E. Ai Magazine

ACM SIGIR Forum 32.1 (1998): n. pag. Crossref. Web. https://doi.org/10.1145/281250

Joachims, Thorsten. "Optimizing Search Engines Using Clickthrough Data." Proceedings of the eighth ACM SIGKDD international conference on Knowledge discovery and data mining - KDD ”02 (2002): n. pag. Crossref. Web. https://doi.org/10.1145/775047.775067

Kammerer, Yvonne, and Peter Gerjets. "How the Interface Design Influences Users” Spontaneous Trustworthiness Evaluations of Web Search Results." Proceedings of the 2010 Symposium on Eye-Tracking Research & Applications - ETRA ”10 (2010): n. pag. Crossref. Web. https://doi.org/10.1145/1743666.1743736

Lawrence, Steve, and C. Lee Giles. "Inquirus, the NECI Meta Search Engine." Computer Networks and ISDN Systems 30.1-7 (1998): 95-105. Crossref. Web. https://doi.org/10.1016/S0169-7552(98)00095-6

Losada, David E. "Statistical Query Expansion for Sentence Retrieval and Its Effects on Weak and Strong Queries." Information Retrieval 13.5 (2010): 485-506. Crossref. Web. https://doi.org/10.1007/s10791-009-9122-z

Paltoglou, Georgios, Michail Salampasis, and Maria Satratzemi. "Hybrid Results Merging." Proceedings of the sixteenth ACM conference on Conference on information and knowledge management - CIKM ”07 (2007): n. pag. Crossref. Web. https://doi.org/10.1145/1321440.1321487

Salton G. Readings in information retrieval

Wilson M. L. Science 2.1 (2010)

"Journal of the American Society for Information Science and Technology." n. pag. Crossref. Web. https://doi.org/10.1002/(ISSN)1532-2890

Wu, Shengli, and Sally McClean. "Performance Prediction of Data Fusion for Information Retrieval." Information Processing & Management 42.4 (2006): 899-915. Crossref. Web. https://doi.org/10.1016/j.ipm.2005.08.004

Wu, Shengli, and Sally McClean. "Result Merging Methods in Distributed Information Retrieval with Overlapping Databases." Information Retrieval 10.3 (2007): 297-319. Crossref. Web. https://doi.org/10.1007/s10791-007-9023-y

Xu, Jinxi, and W. Bruce Croft. "Query Expansion Using Local and Global Document Analysis." Proceedings of the 19th annual international ACM SIGIR conference on Research and development in information retrieval - SIGIR ”96 (1996): n. pag. Crossref. Web. https://doi.org/10.1145/243199.243202

Xue, Gui-Rong et al. "Optimizing Web Search Using Web Click-through Data." Proceedings of the Thirteenth ACM conference on Information and knowledge management - CIKM ”04 (2004): n. pag. Crossref. Web. https://doi.org/10.1145/1031171.1031192

JOURNAL INFORMATION


ISSN PRINT: 1079-8587
ISSN ONLINE: 2326-005X
DOI PREFIX: 10.31209
10.1080/10798587 with T&F
IMPACT FACTOR: 0.652 (2017/2018)
Journal: 1995-Present




CONTACT INFORMATION


TSI Press
18015 Bullis Hill
San Antonio, TX 78258 USA
PH: 210 479 1022
FAX: 210 479 1048
EMAIL: tsiepress@gmail.com
WEB: http://www.wacong.org/tsi/