An archive is a file containing several related files. Many Internet resources, such as freeware, shareware and trail software, are often packaged into archives for easy installation and taking. Additionally, thousands of users search for archives and download them from different sources everyday. In this paper, previous research on archive downloading is extended via proxy cache to support archive searching. Internet proxy cache servers are used to gather a significant number of Web pages, detect those that contain archive links, and then use the obtained data to search archives by description or filename. Two schemes, iterative and backtracking, are proposed to obtain Web pages with archive links. The experimental results indicate that the precision that both of the schemes can achieve is about the same; however, the backtracking scheme reduces the number of checked pages by a factor of 26. Finally, a real system was implemented to demonstrate the proposed approaches.
Article navigation
1 February 2004
Research Article|
February 01 2004
Archive knowledge discovery by proxy cache Available to Purchase
Hsiang‐Fu Yu;
Hsiang‐Fu Yu
Computer Center, at the National Central University, Taiwan, ROC
Search for other works by this author on:
Yi‐Ming Chen;
Yi‐Ming Chen
Department of Information Management, at the National Central University, Taiwan, ROC
Search for other works by this author on:
Li‐Ming Tseng
Li‐Ming Tseng
Distributed System Laboratory, Department of Computer Science & Information Engineering, at the National Central University, Taiwan, ROC
Search for other works by this author on:
Publisher: Emerald Publishing
Online ISSN: 2054-5657
Print ISSN: 1066-2243
© Emerald Group Publishing Limited
2004
Internet Research (2004) 14 (1): 34–47.
Citation
Yu H, Chen Y, Tseng L (2004), "Archive knowledge discovery by proxy cache". Internet Research, Vol. 14 No. 1 pp. 34–47, doi: https://doi.org/10.1108/10662240410516309
Download citation file:
Suggested Reading
E‐profile: Open Archives Initiative Data Providers. Part I: General
Library Hi Tech News (March,2004)
Encoded Archival Description on the Internet
Library Review (June,2003)
Archival Web Sites: A Guide to Creating, Designing, Marketing and Maintaining a Web Site for Archive Services
Records Management Journal (August,2002)
ScotlandsPlaces XML: bespoke XML or XML mapping?
Program (February,2010)
Endpoint study of Internet paths and Web pages transfers
Campus-Wide Information Systems (August,2003)
Related Chapters
Documents on Piero Sraffa at the Archivio Centrale Dello Stato and at the Archivio Storico Diplomatico
Including a Symposium on New Directions in Sraffa Scholarship
Archives and Human Rights: Questioning Notions of Information and Access
Perspectives on Libraries as Institutions of Human Rights and Social Justice
Challenging Colonial Myths With Archival Datasets: Cockatoo Island Prison, 1839–1869
Imperial Crime and Punishment: Approaches from Historical Criminology
Recommended for you
These recommendations are informed by your reading behaviors and indicated interests.
