Skip to main content
Log in

Efficient execution of web navigation sequences

  • Published:
World Wide Web Aims and scope Submit manuscript

Abstract

Web automation applications are widely used for different purposes such as B2B integration and automated testing of web applications. Most current systems build the automatic web navigation component by using the APIs of conventional browsers. While this approach has its advantages, it suffers performance problems for intensive web automation tasks which require real time responses and/or a high degree of parallelism. In this paper, we outline a set of techniques to build a web navigation component able to efficiently execute web navigation sequences. These techniques detect what elements and scripts of the pages accessed during the navigation sequence are needed for the correct execution of the sequence (and, therefore, must be loaded and executed), and what parts of the pages can be discarded. The tests executed with real web sources show that the optimized navigation sequences run significantly faster and consume significantly less resources.

This is a preview of subscription content, log in via an institution to check access.

Access this article

Subscribe and save

Springer+ Basic
€32.70 /Month
  • Get 10 units per month
  • Download Article/Chapter or eBook
  • 1 Unit = 1 Article or 1 Chapter
  • Cancel anytime
Subscribe now

Buy Now

Price includes VAT (Bulgaria)

Instant access to the full article PDF.

Similar content being viewed by others

References

  1. Alexa: The Web Information Company http://www.alexa.com

  2. Anupam, V., Freire, J., Kumar, B., Lieuwen, D.: Automating web navigation with the WebVCR. Comput. Netw. 33(1–6), 503–517 (2000)

    Article  Google Scholar 

  3. Davulcu H., Yang G., Kifer M., Ramakrishnan I.V.: Computational Aspects of Resilient Data Extraction from Semistructured Sources, ACM Symposium on Principles of Database Systems (PODS), pp. 136–144 (2000)

  4. Document Object Model (DOM): http://www.w3.org/DOM/

  5. HtmlUnit: http://htmlunit.sourceforge.net/

  6. Hupp D., Miller R.C.: Smart Bookmarks: automatic retroactive macro recording on the web. In: Proceedings of the 20th Annual ACM Symposium on User Interface Software and Technology, pp. 81–90. ACM New York, Newport (2007)

  7. iOpus: http://www.iopus.com

  8. Kapow: http://www.openkapow.com

  9. Lingam S., Elbaum S.: Supporting End-Users in the Creation of Dependable Web Clips. WWW 2007, 953–962

  10. Losada J., Raposo J., Pan A., Montoto P.: Efficient execution of web navigation sequences. In: The 14th International Conference on Web Information System Engineering (WISE), pp. 340–353 (2012)

  11. Montoto, P., Pan, A., Raposo, J., Bellas, F., López, J.: Automated browsing in AJAX websites. Data Knowl. Eng. 70(3), 269–283 (2011)

    Article  Google Scholar 

  12. Pan A., Raposo J., Álvarez M., Hidalgo J., Viña A.: Semi automatic wrapper-generation for commercial web sources. In: IFIP WG8.1 Working Conference on Engineering Information Systems in the Internet Context, pp. 265–283. Kluwer, B.V. Deventer, Japan (2002)

  13. QEngine: http://www.adventnet.com/products/qengine/index.html

  14. Safonov A., Konstan J., Carlis J.: Beyond hard-to-reach pages: interactive, parametric web macros. In: 7th Conference on Human Factors & the Web. Madison 2001

  15. Sahi: http://sahi.co.in/w/

  16. Selenium: http://seleniumhq.org/

  17. XML Path Language (XPath): http://www.w3.org/TR/xpath

Download references

Author information

Authors and Affiliations

Authors

Corresponding author

Correspondence to José Losada.

Rights and permissions

Reprints and permissions

About this article

Cite this article

Losada, J., Raposo, J., Pan, A. et al. Efficient execution of web navigation sequences. World Wide Web 17, 921–947 (2014). https://doi.org/10.1007/s11280-013-0259-8

Download citation

  • Received:

  • Revised:

  • Accepted:

  • Published:

  • Issue Date:

  • DOI: https://doi.org/10.1007/s11280-013-0259-8

Keywords

Navigation