WebMonitoring Software System: Finite State Machines for Monitoring the Web

Vesna Pajić1, Duško Vitas2, Gordana Pavlović Lažetić2 and Miloš Pajić1

  1. University of Belgrade, Faculty of Agriculture, Nemanjina 6
    11080 Zemun, Belgrade, Republic of Serbia
    svesna@agrif.bg.ac.rs, paja@agrif.bg.ac.rs
  2. University of Belgrade, Faculty of Mathematics, Studentski trg 17
    11000 Belgrade, Republic of Serbia
    vitas@matf.bg.ac.rs, gordana@matf.bg.ac.rs

Abstract

This paper presents a software system called WebMonitoring. The system is designed for solving certain problems in the process of information search on the web. The first problem is improving entering of queries at search engines and enabling more complex searches than keyword-based ones. The second problem is providing access to web page content that is inaccessible by common search engines due to search engine’s crawling limitations or time difference between the moment a web page is set up on the Internet and the moment the crawler finds it. The architecture of the WebMonitoring system relies upon finite state machines and the concept of monitoring the web. We present the system’s architecture and usage. Some modules were originally developed for the purpose of the WebMonitoring system, and some rely on UNITEX, linguistically oriented software system. We hereby evaluate the WebMonitoring system and give directions for further development.

Key words

finite state automata, finite state transducers, software, web monitoring, electronic dictionaries, web search

Digital Object Identifier (DOI)

https://doi.org/10.2298/CSIS110918036P

Publication information

Volume 10, Issue 1 (Januar 2013)
Year of Publication: 2013
ISSN: 1820-0214 (Print) 2406-1018 (Online)
Publisher: ComSIS Consortium

Full text

DownloadAvailable in PDF
Portable Document Format

How to cite

Pajić, V., Vitas, D., Lažetić, G. P., Pajić, M.: WebMonitoring Software System: Finite State Machines for Monitoring the Web. Computer Science and Information Systems, Vol. 10, No. 1, 1-23. (2013)