RAINBOW homepage

Project description

People

Applications

Downloads and shows

Bibliography

Related projects


logo RAINBOW

Web made by Vojtech Svatek

Complete bibliography

The publications are, for each period, ordered by the name of first author and then by the descending year of publication.

Recent publications (since 2004)

Brand-new additions are marked with in the list.

An04a
Andrt M., Kratky M., Svatek V., Snasel V.: AmphoraWS – webova sluzba pro vyhledavani ve strukturovanych dokumentech. [AmphoraWS – Web service for querying semi-structured data.] In: Datakon'04, Brno 2004. Full paper.
Kr05a
Kratky M., Andrt M., Svatek V.: XML Query Support for Web Information Extraction: A Study on HTML Element Depth Distribution. In: First International Workshop on Representation and Analysis of Web Space (RAWS-05). Full paper.
La05a
Labsky M., Svatek V., Praks P., Svab O.: Information extraction from HTML product catalogues: coupling quantitative and knowledge-based approaches. In: Dagstuhl Seminar on Machine Learning for the Semantic Web, 2005. Full paper.
La05b
Labsky M., Praks P., Svatek V., Svab O.: Multimedia information extraction from HTML product catalogues. In: Workshop on Databases, Texts, Specifications and Objects (DATESO'05), Ostrava 2005. Full paper.
La05c
Labsky M., Vacura M., Praks P.: Web Image Classification for Information Extraction. In: First International Workshop on Representation and Analysis of Web Space (RAWS-05). Full paper.
La05d
Labsky M., Svatek V., Svab O., Praks P., Kratky M., Snasel V.: Information Extraction from HTML Product Catalogues: from Source Code and Images to RDF. In: 2005 IEEE/WIC/ACM International Conference on Web Intelligence (WI'05), IEEE Computer Science, 2005. Full paper.
La04
Labsky M.: Product information extraction from semistructured documents using HMMs. In: Poster papers of Znalosti 2004, Brno, February 2004. Full paper.
La04b
Labsky M., Svatek V.: Information Extraction from Web Product Catalogues. Working paper. Full paper.
La04c
Labsky M., Svatek V., Svab O.: Types and Roles of Ontologies in Web Information Extraction. In: ECML/PKDD04 Workshop on Knowledge Discovery and Ontologies, Pisa. Full paper.
La04d
Labsky M.: Extrakce informaci ze semi-strukturovanych textu pomoci statistickych metod. [Statistical Information Extraction from Semi-structured Texts.] In: Acta Oeconomica Pragensia, 5/2004.
Ne05a
Nemrava J., Svatek V.: Text mining tool for ontology engineering based on use of product taxonomy and web directory. In: Workshop on Databases, Texts, Specifications and Objects (DATESO'05), Ostrava 2005. Full paper.
Ne05b
Nemrava J.: Product taxonomy and web directory as support for ontology engineers. In: ICML Workshop on Learning and Extending Lexical Ontologies by using Machine Learning Methods, Bonn 2005. Full paper.
Sv05b
Svab O., Svatek V.: Proceduralni propojeni nastroju pro extrakci informaci z webovych sidel. [Procedural combination of tools for information extraction from web sites.] In: Poster Papers of Znalosti 2005, High Tatras 2005. Full paper.
Sv04d
Svab O., Labsky M., Svatek V.: RDF-Based Retrieval of Information Extracted from Web Product Catalogues. In: Semantic Web workshop at ACM SIGIR 2004, Sheffield, 2004. Full paper.
Sv04b
Svab O., Svatek V., Kavalec M., Labsky M.: Querying the RDF: Small Case Study in the Bicycle Sale Domain. In: Workshop on Databases, Texts, Specifications and Objects (DATESO'04), also at http://www.ceur-ws.org/Vol-98. Full paper.
Sv06a
Svatek V.: The Rainbow Project: Multiway Analysis of Website Content and Structure. In: Znalosti 2006, Hradec Kralove, February 2006. Full paper.
Sv05a
Svatek V., ten Teije A., Vacura M.: Web Service Composition for Deductive Web Mining: A Knowledge Modelling Approach. In: Znalosti 2005, High Tatras 2005. Full paper.
Sv05c
Svatek V., Vacura M.: Automatic Composition of Web Analysis Tools: Simulation on Classification Templates. In: First International Workshop on Representation and Analysis of Web Space (RAWS-05). Full paper.
Sv05d
Svatek V.: Automated Analysis of the WWW Based on Reusable Resources. Habilitation Thesis, University of Economics, Prague, 2005. Full text.
Sv04e
Svatek V., Snasel V.: Formal Model of Meta-Information Acquisition from Information Resources. In: Workshop on Information Technology - Applications and Theory (ITAT2004), High Tatras 2004. Full paper.
Sv04a
Svatek V., Vavra V.: Semanticka integrace webovych sluzeb. [Semantic integration of web services.] In: Systemova integrace'04, Praha 2004. Full paper.
Sv04c
Svatek V., Labsky M., Vacura M.: Knowledge Modelling for Deductive Web Mining. In: EKAW 2004, Whittlebury Hall, UK, Springer LNCS, to appear. Draft paper (final version available via SpringerLink).
Vo04
Volavka F., Svatek V.: Identifikace navigační struktury webové prezentace na základě topologie odkazů. [Identification of navigation structure of website based on link topology.] In: Znalosti 2004, Brno 2004. Full paper.

Older publications (2001-2003)

Ka02
Kavalec M., Svatek V.: Information Extraction and Ontology Learning Guided by Web Directory. In: ECAI Workshop on NLP and ML for Ontology engineering (OLT-02). Lyon, 2002. Full paper.
Ka01
Kavalec M., Svatek V., Strossa P.: Web Directories as Training Data for Automated Metadata Extraction. In: Semantic Web Mining, Workshop at ECML/PKDD-2001, Freiburg 2001. Full paper.
La03
Labsky M., Svatek V.: Ontology Merging in Context of Web Analysis. In: Workshop on Databases, Texts, Specifications and Objects (DATESO'03), Ostrava 2003. Full paper (ZIP).
St01
Strossa P., Svatek V., Kavalec M.: Towards Intelligent Indexing of Web Pages Using Important Information Indicators. LISP-2001-1 Technical Report, 2001.
Sv01a
Svatek V.: RAINBOW - navrh modularni architektury pro analyzu a zpristupnovani WWW. [RAINBOW - proposal for modular architecture for WWW analysis and information access.] In: Rauch J., Stepankova O. (eds.). Znalosti 2001. Praha 2001, 209-216.
Sv01b
Svatek V., Strossa P., Kavalec M.: Analysis of text on WWW pages using important information indicators. In: (M. Bielikova, ed.) DATAKON, Database Conference, Brno 2001, 359-362.
Sv02a
Svatek V., Kosek J., Braza J., Kavalec M., Klemperer J., Berka P.: Framework and Tools for Multiway Extraction of Web Metadata. In: Information Systems Modelling, Roznov 2002. Full paper.
Sv02b
Svatek V., Kavalec M., Klemperer J.: Towards the Discovery of Implicit Metadata in Commercial Web Pages. In: (Malyankar R., ed.) Collected Posters, ISWC - First International Semantic Web Conference. Sardinia, Italy, June 2002, p.57. Poster summary.
Sv02c
Svatek V., Kosek J., Vacura M.: Ontology Engineering for Multiway Acquisition of Web Metadata. LISP-2002-1 Technical Report, 2002. Full paper.
Sv03a
Svatek V., Berka P., Kavalec M., Kosek J., Vavra V.: Discovering company descriptions on the web by multiway analysis. In: New Trends in Intelligent Information Processing and Web Mining (IIPWM'03), Zakopane 2003. Springer-Verlag, 'Advances in Soft Computing' series, 2003. Full paper.
Sv03b
Svatek V., Vacura M.: Problem-Solving Models of Website Analysis. In: Poster Track of the Twelfth International World Wide Web Conference (WWW2003), Budapest 2003. Extended abstract.
Sv03c
Svatek V., Braza J., Sklenak V.: Towards Triple-Based Information Extraction from Visually-Structured HTML Pages. In: Poster Track of the Twelfth International World Wide Web Conference (WWW2003), Budapest 2003. Extended abstract.
Sv03d
Svatek V., Kosek J., Labsky M., Braza J., Kavalec M., Vacura M., Vavra V., Snasel V.: Rainbow - Multiway Semantic Analysis of Websites. In: 2nd International DEXA Workshop on Web Semantics (WebS03), Prague 2003, IEEE Computer Society Press 2003. Full paper.
Va02
Vacura M.: Multiway Approach to Content Recognition on Internet. LISP-2002-2 Technical Report, 2002. Full paper.
Vo03
Volavka F., Sajal M., Svatek V.: Topology-based discovery of navigation structure within websites. In: Datakon'03, Brno 2003. Full paper.

Very old publications (1999-2000, some related to pre-cursor projects)

Be99a
Berka P., Sochorova M., Svatek V., Sramek D.: The VSEved System for Intelligent WWW Metasearch. In: (Rudas I. J., Madarasz L., eds.:) INES'99 - IEEE Intl. Conf. on Intelligent Engineering Systems, Stara Lesna 1999, 317-321.
Be99b
Berka P., Sochorova M., Svatek V.: Metavyhledavani na WWW s naslednym zpracovanim vysledku. [WWW metasearch with post-processing] In: (Richta, K., ed.:) Datasem'99, Brno 1999.
Ko00
Kosek J., Svatek V.: XML a ontologie jako integracni nastroje pro analyzu a zpristupnovani WWW. [XML and ontologies as integration tools for WWW analysis and information access.] In: (Valenta J., ed.) Datasem'00, Brno 2000.
Sr00
Sramek D., Berka P., Kosek J., Svatek V.: Improving WWW Access - from Single-Purpose Systems to Agent Architectures? In: Cerri S. A., Dochev D. (ed.). Artificial Intelligence: Methodology, Systems, and Application. Berlin : Springer Verlag, 2000, 167-178. Full paper.
Sv00a
Svatek V., Berka P.: URL as starting point for WWW document categorisation. In: (Mariani J., Harman D.:) RIAO'2000 - Content-Based Multimedia Information Access, CID, Paris, 2000, 1693-1702. Full paper.
Sv00b
Svatek, V., and Kavalec, M. Supporting Case Acquisition and Labelling in the Context of Web Mining, in (Zighed D., Komorowski J., Zytkow J.:) Principles of Data Mining and Knowledge Discovery - PKDD2000. Springer, 2000, pp. 626-631. Full paper.