Querying XML document collections: a user interface

Oreste Signore - Marco Andreini - Cristian Lucchesi - Silvia Martelli

W3C Italian Office at the C.N.R.
Area della Ricerca di Pisa San Cataldo - Via G. Moruzzi, 1 - 56124 Pisa
Email: oreste@w3.org

XML for Cultural Heritage
Experiences and prospects for the processing of structured and semi-structured data
Scuola Normale Superiore
Pisa, 25 March 2004

Talk layout

This work has been financed by the project QUESTION-HOW (Quality Engineering Solutions via Tools, Information and Outreach for the New Highly-enriched Offerings from W3C: Evolving the Web in Europe), contract IST-2000-28767.

User needs in accessing XML data collections

XML has no semantics per se ...

... hence user needs:

Main goals

Architecture: a rough sketch

[Figure: a rough sketch of the architecture]

Architecture: fully web-based

  • The XML Schema is externally annotated in RDF
    • Metadata are also stored in the system objects
    • RDF descriptions can be imported into the system
    • The system can export the RDF annotation
    • A component allows metadata to be added to the system objects
  • The user can browse the structure and formulate the query
  • For each element/attribute the system knows, and can show to the user:
    • semantics
    • constraints
  • The query is prepared in a general format that can be mapped onto different search engines
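
The last point above, a query held in a general format and mapped onto different search engines, could be sketched roughly as follows. This is only an illustration under stated assumptions, not the system presented in this talk: the names Condition, Query, to_xpath and to_keywords are hypothetical, the catalog data is invented for the example, and the XPath backend relies on the limited XPath subset of Python's standard xml.etree.ElementTree.

```python
from dataclasses import dataclass
import xml.etree.ElementTree as ET


@dataclass
class Condition:
    element: str  # name of a child element of the record (hypothetical model)
    value: str    # exact text the child element must contain


@dataclass
class Query:
    record: str        # name of the element returned as a hit
    conditions: list   # list of Condition objects, all of which must hold


def to_xpath(query):
    """Map the general query onto an XPath expression (one possible backend)."""
    # Values containing a single quote are not handled in this sketch.
    predicates = "".join(f"[{c.element}='{c.value}']" for c in query.conditions)
    return f".//{query.record}{predicates}"


def to_keywords(query):
    """Map the same query onto a field:value string (another, invented backend)."""
    return " AND ".join(f"{c.element}:{c.value}" for c in query.conditions)


# Invented sample data, NOT the document collection used in the talk.
SAMPLE = """<catalog>
  <item><title>Vase</title><period>Etruscan</period></item>
  <item><title>Coin</title><period>Roman</period></item>
</catalog>"""


def run(query, xml_text):
    """Evaluate the XPath form of the query and return the titles of the hits."""
    root = ET.fromstring(xml_text)
    return [hit.findtext("title") for hit in root.findall(to_xpath(query))]


if __name__ == "__main__":
    q = Query("item", [Condition("period", "Roman")])
    print(to_xpath(q))
    print(to_keywords(q))
    print(run(q, SAMPLE))
```

Keeping the query as plain data means the user interface only has to build a Query object; each search engine then needs its own small mapping function and nothing else changes.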

Architecture: main features

The RDF annotation

The Administrator must have an in-depth knowledge of the document collection and the related knowledge domain.

The query

Frustrating traditional approaches:

Composition:

Normalization:

The sample document collection ...

... and a (partial) graphical representation of it

[Figure: graphical representation of the sample document structure]

Specifying constraints

The XML document collection

The XCDE search engine

Demo

Conclusion and future plans

Thank you for your attention

?


If it isn't on the Web ...
... it doesn't exist

This presentation will be available on the Office web site (http://www.w3c.it/talks/sns2004/)