Data Formats
- Multimedia Data
- storing and searching imagery, video, and audio (increasing levels of support: text-only --> ignoring embedded multimedia --> searching on textual meta-info --> searching on multimedia content)
- Native Formats
- processing documents in their native formats, such as MS Word (the search engine may convert a document, but retains a link to the original)
- SGML
- standard notation for representing document structure and content (many engines process SGML only to a limited extent)
- Foreign Languages