Khemeia

Stelae's Khemeia is a high-productivity tool for extracting the intelligence from documents.

What appears as flat or seemingly unstructured data, is normally for Khemeia a mine of intelligence – intelligence in terms of document structure, style information and metadata.

Used when content needs to be digitized, re-purposed or transformed in order to enhance the value of your content, Khemeia analyses this apparently unstructured content, extracts the intelligence from it – generating full metadata – and converts it into valid XML, HTML or PDF, to be published electronically.

Yes, finally there is a product, Khemeia, that redefines the mechanics of this whole process of converting and adding value to content, and offers new opportunities in terms of transforming content.

The boundaries of what constitutes data viable for transformation, have now suddenly been extended with our unique technology.

Whether for first time digitization or extracting structure and metadata from existing content, Khemeia is the ideal tool of this job.

Not to be confused with

Khemeia should not be confused with available plug-ins for Word for Acrobat or even semantic matching solutions products. It is a product that is in a class of its own offering revolutionary technology for extracting information from flat documents.

What is Khemeia used for?

This digitization and transformation of content – which is often either done by hand by outsourcing companies, or in many cases not even carried out at all due to the prohibitive costs involved - is exactly what the software automates.

Applications

Applications transforming and processing your digital content include:

  • Converting your legacy content into XML
  • Generating the metadata which describes and defines such content
  • Providing indexed and searchable content giving the end user a much richer experience from the data
  • Producing structured and styled information which can be output as PDF
  • Generating fully styled HTML (i.e. CSS based) to deliver content via the web
  • Providing XML output for metadata harvesters
  • Re-purposing legacy XML that needs to parsed/validated against a new or standardized DTD/XML schema

Formats

Input formats

Khemeia is one solution for multiple input formats of document which include:

  • PDF, RTF, HTML, Word, text, QuarkXPress, Adobe InDesign, OCR formats, XML, SGML, etc.

Multiple output formats

  • XML, PDF, HTML, XMP, XBRL, iXBRL, S1000D, NewsML, NITF, EPUB, other e-book formats, etc.
  • MySQL, SQL Server, Oracle, CMS systems (e.g. Documentum), etc.