Various bioinformatics tools use a great variety of different inhomogeneous formats for reading
data and storing their results. While some formats, like FASTA or CLUSTAL, are common to several
programs, others are utilized by only a single tool. Even worse, prevalent formats often lack
consistency, e.g. there are many different "interpretations" of the FASTA format in use. This can
probably be accredited to the fact that the majority of formats are not well described - a formal
definition is missing.
To adress this problem, the HOBIT project has created
XML replacements for several basic biological
data types and implemented BioDOM. BioDOM is a JAVA library for converting native non-XML output
from various bioinformatic tools to XML formats that can be validated against XML schemas.
Different bioinformatical formats can also be converted online by using the
BioDOM WebService.