<?xml version="1.0" encoding="utf-8" ?><rss version="2.0" xml:base="http://technicalfoundations.ukoln.ac.uk/taxonomy/term/115/all" xmlns:dc="http://purl.org/dc/elements/1.1/">
  <channel>
    <title>international digital publishing forum: relevant content on this site</title>
    <link>http://technicalfoundations.ukoln.ac.uk/taxonomy/term/115/all</link>
    <description></description>
    <language>en</language>
          <item>
    <title>What is ePub (and why does it matter for metadata and application profiles)?</title>
    <link>http://technicalfoundations.ukoln.ac.uk/blog/what-epub-and-why-does-it-matter-metadata-and-application-profiles</link>
    <description>&lt;p&gt;ePub is a standard packaging format designed for ebook readers. Why is ePub of interest from the point of view of metadata and application profiles?&lt;/p&gt;
&lt;p&gt;&lt;!--break--&gt;&lt;!--break--&gt;&lt;/p&gt;
&lt;p&gt;Here is the definition given in the &lt;a href=&quot;http://en.wikipedia.org/wiki/EPUB&quot;&gt;entry&lt;/a&gt; in &lt;a href=&quot;http://en.wikipedia.org/&quot;&gt;Wikipedia&lt;/a&gt;:&lt;/p&gt;
&lt;blockquote&gt;&lt;p&gt;[...] ePub [...] is a free and open &lt;a href=&quot;http://en.wikipedia.org/wiki/E-book&quot; title=&quot;E-book&quot;&gt;e-book&lt;/a&gt; standard by the &lt;a href=&quot;http://en.wikipedia.org/wiki/International_Digital_Publishing_Forum&quot; title=&quot;International Digital Publishing Forum&quot;&gt;International Digital Publishing Forum&lt;/a&gt; (IDPF). Files have the extension &lt;em&gt;.epub&lt;/em&gt;. EPUB is designed for &lt;em&gt;reflowable&lt;/em&gt; content, meaning that the text display can be optimized for the particular display device used by the reader of the EPUB-formatted book. The format is meant to function as a single format that publishers and conversion houses can use in-house, as well as for distribution and sale.&lt;/p&gt;&lt;/blockquote&gt;
&lt;p&gt;That is to say that ePub contains within it the Open Packaging Format (for convenience, we can ignore the other structural parts for the purposes of this discussion), which defines the structure of both the metadata for the item contained within the file and the presentational (XML, XHMTL, CSS) elements of the standard. It is similar in many ways to a .docx file (MS Word 2007 onwards) in being effectively a specialised type of .zip file.&lt;/p&gt;
&lt;p&gt;So why is ePub of interest from the point of view of metadata and application profiles? The IDPF&#039;s Open Packaging Format gives &lt;a href=&quot;http://www.idpf.org/doc_library/epub/OPF_2.0.1_draft.htm#Section1.3.3&quot;&gt;this description&lt;/a&gt;:&lt;/p&gt;
&lt;blockquote&gt;&lt;p&gt;Dublin Core metadata is designed to minimize the cataloging burden on authors and publishers, while providing enough metadata to be useful. This specification supports the set of Dublin Core 1.1 metadata elements (&lt;a href=&quot;http://dublincore.org/documents/2004/12/20/dces/&quot; target=&quot;_blank&quot; title=&quot;Dublin Core metadata elements specification.&quot;&gt;http://dublincore.org/documents/2004/12/20/dces/&lt;/a&gt;), supplemented with a small set of additional attributes addressing areas where more specific information is useful. For example, the OPF role attribute added to the Dublin Core creator and contributor elements allows for much more detailed specification of contributors to a publication, including their roles expressed via relator codes. Content providers must include a minimum set of metadata elements, defined in &lt;a href=&quot;http://www.idpf.org/doc_library/epub/OPF_2.0.1_draft.htm#Section2.2&quot; title=&quot;Publication Metadata&quot;&gt;Section 2.2&lt;/a&gt;, and should incorporate additional metadata to enable readers to discover publications of interest.&lt;/p&gt;&lt;/blockquote&gt;
&lt;p&gt;In which case, how is the metadata contained within ePub any different to Dublin Core 1.1? This is the interesting part:&lt;/p&gt;
&lt;blockquote&gt;&lt;p&gt;Because the Dublin Core metadata fields for creator and contributor do not distinguish roles of specific contributors (such as author, editor, and illustrator), this specification adds an optional &lt;strong&gt;&lt;em&gt;role&lt;/em&gt;&lt;/strong&gt; attribute for this purpose. See &lt;a href=&quot;http://www.idpf.org/doc_library/epub/OPF_2.0.1_draft.htm#Section2.2.6&quot; title=&quot;Role&quot;&gt;Section 2.2.6&lt;/a&gt; for a discussion of &lt;strong&gt;&lt;em&gt;role&lt;/em&gt;&lt;/strong&gt;. To facilitate machine processing of Dublin Core creator and contributor fields, this specification adds the optional &lt;strong&gt;&lt;em&gt;file-as&lt;/em&gt;&lt;/strong&gt; attribute for those elements. This attribute is used to specify a normalized form of the contents. See &lt;a href=&quot;http://www.idpf.org/doc_library/epub/OPF_2.0.1_draft.htm#Section2.2.2&quot; title=&quot;File-As&quot;&gt;Section 2.2.2&lt;/a&gt; for a discussion of &lt;strong&gt;&lt;em&gt;file-as&lt;/em&gt;&lt;/strong&gt;. This specification also adds a &lt;strong&gt;&lt;em&gt;scheme&lt;/em&gt;&lt;/strong&gt; attribute to the Dublin Core &lt;strong&gt;&lt;em&gt;identifier&lt;/em&gt;&lt;/strong&gt; element to provide a structural mechanism to separate an identifier value from the system or authority that generated or defined that &lt;strong&gt;&lt;em&gt;identifier&lt;/em&gt;&lt;/strong&gt; value. See &lt;a href=&quot;http://www.idpf.org/doc_library/epub/OPF_2.0.1_draft.htm#Section2.2.10&quot; title=&quot;Scheme.&quot;&gt;Section 2.2.10&lt;/a&gt; for a discussion of &lt;strong&gt;&lt;em&gt;scheme&lt;/em&gt;&lt;/strong&gt;. This specification also adds an &lt;strong&gt;&lt;em&gt;event&lt;/em&gt;&lt;/strong&gt; attribute to the Dublin Core date element to enable content providers to distinguish various publication specific dates (for example, creation, publication, modification). See &lt;a href=&quot;http://www.idpf.org/doc_library/epub/OPF_2.0.1_draft.htm#Section2.2.7&quot; title=&quot;Event.&quot;&gt;Section 2.2.7&lt;/a&gt; for a discussion of &lt;strong&gt;&lt;em&gt;event&lt;/em&gt;&lt;/strong&gt;.&lt;/p&gt;&lt;/blockquote&gt;
&lt;p&gt;Using these addition attributes, it is possible to define more accurately what certain fields contain, a standard, normalised format for agent metadata such as personal names, schemes defining the format in which a particular field is expected to appear, identifiers to provide a mechanism to link that metadata to the generating system or authority, and events to describe more accurately the events that have occurred during the life cycle of the item. By applying such constraints that are beyond the scope of DC 1.1, the ePub format effectively contains a &lt;em&gt;de facto&lt;/em&gt; application profile, identified by its own namespace. Further, &lt;em&gt;ad hoc&lt;/em&gt; metadata can be added using the (X)HTML &lt;strong&gt;&lt;em&gt;meta&lt;/em&gt;&lt;/strong&gt; element:&lt;/p&gt;
&lt;blockquote&gt;&lt;p&gt;One or more optional instances of a &lt;strong&gt;&lt;em&gt;meta&lt;/em&gt;&lt;/strong&gt; element, analogous to the XHTML 1.1 &lt;strong&gt;&lt;em&gt;meta&lt;/em&gt;&lt;/strong&gt; element but applicable to the publication as a whole, may be placed within the metadata element [...]. This allows content providers to express arbitrary metadata beyond the data described by the Dublin Core specification. Individual OPS Content Documents may include the &lt;strong&gt;&lt;em&gt;meta&lt;/em&gt;&lt;/strong&gt; element directly (as in XHTML 1.1) for document-specific metadata. This specification uses the OPF Package Document alone as the basis for expressing publication-level Dublin Core metadata.&lt;/p&gt;&lt;/blockquote&gt;
&lt;p&gt;It would seem, however, that this last option suffers from the weakness that such metadata is invented on the fly, and does not have to follow the constraints of any schema or authority. Nonetheless, it would seem overall that the ePub &quot;application profile&quot; does significantly add to the functionality DC 1.1 in a potentially useful way. Different types of agent defined in DC, such as creator, contributor, can be further defined, for example author, editor, illustrator and thesis supervisor for higher degrees. Potentially, this could be leveraged for use with a number of different types of resources and for various purposes, although ePub by it&#039;s very nature is designed for &lt;em&gt;reflowable&lt;/em&gt; content, which by and large means textual resources such as books, articles, manuals and so on. Illustrations, tables, charts, images and other non-reflowable content can potentially create a problem on the small screens of mobile devices such as ebook readers. The structure of this application profile is very simple and easy to use, unlike for example the classic form of &lt;a href=&quot;http://www.ukoln.ac.uk/repositories/digirep/index/SWAP&quot;&gt;SWAP&lt;/a&gt;, whose structure is based directly upon its conceptual data model, a simplified version of &lt;a href=&quot;http://www.ifla.org/en/publications/functional-requirements-for-bibliographic-records&quot;&gt;FRBR&lt;/a&gt;. It would be extremely interesting to compare the two, since they are fundamentally similar, relatively simple solutions that are limited in scope to online publications and similar resources. It would be most revealing to see whether what SWAP seeks to achieve can be done in a simpler way, and whether either SWAP or the ePub application profile have functionality that the other cannot provide. Ultimately, the purpose of this investigation could be to provide online textual content, for example in repositories, via increasingly popular hand-held devices, and to capitalise on the rapid growth of commercial ebooks. It would probably be necessary to provide .epub files in such systems as well as the usual .pdf and .doc(x) formats that are common in publishing, and consequently in institutional repositories. Either this would need to be done by converting the existing content, and likewise new content after it is deposited, or in addition by providing tools to enable the ePub format to be more immediately accessible to service providers and depositors in future.&lt;/p&gt;
</description>
     <comments>http://technicalfoundations.ukoln.ac.uk/blog/what-epub-and-why-does-it-matter-metadata-and-application-profiles#comments</comments>
 <category domain="http://technicalfoundations.ukoln.ac.uk/overview/persons/talat-chaudhri">talat chaudhri</category>
 <category domain="http://technicalfoundations.ukoln.ac.uk/overview/organisations/dcmi">dcmi</category>
 <category domain="http://technicalfoundations.ukoln.ac.uk/overview/organisations/dublin-core-metadata-initiative">dublin core metadata initiative</category>
 <category domain="http://technicalfoundations.ukoln.ac.uk/overview/organisations/idpf">idpf</category>
 <category domain="http://technicalfoundations.ukoln.ac.uk/overview/organisations/international-digital-publishing-forum">international digital publishing forum</category>
 <category domain="http://technicalfoundations.ukoln.ac.uk/overview/projects/application-profiles-support">application profiles support</category>
 <category domain="http://technicalfoundations.ukoln.ac.uk/overview/topics/dc">dc</category>
 <category domain="http://technicalfoundations.ukoln.ac.uk/overview/topics/dcmes-11">dcmes 1.1</category>
 <category domain="http://technicalfoundations.ukoln.ac.uk/overview/topics/epub">epub</category>
 <category domain="http://technicalfoundations.ukoln.ac.uk/overview/topics/frbr">frbr</category>
 <category domain="http://technicalfoundations.ukoln.ac.uk/overview/topics/swap">swap</category>
 <pubDate>Wed, 08 Jun 2011 13:35:57 +0000</pubDate>
 <dc:creator>Talat Chaudhri</dc:creator>
 <guid isPermaLink="false">10 at http://technicalfoundations.ukoln.ac.uk</guid>
  </item>
  </channel>
</rss>