<?xml version="1.0" encoding="iso8859-1"?>
<rss version="2.0">
<channel>
<title>wiki.dbpedia.org - Documentation</title>
<link>http://wiki.dbpedia.org/Documentation</link>
<description>History/revisions of wiki.dbpedia.org/Documentation</description>
<language>en-us</language>
<item>
<title>2008-07-07 23:12:02</title>
<link>http://wiki.dbpedia.org/Documentation/show?time=2008-07-07+23%3A12%3A02</link>
<description>&lt;div class="pageBefore">&lt;img src="http://wiki.dbpedia.org/images/z.gif" width="1" height="1" border="0" alt="" style="display:block" align="top" />&lt;/div>&lt;div class="page">
&lt;b>Comparison for &lt;a name=".documentation" href="http://wiki.dbpedia.org/Documentation" class="">/Documentation&lt;/a> of &lt;a href="http://wiki.dbpedia.org/Documentation?time=2008-07-07+23%3A12%3A02">2008-07-07 23:12:02&lt;/a> and &lt;a href="http://wiki.dbpedia.org/Documentation">2008-07-07 23:14:29&lt;/a>&lt;/b>&lt;br />
&lt;br />
&lt;b>Additions:&lt;/b>&lt;br />
&lt;div class="additions">If&amp;nbsp;you want to&amp;nbsp;create your own&amp;nbsp;dumps or&amp;nbsp;work within your IDE&amp;nbsp;or&amp;nbsp;on the&amp;nbsp;console, you&amp;nbsp;should use&amp;nbsp;&lt;em>start.php&lt;/em>. If&amp;nbsp;you first want to&amp;nbsp;learn how&amp;nbsp;DBpedia extraction works, or&amp;nbsp;test /debug new&amp;nbsp;extractors, &lt;em>webStart.php&lt;/em> might best suit your needs, as&amp;nbsp;it gives you&amp;nbsp;a&amp;nbsp;comfortable web&amp;nbsp;debug interface, similar to&amp;nbsp;&lt;a href="http://www4.wiwiss.fu-berlin.de/dbp-debug/" target="_blank" title="Outgoing link (in new window)" class="outerlink">&lt;img src="http://wiki.dbpedia.org/themes/dbpedia2007/icons/world_link.png" alt="" class="contexticon" />http://www4.wiwiss.fu-berlin.de/dbp-debug/&lt;/a>. Make sure to&amp;nbsp;download &lt;a href="http://sourceforge.net/projects/rdfapi-php/" target="_blank" title="Outgoing link (in new window)" class="outerlink">&lt;img src="http://wiki.dbpedia.org/themes/dbpedia2007/icons/world_link.png" alt="" class="contexticon" />RAP &amp;ndash; RDF&amp;nbsp;API for&amp;nbsp;PHP&lt;/a> first if&amp;nbsp;you want to&amp;nbsp;use the&amp;nbsp;web interface.&lt;/div>&lt;br />
&lt;b>Deletions:&lt;/b>&lt;br />
&lt;div class="deletions">If&amp;nbsp;you want to&amp;nbsp;create your own&amp;nbsp;dumps or&amp;nbsp;work within your IDE&amp;nbsp;or&amp;nbsp;on the&amp;nbsp;console, you&amp;nbsp;should use&amp;nbsp;&lt;em>start.php&lt;/em>. If&amp;nbsp;you first want to&amp;nbsp;learn how&amp;nbsp;DBpedia extraction works, or&amp;nbsp;test /debug new&amp;nbsp;extractors, &lt;em>webStart.php&lt;/em> might best suit your needs, as&amp;nbsp;it gives you&amp;nbsp;a&amp;nbsp;comfortable web&amp;nbsp;debug interface, similar to&amp;nbsp;&lt;a href="http://www4.wiwiss.fu-berlin.de/dbp-debug/" target="_blank" title="Outgoing link (in new window)" class="outerlink">&lt;img src="http://wiki.dbpedia.org/themes/dbpedia2007/icons/world_link.png" alt="" class="contexticon" />http://www4.wiwiss.fu-berlin.de/dbp-debug/&lt;/a>. Make sure to&amp;nbsp;download &lt;a href="http://sourceforge.net/projects/rdfapi-php/" target="_blank" title="Outgoing link (in new window)" class="outerlink">&lt;img src="http://wiki.dbpedia.org/themes/dbpedia2007/icons/world_link.png" alt="" class="contexticon" />RAP &amp;ndash; RdF&amp;nbsp;API for&amp;nbsp;PHP&lt;/a> first if&amp;nbsp;you want to&amp;nbsp;use the&amp;nbsp;web interface.&lt;/div>&lt;/div>
</description>
</item>
<item>
<title>2007-09-04 19:02:10</title>
<link>http://wiki.dbpedia.org/Documentation/show?time=2007-09-04+19%3A02%3A10</link>
<description>&lt;div class="pageBefore">&lt;img src="http://wiki.dbpedia.org/images/z.gif" width="1" height="1" border="0" alt="" style="display:block" align="top" />&lt;/div>&lt;div class="page">
&lt;b>Comparison for &lt;a  href="http://wiki.dbpedia.org/Documentation" class="">/Documentation&lt;/a> of &lt;a href="http://wiki.dbpedia.org/Documentation?time=2007-09-04+19%3A02%3A10">2007-09-04 19:02:10&lt;/a> and &lt;a href="http://wiki.dbpedia.org/Documentation?time=2008-07-07+23%3A12%3A02">2008-07-07 23:12:02&lt;/a>&lt;/b>&lt;br />
&lt;br />
&lt;b>Additions:&lt;/b>&lt;br />
&lt;div class="additions">This pages describes the&amp;nbsp;DBpedia information extraction framework. The&amp;nbsp;framework consists of&amp;nbsp;the interfaces: &lt;tt>Destination&lt;/tt>, &lt;tt>Extractor&lt;/tt>, &lt;tt>Page&amp;nbsp;Collection&lt;/tt>, and&amp;nbsp;&lt;tt>RDFnode&lt;/tt>, plus the&amp;nbsp;essential classes &lt;tt>Extraction&amp;nbsp;Group&lt;/tt>, &lt;tt>Extraction&amp;nbsp;Job&lt;/tt>, &lt;tt>Extraction&amp;nbsp;Manager&lt;/tt>, &lt;tt>Extraction&amp;nbsp;Result&lt;/tt>, and&amp;nbsp;&lt;tt>RDFtriple&lt;/tt>.&lt;br />
The&amp;nbsp;code extraction process is&amp;nbsp;triggered via&amp;nbsp;the ExtractionManager, which starts one&amp;nbsp;or&amp;nbsp;more ExtractionJobs. An&amp;nbsp;ExtractionJob combines one&amp;nbsp;or&amp;nbsp;more ExtractionGroups with a&amp;nbsp;PageCollection. The&amp;nbsp;PageCollection is&amp;nbsp;the data source, e.g., all&amp;nbsp;articles from the&amp;nbsp;Wikipedia SQL-Dump. ExtractionGroups consist of&amp;nbsp;a Destination and&amp;nbsp;one or&amp;nbsp;more Extractors. Possible Destinations include your console, NTriple files and&amp;nbsp;the web&amp;nbsp;interface. Of&amp;nbsp;course you&amp;nbsp;are free to&amp;nbsp;write your own&amp;nbsp;destinations (e.g., databases, RDF/XML files). &lt;br />
Extractors are&amp;nbsp;designed for&amp;nbsp;single specific purposes, e.g., the&amp;nbsp;InfoboxExtractor reads out&amp;nbsp;information from Wikipedia Infoboxes. The&amp;nbsp;ShortAbstractExtractor gets the&amp;nbsp;first paragraph from an&amp;nbsp;article and&amp;nbsp;so&amp;nbsp;on. DBpedia comes with extractors for&amp;nbsp;many purposes already, though you&amp;nbsp;are invited to&amp;nbsp;add your own.&lt;br />
&lt;ul>&lt;li> &lt;tt>start()&lt;/tt>: Initializes the&amp;nbsp;Destination (e.g., creates a&amp;nbsp;new NTriple file). Is&amp;nbsp;called once at&amp;nbsp;the beginning of&amp;nbsp;an ExtractionJob.&lt;/li>&lt;/ul>
PageCollections are&amp;nbsp;the data sources for&amp;nbsp;extraction. A&amp;nbsp;PageCollection loads the&amp;nbsp;page source code for&amp;nbsp;a&amp;nbsp;specific language and&amp;nbsp;one or&amp;nbsp;more pages (Implementations: LiveWikipedia, DatabaseWikipedia).&lt;br />
URI, RDFliteral, and&amp;nbsp;RDFblankNode are&amp;nbsp;implementations of&amp;nbsp;RDFnodes.&lt;br />
In&amp;nbsp;addition, information such as&amp;nbsp;datatype, language, and&amp;nbsp;lexical form of&lt;br />
&lt;ul>&lt;li>&lt;tt>__construct($destination, $metadestination = NULL)&lt;/tt>: $destination is&amp;nbsp;an object of&amp;nbsp;a class, implementing the&amp;nbsp;interface Destination. $metadestination is&amp;nbsp;an optional Destination, where meta information can&amp;nbsp;be&amp;nbsp;stored. It&amp;nbsp;is mainly used by&amp;nbsp;the InfoboxExtractor, which collects all&amp;nbsp;predicate names in&amp;nbsp;a metadestination.&lt;/li>&lt;/ul>
RDFtriple combines a&amp;nbsp;subject, a&amp;nbsp;predicate, and&amp;nbsp;an&amp;nbsp;object, into an&amp;nbsp;RDFtriple. You&amp;nbsp;will usually create RDFnodes via&amp;nbsp;RDFtriples, e.g. &lt;tt>$subject = RDFtriple::URI("Resource&amp;nbsp;Name");&lt;/tt> as&amp;nbsp;RDFtriple can&amp;nbsp;add a&amp;nbsp;common URI&amp;nbsp;prefix for&amp;nbsp;each triple. &lt;br />
&lt;ul>&lt;li>&lt;tt>static function literal($value, $datatype = null, $lang = null)&lt;/tt>: Returns an&amp;nbsp;RDFliteral. $value must be&amp;nbsp;of datatype String; else an&amp;nbsp;error message will be&amp;nbsp;thrown, as $value will be&amp;nbsp;serialized for&amp;nbsp;NTriples and&amp;nbsp;this only works with string variables. $datatype and $lang are&amp;nbsp;optional parameters. $datatype is&amp;nbsp;a string containing the&amp;nbsp;(RDF-) datatype of $value, $lang the&amp;nbsp;language.&lt;/li>&lt;/ul>&lt;/div>&lt;br />
&lt;b>Deletions:&lt;/b>&lt;br />
&lt;div class="deletions">This pages describes the&amp;nbsp;DBpedia information extraction framework. The&amp;nbsp;framework consists of&amp;nbsp;the interfaces: Destination, Extractor, Page&amp;nbsp;Collection and&amp;nbsp;RDFnode, plus the&amp;nbsp;essential classes Extraction&amp;nbsp;Group, Extraction&amp;nbsp;Job, Extraction&amp;nbsp;Manager, Extraction&amp;nbsp;Result and&amp;nbsp;RDFtriple.&lt;br />
The&amp;nbsp;code extraction process is&amp;nbsp;triggered via&amp;nbsp;the ExtractionManager, which starts one&amp;nbsp;or&amp;nbsp;more ExtractionJobs. An&amp;nbsp;ExtractionJob combines one&amp;nbsp;or&amp;nbsp;more ExtractionGroups with a&amp;nbsp;PageCollection. The&amp;nbsp;PageCollection is&amp;nbsp;the data source, e.g. all&amp;nbsp;articles from the&amp;nbsp;Wikipedia SQL-Dump. ExtractionGroups consist of&amp;nbsp;a Destination and&amp;nbsp;one or&amp;nbsp;more Extractors. Possible Destinations include your console, NTriple files and&amp;nbsp;the web&amp;nbsp;interface. Of&amp;nbsp;course you&amp;nbsp;are free to&amp;nbsp;write your own&amp;nbsp;destinations (e.g. databases, RDF/XML files). &lt;br />
Extractors are&amp;nbsp;designed for&amp;nbsp;single specific purposes. E.g. the&amp;nbsp;InfoboxExtractor reads out&amp;nbsp;information from Wikipedia Infoboxes. The&amp;nbsp;ShortAbstractExtractor gets the&amp;nbsp;first paragraph from an&amp;nbsp;article and&amp;nbsp;so&amp;nbsp;on. DBpedia comes with extractors for&amp;nbsp;many purposes already, though you&amp;nbsp;are invited to&amp;nbsp;add your own.&lt;br />
&lt;ul>&lt;li> &lt;tt>start()&lt;/tt>: Initializes the&amp;nbsp;Destination (e.g. creates a&amp;nbsp;new NTriple file). Is&amp;nbsp;called once at&amp;nbsp;the beginning of&amp;nbsp;an ExtractionJob.&lt;/li>&lt;/ul>
PageCollections are&amp;nbsp;the data source for&amp;nbsp;extraction. A&amp;nbsp;PageCollection loads the&amp;nbsp;page source code for&amp;nbsp;a&amp;nbsp;specific language and&amp;nbsp;one or&amp;nbsp;more pages (Implementations: LiveWikipedia, DatabaseWikipedia).&lt;br />
URI, RDFliteral and&amp;nbsp;RDFblankNode are&amp;nbsp;implementations of&amp;nbsp;RDFnodes.&lt;br />
In&amp;nbsp;addition informations such as&amp;nbsp;datatype, language and&amp;nbsp;lexical form of&lt;br />
&lt;ul>&lt;li>&lt;tt>__construct($destination, $metadestination = NULL)&lt;/tt>: $destination is&amp;nbsp;an object of&amp;nbsp;a class, implementing the&amp;nbsp;interface Destination. $metadestination is&amp;nbsp;an optional Destination, where meta information can&amp;nbsp;be&amp;nbsp;stored. It&amp;nbsp;is mainly used by&amp;nbsp;the InfoboxExtractor, which collects all&amp;nbsp;predicate names in&amp;nbsp;a metdestination.&lt;/li>&lt;/ul>
RDFtriple combines a&amp;nbsp;subject, a&amp;nbsp;predicate and&amp;nbsp;an&amp;nbsp;object to&amp;nbsp;a RDFtriple. You&amp;nbsp;will usually create RDFnodes via&amp;nbsp;RDFtriples. E.g. &lt;tt>$subject = RDFtriple::URI("Resource&amp;nbsp;Name");&lt;/tt> as&amp;nbsp;RDFtriple can&amp;nbsp;add a&amp;nbsp;common URI&amp;nbsp;prefix for&amp;nbsp;each triple. &lt;br />
&lt;ul>&lt;li>&lt;tt>static function literal($value, $datatype = null, $lang = null)&lt;/tt>: Returns a&amp;nbsp;RDFliteral. $value must be&amp;nbsp;of datatype String, else an&amp;nbsp;error message will be&amp;nbsp;thrown, as $value will be&amp;nbsp;serialized for&amp;nbsp;NTriples and&amp;nbsp;this only works with string variables. $datatype and $lang are&amp;nbsp;optional parameters. $datatype is&amp;nbsp;a string containing the&amp;nbsp;(RDF-) datatype of $value, $lang the&amp;nbsp;language.&lt;/li>&lt;/ul>&lt;/div>&lt;/div>
</description>
</item>
<item>
<title>2007-08-23 11:21:49</title>
<link>http://wiki.dbpedia.org/Documentation/show?time=2007-08-23+11%3A21%3A49</link>
<description>&lt;div class="pageBefore">&lt;img src="http://wiki.dbpedia.org/images/z.gif" width="1" height="1" border="0" alt="" style="display:block" align="top" />&lt;/div>&lt;div class="page">
&lt;b>Comparison for &lt;a  href="http://wiki.dbpedia.org/Documentation" class="">/Documentation&lt;/a> of &lt;a href="http://wiki.dbpedia.org/Documentation?time=2007-08-23+11%3A21%3A49">2007-08-23 11:21:49&lt;/a> and &lt;a href="http://wiki.dbpedia.org/Documentation?time=2007-09-04+19%3A02%3A10">2007-09-04 19:02:10&lt;/a>&lt;/b>&lt;br />
&lt;br />
&lt;b>Additions:&lt;/b>&lt;br />
&lt;div class="additions">&lt;a name="h25-1">&lt;/a>&lt;h1>The DBpedia Information Extraction Framework&lt;/h1>
The&amp;nbsp;DBpedia community uses a&amp;nbsp;flexible and&amp;nbsp;extensible framework to&amp;nbsp;extract different kinds of&amp;nbsp;structured information from Wikipedia. &lt;br />
The&amp;nbsp;DBpedia information extraction framework is&amp;nbsp;written using PHP&amp;nbsp;5. The&amp;nbsp;framework is&amp;nbsp;available from the&amp;nbsp;&lt;a href="http://dbpedia.svn.sourceforge.net/viewvc/dbpedia/" target="_blank" title="Outgoing link (in new window)" class="outerlink">&lt;img src="http://wiki.dbpedia.org/themes/dbpedia2007/icons/world_link.png" alt="" class="contexticon" />DBpedia SVN&lt;/a> (GNU GPL&amp;nbsp;License).&lt;br />
This pages describes the&amp;nbsp;DBpedia information extraction framework. The&amp;nbsp;framework consists of&amp;nbsp;the interfaces: Destination, Extractor, Page&amp;nbsp;Collection and&amp;nbsp;RDFnode, plus the&amp;nbsp;essential classes Extraction&amp;nbsp;Group, Extraction&amp;nbsp;Job, Extraction&amp;nbsp;Manager, Extraction&amp;nbsp;Result and&amp;nbsp;RDFtriple.&lt;/div>&lt;br />
&lt;b>Deletions:&lt;/b>&lt;br />
&lt;div class="deletions">&lt;a name="h25-1">&lt;/a>&lt;h1>Documentation of&amp;nbsp;the DBpedia Extraction Framework&lt;/h1>
&lt;strong>Preliminary Version: This document is&amp;nbsp;not complete and&amp;nbsp;might contain errors.&lt;/strong>&lt;br />
This pages describes the&amp;nbsp;DBpedia extraction framework. DBpedia data extraction is&amp;nbsp;written entirely in&amp;nbsp;PHP 5. The&amp;nbsp;framework consists of&amp;nbsp;the interfaces Destination, Extractor, Page&amp;nbsp;Collection and&amp;nbsp;RDFnode, plus the&amp;nbsp;essential classes Extraction&amp;nbsp;Group, Extraction&amp;nbsp;Job, Extraction&amp;nbsp;Manager, Extraction&amp;nbsp;Result and&amp;nbsp;RDFtriple.&lt;/div>&lt;/div>
</description>
</item>
<item>
<title>2007-08-23 11:19:35</title>
<link>http://wiki.dbpedia.org/Documentation/show?time=2007-08-23+11%3A19%3A35</link>
<description>&lt;div class="pageBefore">&lt;img src="http://wiki.dbpedia.org/images/z.gif" width="1" height="1" border="0" alt="" style="display:block" align="top" />&lt;/div>&lt;div class="page">
&lt;b>Comparison for &lt;a  href="http://wiki.dbpedia.org/Documentation" class="">/Documentation&lt;/a> of &lt;a href="http://wiki.dbpedia.org/Documentation?time=2007-08-23+11%3A19%3A35">2007-08-23 11:19:35&lt;/a> and &lt;a href="http://wiki.dbpedia.org/Documentation?time=2007-08-23+11%3A21%3A49">2007-08-23 11:21:49&lt;/a>&lt;/b>&lt;br />
&lt;br />
&lt;b>Additions:&lt;/b>&lt;br />
&lt;div class="additions">&lt;ul>&lt;li>&lt;tt>static function blank($label)&lt;/tt>: Returns a&amp;nbsp;RDFblankNode. $label must be&amp;nbsp;string containing the&amp;nbsp;Blanknode label.
&lt;/li>&lt;li>&lt;tt>static function URI($uri)&lt;/tt>: Returns an&amp;nbsp;URI. $uri is&amp;nbsp;a string containing the&amp;nbsp;URI.&lt;/li>&lt;/ul>&lt;/div>&lt;/div>
</description>
</item>
<item>
<title>2007-08-21 23:31:26</title>
<link>http://wiki.dbpedia.org/Documentation/show?time=2007-08-21+23%3A31%3A26</link>
<description>&lt;div class="pageBefore">&lt;img src="http://wiki.dbpedia.org/images/z.gif" width="1" height="1" border="0" alt="" style="display:block" align="top" />&lt;/div>&lt;div class="page">
&lt;b>Comparison for &lt;a  href="http://wiki.dbpedia.org/Documentation" class="">/Documentation&lt;/a> of &lt;a href="http://wiki.dbpedia.org/Documentation?time=2007-08-21+23%3A31%3A26">2007-08-21 23:31:26&lt;/a> and &lt;a href="http://wiki.dbpedia.org/Documentation?time=2007-08-23+11%3A19%3A35">2007-08-23 11:19:35&lt;/a>&lt;/b>&lt;br />
&lt;br />
&lt;b>Additions:&lt;/b>&lt;br />
&lt;div class="additions">&lt;ul>&lt;li>&lt;tt>static function literal($value, $datatype = null, $lang = null)&lt;/tt>: Returns a&amp;nbsp;RDFliteral. $value must be&amp;nbsp;of datatype String, else an&amp;nbsp;error message will be&amp;nbsp;thrown, as $value will be&amp;nbsp;serialized for&amp;nbsp;NTriples and&amp;nbsp;this only works with string variables. $datatype and $lang are&amp;nbsp;optional parameters. $datatype is&amp;nbsp;a string containing the&amp;nbsp;(RDF-) datatype of $value, $lang the&amp;nbsp;language.&lt;/li>&lt;/ul>&lt;/div>&lt;/div>
</description>
</item>
<item>
<title>2007-08-21 23:29:24</title>
<link>http://wiki.dbpedia.org/Documentation/show?time=2007-08-21+23%3A29%3A24</link>
<description>&lt;div class="pageBefore">&lt;img src="http://wiki.dbpedia.org/images/z.gif" width="1" height="1" border="0" alt="" style="display:block" align="top" />&lt;/div>&lt;div class="page">
&lt;b>Comparison for &lt;a  href="http://wiki.dbpedia.org/Documentation" class="">/Documentation&lt;/a> of &lt;a href="http://wiki.dbpedia.org/Documentation?time=2007-08-21+23%3A29%3A24">2007-08-21 23:29:24&lt;/a> and &lt;a href="http://wiki.dbpedia.org/Documentation?time=2007-08-21+23%3A31%3A26">2007-08-21 23:31:26&lt;/a>&lt;/b>&lt;br />
&lt;br />
&lt;b>Additions:&lt;/b>&lt;br />
&lt;div class="additions">&lt;a name="h25-1">&lt;/a>&lt;h3>Interface Destination&lt;/h3>&lt;a name="h25-2">&lt;/a>&lt;h3>Interface Extractor&lt;/h3>&lt;a name="h25-3">&lt;/a>&lt;h3>Interface PageCollection&lt;/h3>&lt;a name="h25-4">&lt;/a>&lt;h3>Interface RDFnode&lt;/h3>&lt;a name="h25-5">&lt;/a>&lt;h3>Class ExtractionGroup&lt;/h3>&lt;a name="h25-6">&lt;/a>&lt;h3>Class ExtractionJob&lt;/h3>&lt;a name="h25-7">&lt;/a>&lt;h3>Class ExtractionManager&lt;/h3>&lt;a name="h25-8">&lt;/a>&lt;h3>Class ExtractionResult&lt;/h3>&lt;a name="h25-9">&lt;/a>&lt;h3>Class RDFtriple&lt;/h3>&lt;/div>&lt;br />
&lt;b>Deletions:&lt;/b>&lt;br />
&lt;div class="deletions">&lt;strong>Interface Destination&lt;/strong>&lt;br />
&lt;strong>Interface Extractor&lt;/strong>&lt;br />
&lt;strong>Interface PageCollection:&lt;/strong>&lt;br />
&lt;strong>Interface RDFnode&lt;/strong>&lt;br />
&lt;strong>Class ExtractionGroup&lt;/strong>&lt;br />
&lt;strong>Class ExtractionJob&lt;/strong>&lt;br />
&lt;strong>Class ExtractionManager&lt;/strong>&lt;br />
&lt;strong>Class ExtractionResult&lt;/strong>&lt;br />
&lt;strong>Class RDFtriple&lt;/strong>&lt;/div>&lt;/div>
</description>
</item>
<item>
<title>2007-08-21 23:25:19</title>
<link>http://wiki.dbpedia.org/Documentation/show?time=2007-08-21+23%3A25%3A19</link>
<description>&lt;div class="pageBefore">&lt;img src="http://wiki.dbpedia.org/images/z.gif" width="1" height="1" border="0" alt="" style="display:block" align="top" />&lt;/div>&lt;div class="page">
&lt;b>Comparison for &lt;a  href="http://wiki.dbpedia.org/Documentation" class="">/Documentation&lt;/a> of &lt;a href="http://wiki.dbpedia.org/Documentation?time=2007-08-21+23%3A25%3A19">2007-08-21 23:25:19&lt;/a> and &lt;a href="http://wiki.dbpedia.org/Documentation?time=2007-08-21+23%3A29%3A24">2007-08-21 23:29:24&lt;/a>&lt;/b>&lt;br />
&lt;br />
&lt;b>Additions:&lt;/b>&lt;br />
&lt;div class="additions">To&amp;nbsp;get the&amp;nbsp;framework running on&amp;nbsp;your local PC, it&amp;nbsp;is recommended to&amp;nbsp;start with the&amp;nbsp;pre-configured files &lt;em>start.php&lt;/em> or&amp;nbsp;&lt;em>webStart.php&lt;/em>. You&amp;nbsp;will find these in&amp;nbsp;the DBpedia root folder. The&amp;nbsp;extraction code is&amp;nbsp;available via&amp;nbsp;&lt;a href="http://sourceforge.net/projects/dbpedia/" target="_blank" title="Outgoing link (in new window)" class="outerlink">&lt;img src="http://wiki.dbpedia.org/themes/dbpedia2007/icons/world_link.png" alt="" class="contexticon" />Sourceforge&lt;/a> SVN&amp;nbsp;(do not&amp;nbsp;download the&amp;nbsp;release version at&amp;nbsp;Sourceforge).&lt;br />
If&amp;nbsp;you want to&amp;nbsp;create your own&amp;nbsp;dumps or&amp;nbsp;work within your IDE&amp;nbsp;or&amp;nbsp;on the&amp;nbsp;console, you&amp;nbsp;should use&amp;nbsp;&lt;em>start.php&lt;/em>. If&amp;nbsp;you first want to&amp;nbsp;learn how&amp;nbsp;DBpedia extraction works, or&amp;nbsp;test /debug new&amp;nbsp;extractors, &lt;em>webStart.php&lt;/em> might best suit your needs, as&amp;nbsp;it gives you&amp;nbsp;a&amp;nbsp;comfortable web&amp;nbsp;debug interface, similar to&amp;nbsp;&lt;a href="http://www4.wiwiss.fu-berlin.de/dbp-debug/" target="_blank" title="Outgoing link (in new window)" class="outerlink">&lt;img src="http://wiki.dbpedia.org/themes/dbpedia2007/icons/world_link.png" alt="" class="contexticon" />http://www4.wiwiss.fu-berlin.de/dbp-debug/&lt;/a>. Make sure to&amp;nbsp;download &lt;a href="http://sourceforge.net/projects/rdfapi-php/" target="_blank" title="Outgoing link (in new window)" class="outerlink">&lt;img src="http://wiki.dbpedia.org/themes/dbpedia2007/icons/world_link.png" alt="" class="contexticon" />RAP &amp;ndash; RdF&amp;nbsp;API for&amp;nbsp;PHP&lt;/a> first if&amp;nbsp;you want to&amp;nbsp;use the&amp;nbsp;web interface.&lt;br />
If&amp;nbsp;you intend to&amp;nbsp;produce separate output files for&amp;nbsp;each Extractor, you&amp;nbsp;will need an&amp;nbsp;own ExtractionGroup for&amp;nbsp;each extractor (as is&amp;nbsp;done in&amp;nbsp;the standard settings in&amp;nbsp;&lt;em>start.php&lt;/em>). Finally you&amp;nbsp;will need to&amp;nbsp;run your ExtractionJob through an&amp;nbsp;instance of&amp;nbsp;an ExtractionManager.&lt;br />
&lt;ul>&lt;li> &lt;tt>extractPage($pageID, $pageTitle, $pageSource)&lt;/tt>: Includes the&amp;nbsp;actual extraction process. Constructs a&amp;nbsp;new ExtractionResult, extracts data from the&amp;nbsp;source page and&amp;nbsp;stores the&amp;nbsp;extracted data in&amp;nbsp;the ExtractionResult. Is&amp;nbsp;called for&amp;nbsp;each page and&amp;nbsp;must return an&amp;nbsp;ExtractionResult.&lt;/li>&lt;/ul>
PageCollections are&amp;nbsp;the data source for&amp;nbsp;extraction. A&amp;nbsp;PageCollection loads the&amp;nbsp;page source code for&amp;nbsp;a&amp;nbsp;specific language and&amp;nbsp;one or&amp;nbsp;more pages (Implementations: LiveWikipedia, DatabaseWikipedia).&lt;br />
&lt;ul>&lt;li> &lt;tt>getSource($pageTitle)&lt;/tt>: returns the&amp;nbsp;Wikipedia source code for&amp;nbsp;the page $pageTitle&lt;/li>&lt;/ul>
Collects RDFtriples while extraction is&amp;nbsp;in process. Each wiki page needs an&amp;nbsp;own instance of&amp;nbsp;ExtractionResult.&lt;/div>&lt;br />
&lt;b>Deletions:&lt;/b>&lt;br />
&lt;div class="deletions">To&amp;nbsp;get the&amp;nbsp;framework running on&amp;nbsp;your local PC, it&amp;nbsp;is recommended to&amp;nbsp;start with the&amp;nbsp;pre-configured files &lt;em>start.php&lt;/em> or&amp;nbsp;&lt;em>webStart.php&lt;/em>. You&amp;nbsp;will find these in&amp;nbsp;the DBpedia root folder. The&amp;nbsp;extraction code is&amp;nbsp;available via&amp;nbsp;&lt;a href="http://sourceforge.net/projects/dbpedia/" target="_blank" title="Outgoing link (in new window)" class="outerlink">&lt;img src="http://wiki.dbpedia.org/themes/dbpedia2007/icons/world_link.png" alt="" class="contexticon" />sourceforge&lt;/a> SVN&amp;nbsp;(do not&amp;nbsp;download the&amp;nbsp;release version at&amp;nbsp;sourceforge).&lt;br />
If&amp;nbsp;you want to&amp;nbsp;create your own&amp;nbsp;dumps or&amp;nbsp;work within your IDE&amp;nbsp;or&amp;nbsp;on the&amp;nbsp;console, you&amp;nbsp;should use&amp;nbsp;&lt;em>start.php&lt;/em>. If&amp;nbsp;you first want to&amp;nbsp;learn how&amp;nbsp;DBpedia extraction works, or&amp;nbsp;test /debug new&amp;nbsp;extractors, &lt;em>webStart.php&lt;/em> might best suit your needs, as&amp;nbsp;it gives you&amp;nbsp;a&amp;nbsp;comfortable web&amp;nbsp;debug interface, similiar to&amp;nbsp;&lt;a href="http://www4.wiwiss.fu-berlin.de/dbp-debug/" target="_blank" title="Outgoing link (in new window)" class="outerlink">&lt;img src="http://wiki.dbpedia.org/themes/dbpedia2007/icons/world_link.png" alt="" class="contexticon" />http://www4.wiwiss.fu-berlin.de/dbp-debug/&lt;/a>. Make sure to&amp;nbsp;download &lt;a href="http://sourceforge.net/projects/rdfapi-php/" target="_blank" title="Outgoing link (in new window)" class="outerlink">&lt;img src="http://wiki.dbpedia.org/themes/dbpedia2007/icons/world_link.png" alt="" class="contexticon" />RAP &amp;ndash; RdF&amp;nbsp;API for&amp;nbsp;PHP&lt;/a> first if&amp;nbsp;you want to&amp;nbsp;use the&amp;nbsp;webinterface.&lt;br />
If&amp;nbsp;you intend to&amp;nbsp;produce seperate output files for&amp;nbsp;each Extractor, you&amp;nbsp;will need an&amp;nbsp;own ExtractionGroup for&amp;nbsp;each extractor (as is&amp;nbsp;done in&amp;nbsp;the standard settings in&amp;nbsp;&lt;em>start.php&lt;/em>). Finally you&amp;nbsp;will need to&amp;nbsp;run your ExtractionJob through an&amp;nbsp;instance of&amp;nbsp;an ExtractionManager.&lt;br />
&lt;ul>&lt;li> &lt;tt>extractPage($pageID, $pageTitle, $pageSource)&lt;/tt>: Includes the&amp;nbsp;actual exraction process. Constructs a&amp;nbsp;new ExtractionResult, extracts data from the&amp;nbsp;sourcepage and&amp;nbsp;stores the&amp;nbsp;extracted data in&amp;nbsp;the ExtractionResult. Is&amp;nbsp;called for&amp;nbsp;each page and&amp;nbsp;must return an&amp;nbsp;ExtractionResult.&lt;/li>&lt;/ul>
PageCollections are&amp;nbsp;the data source for&amp;nbsp;extraction. A&amp;nbsp;PageCollection loads the&amp;nbsp;page sourcecode for&amp;nbsp;a&amp;nbsp;specific language and&amp;nbsp;one or&amp;nbsp;more pages (Implementations: LiveWikipedia, DatabaseWikipedia).&lt;br />
&lt;ul>&lt;li> &lt;tt>getSource($pageTitle)&lt;/tt>: returns the&amp;nbsp;Wikipedia sourcecode for&amp;nbsp;the page $pageTitle&lt;/li>&lt;/ul>
Collects RDFtriples while extraction is&amp;nbsp;in process. Each wikipage needs an&amp;nbsp;own instance of&amp;nbsp;ExtractionResult.&lt;/div>&lt;/div>
</description>
</item>
<item>
<title>2007-08-20 17:49:29</title>
<link>http://wiki.dbpedia.org/Documentation/show?time=2007-08-20+17%3A49%3A29</link>
<description>&lt;div class="pageBefore">&lt;img src="http://wiki.dbpedia.org/images/z.gif" width="1" height="1" border="0" alt="" style="display:block" align="top" />&lt;/div>&lt;div class="page">
&lt;b>Comparison for &lt;a  href="http://wiki.dbpedia.org/Documentation" class="">/Documentation&lt;/a> of &lt;a href="http://wiki.dbpedia.org/Documentation?time=2007-08-20+17%3A49%3A29">2007-08-20 17:49:29&lt;/a> and &lt;a href="http://wiki.dbpedia.org/Documentation?time=2007-08-21+23%3A25%3A19">2007-08-21 23:25:19&lt;/a>&lt;/b>&lt;br />
&lt;br />
&lt;b>Additions:&lt;/b>&lt;br />
&lt;div class="additions">&lt;!--notypo-->&lt;fieldset>&lt;legend>&lt;strong> Contents   &lt;/strong>&lt;/legend>&lt;div class="toc1">&lt;a href="#h25-2">1.  Getting started &lt;/a>&lt;/div>&lt;div class="toc1">&lt;a href="#h25-3">2.  Functional overview &lt;/a>&lt;/div>&lt;div class="toc1">&lt;a href="#h25-4">3.  The&amp;nbsp;Interfaces &lt;/a>&lt;/div>&lt;div class="toc2">&lt;a href="#h25-5">3.1. Interface Destination&lt;/a>&lt;/div>&lt;div class="toc2">&lt;a href="#h25-6">3.2. Interface Extractor&lt;/a>&lt;/div>&lt;div class="toc2">&lt;a href="#h25-7">3.3. Interface PageCollection&lt;/a>&lt;/div>&lt;div class="toc2">&lt;a href="#h25-8">3.4. Interface RDFnode&lt;/a>&lt;/div>&lt;div class="toc1">&lt;a href="#h25-9">4. Essential Classes&lt;/a>&lt;/div>&lt;div class="toc2">&lt;a href="#h25-10">4.1. Class ExtractionGroup&lt;/a>&lt;/div>&lt;div class="toc2">&lt;a href="#h25-11">4.2. Class ExtractionJob&lt;/a>&lt;/div>&lt;div class="toc2">&lt;a href="#h25-12">4.3. Class ExtractionManager&lt;/a>&lt;/div>&lt;div class="toc2">&lt;a href="#h25-13">4.4. Class ExtractionResult&lt;/a>&lt;/div>&lt;div class="toc2">&lt;a href="#h25-14">4.5. Class RDFtriple&lt;/a>&lt;/div>&lt;/fieldset>&lt;!--/notypo-->&lt;/div>&lt;/div>
</description>
</item>
<item>
<title>2007-08-20 17:38:53</title>
<link>http://wiki.dbpedia.org/Documentation/show?time=2007-08-20+17%3A38%3A53</link>
<description>&lt;div class="pageBefore">&lt;img src="http://wiki.dbpedia.org/images/z.gif" width="1" height="1" border="0" alt="" style="display:block" align="top" />&lt;/div>&lt;div class="page">
&lt;b>Comparison for &lt;a  href="http://wiki.dbpedia.org/Documentation" class="">/Documentation&lt;/a> of &lt;a href="http://wiki.dbpedia.org/Documentation?time=2007-08-20+17%3A38%3A53">2007-08-20 17:38:53&lt;/a> and &lt;a href="http://wiki.dbpedia.org/Documentation?time=2007-08-20+17%3A49%3A29">2007-08-20 17:49:29&lt;/a>&lt;/b>&lt;br />
&lt;br />
&lt;b>Additions:&lt;/b>&lt;br />
&lt;div class="additions">The&amp;nbsp;code extraction process is&amp;nbsp;triggered via&amp;nbsp;the ExtractionManager, which starts one&amp;nbsp;or&amp;nbsp;more ExtractionJobs. An&amp;nbsp;ExtractionJob combines one&amp;nbsp;or&amp;nbsp;more ExtractionGroups with a&amp;nbsp;PageCollection. The&amp;nbsp;PageCollection is&amp;nbsp;the data source, e.g. all&amp;nbsp;articles from the&amp;nbsp;Wikipedia SQL-Dump. ExtractionGroups consist of&amp;nbsp;a Destination and&amp;nbsp;one or&amp;nbsp;more Extractors. Possible Destinations include your console, NTriple files and&amp;nbsp;the web&amp;nbsp;interface. Of&amp;nbsp;course you&amp;nbsp;are free to&amp;nbsp;write your own&amp;nbsp;destinations (e.g. databases, RDF/XML files). &lt;br />
Extractors are&amp;nbsp;designed for&amp;nbsp;single specific purposes. E.g. the&amp;nbsp;InfoboxExtractor reads out&amp;nbsp;information from Wikipedia Infoboxes. The&amp;nbsp;ShortAbstractExtractor gets the&amp;nbsp;first paragraph from an&amp;nbsp;article and&amp;nbsp;so&amp;nbsp;on. DBpedia comes with extractors for&amp;nbsp;many purposes already, though you&amp;nbsp;are invited to&amp;nbsp;add your own.&lt;br />
Extractors are&amp;nbsp;the core of&amp;nbsp;the data extraction process, as&amp;nbsp;they parse and&amp;nbsp;convert the&amp;nbsp;Wikipedia pages. For&amp;nbsp;each page, the&amp;nbsp;extracted data is&amp;nbsp;stored in&amp;nbsp;an instance of&amp;nbsp;ExtractionResult. &lt;br />
ExtractionGroups connect Extractors with Destinations. If&amp;nbsp;you want to&amp;nbsp;store the&amp;nbsp;ExtractionResults from all&amp;nbsp;Extractors in&amp;nbsp;a single file, one&amp;nbsp;ExtractionGroup is&amp;nbsp;sufficient. Just create a&amp;nbsp;new ExtractionGroup with a&amp;nbsp;Destination and&amp;nbsp;add the&amp;nbsp;extractors you&amp;nbsp;need.&lt;br />
If&amp;nbsp;you intend to&amp;nbsp;produce seperate output files for&amp;nbsp;each Extractor, you&amp;nbsp;will need an&amp;nbsp;own ExtractionGroup for&amp;nbsp;each extractor (as is&amp;nbsp;done in&amp;nbsp;the standard settings in&amp;nbsp;&lt;em>start.php&lt;/em>). Finally you&amp;nbsp;will need to&amp;nbsp;run your ExtractionJob through an&amp;nbsp;instance of&amp;nbsp;an ExtractionManager.&lt;br />
Destinations store extraction results. Included Destinations are&amp;nbsp;NTriple files (&lt;tt>NTripleDumpDestination&lt;/tt>), the&amp;nbsp;console (&lt;tt>SimpleDumpDestination&lt;/tt>) and&amp;nbsp;a&amp;nbsp;web interface (&lt;tt>WebDebugDestination&lt;/tt>).&lt;br />
&lt;ul>&lt;li> &lt;tt>start()&lt;/tt>: Initializes the&amp;nbsp;Destination (e.g. creates a&amp;nbsp;new NTriple file). Is&amp;nbsp;called once at&amp;nbsp;the beginning of&amp;nbsp;an ExtractionJob.
&lt;/li>&lt;li> &lt;tt>accept($extractionResult, $revisionID)&lt;/tt>: Requires an&amp;nbsp;ExtractionResult and&amp;nbsp;a&amp;nbsp;page revision. Reads out&amp;nbsp;each triple from the&amp;nbsp;ExtractionResult and&amp;nbsp;prints it&amp;nbsp;out or&amp;nbsp;stores it&amp;nbsp;in a&amp;nbsp;file. Is&amp;nbsp;called for&amp;nbsp;each page.
&lt;/li>&lt;li>&lt;tt>finish()&lt;/tt>: Closes the&amp;nbsp;destination. Is&amp;nbsp;called once at&amp;nbsp;the end&amp;nbsp;of&amp;nbsp;an ExtractionJob.
&lt;/li>&lt;li> &lt;tt>extractPage($pageID, $pageTitle, $pageSource)&lt;/tt>: Includes the&amp;nbsp;actual exraction process. Constructs a&amp;nbsp;new ExtractionResult, extracts data from the&amp;nbsp;sourcepage and&amp;nbsp;stores the&amp;nbsp;extracted data in&amp;nbsp;the ExtractionResult. Is&amp;nbsp;called for&amp;nbsp;each page and&amp;nbsp;must return an&amp;nbsp;ExtractionResult.
&lt;/li>&lt;li> &lt;tt>finish()&lt;/tt>: Closes the&amp;nbsp;extractor. Is&amp;nbsp;called once at&amp;nbsp;the end&amp;nbsp;of&amp;nbsp;an ExtractionJob.&lt;/li>&lt;/ul>
&lt;strong>Interface PageCollection:&lt;/strong>&lt;br />
PageCollections are&amp;nbsp;the data source for&amp;nbsp;extraction. A&amp;nbsp;PageCollection loads the&amp;nbsp;page sourcecode for&amp;nbsp;a&amp;nbsp;specific language and&amp;nbsp;one or&amp;nbsp;more pages (Implementations: LiveWikipedia, DatabaseWikipedia).&lt;br />
A&amp;nbsp;PageCollection must have the&amp;nbsp;methods:&lt;br />
URI, RDFliteral and&amp;nbsp;RDFblankNode are&amp;nbsp;implementations of&amp;nbsp;RDFnodes.&lt;br />
&lt;strong>Class ExtractionGroup&lt;/strong>&lt;br />
ExtractionGroups combine one&amp;nbsp;or&amp;nbsp;more Extractors with one&amp;nbsp;Destination. If&amp;nbsp;you want to&amp;nbsp;create a&amp;nbsp;single Dump for&amp;nbsp;each Extractor in&amp;nbsp;use, you&amp;nbsp;will need to&amp;nbsp;create a&amp;nbsp;new ExtractionGroup for&amp;nbsp;every Extractor, as&amp;nbsp;an ExtractionGroup can&amp;nbsp;hold only a&amp;nbsp;single Destination.&lt;br />
&lt;ul>&lt;li>&lt;tt>__construct($destination, $metadestination = NULL)&lt;/tt>: $destination is&amp;nbsp;an object of&amp;nbsp;a class, implementing the&amp;nbsp;interface Destination. $metadestination is&amp;nbsp;an optional Destination, where meta information can&amp;nbsp;be&amp;nbsp;stored. It&amp;nbsp;is mainly used by&amp;nbsp;the InfoboxExtractor, which collects all&amp;nbsp;predicate names in&amp;nbsp;a metdestination.&lt;/li>&lt;/ul>
&lt;strong>Class ExtractionJob&lt;/strong>&lt;br />
An&amp;nbsp;ExtractionJob combines one&amp;nbsp;or&amp;nbsp;more ExtractionGroups (Extractors + Destination) &lt;br />
with one&amp;nbsp;PageCollection (data source). ExtractionJobs are&amp;nbsp;executed by&amp;nbsp;the ExtractionManagaer.&lt;br />
&lt;ul>&lt;li>&lt;tt>addExtractionGroup($group)&lt;/tt>: Adds a&amp;nbsp;new ExtractionGroup to&amp;nbsp;the job.&lt;/li>&lt;/ul>
&lt;strong>Class ExtractionManager&lt;/strong>&lt;br />
The&amp;nbsp;ExtractioManager executes ExtractionJobs.&lt;br />
Cycling over all&amp;nbsp;ExtractionGroups, the&amp;nbsp;extraction manager first initializes the&amp;nbsp;Extractors and&amp;nbsp;the Destination (via their &lt;tt>start()&lt;/tt> method).&lt;br />
Next, it&amp;nbsp;iterates over all&amp;nbsp;pages from a&amp;nbsp;PageCollection and&amp;nbsp;passes the&amp;nbsp;page source to&amp;nbsp;each Extractor, triggering its&amp;nbsp;&lt;tt>extractPage()&lt;/tt> method.&lt;br />
Finally it&amp;nbsp;reads out&amp;nbsp;the ExtractionResults from every Extractor and&amp;nbsp;passes it&amp;nbsp;to&lt;br />
&lt;ul>&lt;li>&lt;tt>execute($job)&lt;/tt>: Starts the&amp;nbsp;job as&amp;nbsp;described above. Requires an&amp;nbsp;object of&amp;nbsp;class ExtractionJob.&lt;/li>&lt;/ul>
&lt;strong>Class ExtractionResult&lt;/strong>&lt;br />
Collects RDFtriples while extraction is&amp;nbsp;in process. Each wikipage needs an&amp;nbsp;own instance of&amp;nbsp;ExtractionResult.&lt;/div>&lt;br />
&lt;b>Deletions:&lt;/b>&lt;br />
&lt;div class="deletions">The&amp;nbsp;code extraction process is&amp;nbsp;triggered via&amp;nbsp;the Extraction&amp;nbsp;Manager, which starts one&amp;nbsp;or&amp;nbsp;more Extraction&amp;nbsp;Jobs. An&amp;nbsp;Extraction&amp;nbsp;Job combines one&amp;nbsp;or&amp;nbsp;more Extraction&amp;nbsp;Groups with a&amp;nbsp;Page&amp;nbsp;Collection. The&amp;nbsp;Page&amp;nbsp;Collection is&amp;nbsp;the data source, e.g. all&amp;nbsp;articles from the&amp;nbsp;Wikipedia SQL-Dump. Extraction&amp;nbsp;Groups consist of&amp;nbsp;a Destination and&amp;nbsp;one or&amp;nbsp;more Extractors. Possible Destinations include your console, NTriple files and&amp;nbsp;the web&amp;nbsp;interface. Of&amp;nbsp;course you&amp;nbsp;are free to&amp;nbsp;write your own&amp;nbsp;destinations (e.g. databases, RDF/XML files). &lt;br />
Extractors are&amp;nbsp;designed for&amp;nbsp;single specific purposes. E.g. the&amp;nbsp;Infobox&amp;nbsp;Extractor reads out&amp;nbsp;information from Wikipedia Infoboxes. The&amp;nbsp;Short&amp;nbsp;Abstract&amp;nbsp;Extractor gets the&amp;nbsp;first paragraph from an&amp;nbsp;article and&amp;nbsp;so&amp;nbsp;on. DBpedia comes with extractors for&amp;nbsp;many purposes already, though you&amp;nbsp;are invited to&amp;nbsp;add your own.&lt;br />
Extractors are&amp;nbsp;the core of&amp;nbsp;the data extraction process, as&amp;nbsp;they parse and&amp;nbsp;convert the&amp;nbsp;Wikipedia pages. For&amp;nbsp;each page, the&amp;nbsp;extracted data is&amp;nbsp;stored in&amp;nbsp;an instance of&amp;nbsp;Extraction&amp;nbsp;Result. &lt;br />
Extraction&amp;nbsp;Groups connect Extractors with Destinations. If&amp;nbsp;you want to&amp;nbsp;store the&amp;nbsp;Extraction&amp;nbsp;Results from all&amp;nbsp;Extractors in&amp;nbsp;a single file, one&amp;nbsp;Extraction&amp;nbsp;Group is&amp;nbsp;sufficient. Just create a&amp;nbsp;new Extraction&amp;nbsp;Group with a&amp;nbsp;Destination and&amp;nbsp;add the&amp;nbsp;extractors you&amp;nbsp;need.&lt;br />
If&amp;nbsp;you intend to&amp;nbsp;produce seperate output files for&amp;nbsp;each Extractor, you&amp;nbsp;will need an&amp;nbsp;own Extraction&amp;nbsp;Group for&amp;nbsp;each extractor (as is&amp;nbsp;done in&amp;nbsp;the standard settings in&amp;nbsp;&lt;em>start.php&lt;/em>). Finally you&amp;nbsp;will need to&amp;nbsp;run your Extraction&amp;nbsp;Job through an&amp;nbsp;instance of&amp;nbsp;an Extraction&amp;nbsp;Manager.&lt;br />
Destinations store extraction results. Included Destinations are&amp;nbsp;NTriple files (&lt;tt>NTripleDumpDestination&lt;/tt>), the&amp;nbsp;console (&lt;tt>Simple&amp;nbsp;Dump&amp;nbsp;Destination&lt;/tt>) and&amp;nbsp;a&amp;nbsp;web interface (&lt;tt>Web&amp;nbsp;Debug&amp;nbsp;Destination&lt;/tt>).&lt;br />
&lt;ul>&lt;li> &lt;tt>start()&lt;/tt>: Initializes the&amp;nbsp;Destination (e.g. creates a&amp;nbsp;new NTriple file). Is&amp;nbsp;called once at&amp;nbsp;the beginning of&amp;nbsp;an Extraction&amp;nbsp;Job.
&lt;/li>&lt;li> &lt;tt>accept($extractionResult, $revisionID)&lt;/tt>: Requires an&amp;nbsp;Extraction&amp;nbsp;Result and&amp;nbsp;a&amp;nbsp;page revision. Reads out&amp;nbsp;each triple from the&amp;nbsp;Extraction&amp;nbsp;Result and&amp;nbsp;prints it&amp;nbsp;out or&amp;nbsp;stores it&amp;nbsp;in a&amp;nbsp;file. Is&amp;nbsp;called for&amp;nbsp;each page.
&lt;/li>&lt;li>&lt;tt>finish()&lt;/tt>: Closes the&amp;nbsp;destination. Is&amp;nbsp;called once at&amp;nbsp;the end&amp;nbsp;of&amp;nbsp;an Extracion&amp;nbsp;Job.
&lt;/li>&lt;li> &lt;tt>extractPage($pageID, $pageTitle, $pageSource)&lt;/tt>: Includes the&amp;nbsp;actual exraction process. Constructs a&amp;nbsp;new Extraction&amp;nbsp;Result, extracts data from the&amp;nbsp;sourcepage and&amp;nbsp;stores the&amp;nbsp;extracted data in&amp;nbsp;the Extraction&amp;nbsp;Result. Is&amp;nbsp;called for&amp;nbsp;each page and&amp;nbsp;must return an&amp;nbsp;Extraction&amp;nbsp;Result.
&lt;/li>&lt;li> &lt;tt>finish()&lt;/tt>: Closes the&amp;nbsp;extractor. Is&amp;nbsp;called once at&amp;nbsp;the end&amp;nbsp;of&amp;nbsp;an extraction job.&lt;/li>&lt;/ul>
&lt;strong>Interface Page&amp;nbsp;Collection:&lt;/strong>&lt;br />
Page&amp;nbsp;Collections are&amp;nbsp;the data source for&amp;nbsp;extraction. A&amp;nbsp;Page&amp;nbsp;Collection loads the&amp;nbsp;page sourcecode for&amp;nbsp;a&amp;nbsp;specific language and&amp;nbsp;or&amp;nbsp;more pages (Implementations: Live&amp;nbsp;Wikipedia, Database&amp;nbsp;Wikipedia).&lt;br />
A&amp;nbsp;Page&amp;nbsp;Collection must have the&amp;nbsp;methods:&lt;br />
URI, RDFliteral and&amp;nbsp;RDFblankNode are&amp;nbsp;implementations of&amp;nbsp;RDFnodes.&lt;br />
&lt;strong>Class Extraction&amp;nbsp;Group&lt;/strong>&lt;br />
Extraction&amp;nbsp;Groups combine one&amp;nbsp;or&amp;nbsp;more Extractors with one&amp;nbsp;Destination. If&amp;nbsp;you want to&amp;nbsp;create a&amp;nbsp;single Dump for&amp;nbsp;each Extractor in&amp;nbsp;use, you&amp;nbsp;will need to&amp;nbsp;create a&amp;nbsp;new Extraction&amp;nbsp;Group for&amp;nbsp;every Extractor, as&amp;nbsp;an Extraction&amp;nbsp;Group can&amp;nbsp;hold only a&amp;nbsp;single Destination.&lt;br />
&lt;ul>&lt;li>&lt;tt>__construct($destination, $metadestination = NULL)&lt;/tt>: $destination is&amp;nbsp;an object of&amp;nbsp;a class, implementing the&amp;nbsp;interface Destination. $metadestination is&amp;nbsp;an optional Destination, where meta information can&amp;nbsp;be&amp;nbsp;stored. It&amp;nbsp;is mainly used by&amp;nbsp;the Infobox&amp;nbsp;Extractor, which collects all&amp;nbsp;predicate names in&amp;nbsp;a metdestination.&lt;/li>&lt;/ul>
&lt;strong>Class Extraction&amp;nbsp;Job&lt;/strong>&lt;br />
An&amp;nbsp;Extraction&amp;nbsp;Job combines one&amp;nbsp;or&amp;nbsp;more Extraction&amp;nbsp;Groups (Extractors + Destination) &lt;br />
with one&amp;nbsp;Page&amp;nbsp;Collection (data source). Extraction&amp;nbsp;Jobs are&amp;nbsp;executed by&amp;nbsp;the Extraction&amp;nbsp;Managaer.&lt;br />
&lt;ul>&lt;li>&lt;tt>addExtractionGroup($group)&lt;/tt>: Adds a&amp;nbsp;new Extraction&amp;nbsp;Group to&amp;nbsp;the job.&lt;/li>&lt;/ul>
&lt;strong>Class Extraction&amp;nbsp;Manager&lt;/strong>&lt;br />
The&amp;nbsp;Extractio&amp;nbsp;Manager executes Extraction&amp;nbsp;Jobs.&lt;br />
Cycling over all&amp;nbsp;Extraction&amp;nbsp;Groups, the&amp;nbsp;extraction manager first initializes the&amp;nbsp;Extractors and&amp;nbsp;the Destination (via their &lt;tt>start()&lt;/tt> method).&lt;br />
Next, it&amp;nbsp;iterates over all&amp;nbsp;pages from a&amp;nbsp;Page&amp;nbsp;Collection and&amp;nbsp;passes the&amp;nbsp;page source to&amp;nbsp;each Extractor, triggering its&amp;nbsp;&lt;tt>extractPage()&lt;/tt> method.&lt;br />
Finally it&amp;nbsp;reads out&amp;nbsp;the Extraction&amp;nbsp;Results from every Extractor and&amp;nbsp;passes it&amp;nbsp;to&lt;br />
&lt;ul>&lt;li>&lt;tt>execute($job)&lt;/tt>: Starts the&amp;nbsp;job as&amp;nbsp;described above. Requires an&amp;nbsp;object of&amp;nbsp;class Extraction&amp;nbsp;Job.&lt;/li>&lt;/ul>
&lt;strong>Class Extraction&amp;nbsp;Result&lt;/strong>&lt;br />
Collects RDFtriples while extraction is&amp;nbsp;in process. Each wikipage needs an&amp;nbsp;own instance of&amp;nbsp;Extraction&amp;nbsp;Result.&lt;/div>&lt;/div>
</description>
</item>
</channel>
</rss>
