Sharing Knowledge: Scientific Communication - SSOAR

Weitere Magazine

Empfehlungen

Info

166 Judith Plümer Migration from HTML META to RDF Above we indicated the advantages that RDF has in comparison with HTML META. The question that we want to discuss here is how the software can handle the richer structure. HTML META is in a mathematical sense equivalent to the SOIF format since both store attribute/value pairs. But that precludes the use of RDF on the base of the current harvest software. What we need is a software with the power to handle RDF and the wisdom to integrate SOIF data that come from some gatherer agents in the world and don’t want to update their software. In the CARMEN (http://www.math.uos.de/projects/carmen/) project of the federal ministry of science (Global Info program of the BMBF) tools were developed that can be plugged together to operate MPRESS on the basis of RDF. X-Harvest The X-Harvest [Kokkelink, 2000] software is a modification of the Harvest-NG (http://webharvest.sourceforge.net/ng/) software that substitutes the internal SOIF format by RDF. That means X-Harvest is a substitute for the gatherer component of the Harvest software which stores the summaries of the documents in RDF format. X-Harvest is completely written in Perl. That makes addition and modification of features easy but on the other hand it makes the installation a real challenge: the Harvest-NG software needs the installation of 8 additional Perl modules which require further Perl modules. X-Harvest needs additional Perl modules which piles up to 25 Perl modules totally that have to be installed. Therefore we decided to implement a script that takes care of the installation procedure. But this modular architecture solves for example the problem of character sets because X-Harvest uses the Unicode module and hence handles Unicode characters. As well there are modules for the X-Harvest software that are able to solve the heterogeneity problems that we mentioned above: What do we do with HTML documents without any metadata and with documents coming up in formats that are not able to carry metadata information like PostScript? For this purpose there is a summarizer that generates metadata out of HTML documents on probabilistic guesses and a summarizer for PostScript documents that extracts metadata by given heuristics (http://www.math.uos.de/ projects/carmen/AP11/). The use of heuristics in this context makes complete sense since mathematical papers are at most all of the same structure and mostly offered in PostScript which was generated out of TeX/LaTeX. This, however, is an aspect for which the described solution does not have to be scalable for other disciplines.
MPRESS - transition of metadata formats 167 Transition to RDF creates the heterogenity problem that we now have, which is a zoo of sources offering index data for MPRESS: old harvest gatherers offering SOIF, X-Harvest gatherers offering RDF and agents that export index data in the Open Archives Protocoll (http://www.openarchives.org/). But this is unavoidable as long as one accepts data coming up in the former formats what we want to do in MPRESS. We want to import this zoo of sources into HyREX [Gövert, 2002]. During the CARMEN project a script (broker.pl) was developed that collects data from harvest gatherers, X-Harvest gatherers and interfaces using the Open Archives Protocoll. The data that are stored by this script have to be converted to XML using for example soif2xml. Then the whole set of data is in XML format and converted again using XSLT. This procedure results in a set of XML documents that are valid against the DTD we are using to index the data with HyREX. Bibliography Brickley, Dan; Bray, Tim: “What is RDF?”, http://www.xml.com/pub/a/2001/01/24/rdf.html, 2001. Dalitz, Wolfgang; Grötschel, Martin; Lügger, Joachim: “Information Services for Mathematics in the Internet (Math-Net)”, 15th IMACS World Congress 1997 on <strong>Scientific</strong> Computation, Medelling and Applied Mathematis, Volume 4: Artificial Intelligence and Computer Science, Wissenschaft und Technik, Berlin, 1997, p. 773-778. Gövert, Norvert; Großjohann, Kai: „HyREX Manual“, http://www.is.informatik.uni-duisburg.de/projects/hyrex/manual.pdf. Grötschel, Martin; Lügger, Joachim: “<strong>Scientific</strong> Information Systems and Metadata”, Classification in the Information Age, Springer, 1999, p.3-20. Hardie, T.; Bowman, M.; Hardy, Darren R.; Schwartz, Michael F.; Wessels, Duane: “CIP Index Oject Format for SOIF Objects”, IETF RFC2655, 1999. Hardy, Darren R.; Schwartz, Michael F.: “Customized information extraction as a basis for resource discovery”, ACM Transactions on Computer Systems, Vol. 14 no. 2, 1996, p 171-199. Hardy, Darren R.; Schwartz, Michael F.; Wessels, Duane: “Harvest – Effective use of Internet Information”, University of Colorado at Boulder, Technical Report CU-CS-743-94, 1996. Kokkelink, Stefan: “Simple XML/RDF extension of Harvest-NG”, http://www.math.uos.de/projects/carmen/AP7/DOM/dom.htm, 2000. Kunze, John: “Encoding Dublin Core in HTML”, IETF RFC 2731, 1999. Rivest, R. L.: “The MD5 Message-Digest Algorithm”, IETF RFC 1321, 1992.
Seite 1:
Sharing Knowledge: Scientific Commu
Seite 4 und 5:
Tagungsberichte Herausgegeben vom I
Seite 6 und 7:
Die Deutsche Bibliothek - CIP-Einhe
Seite 8 und 9:
6 Inhalt Infrastrukturen für innov
Seite 11:
Vorwort Zur neunten Frühjahrstagun
Seite 14 und 15:
12 Heike Andermann ted, in 1994-199
Seite 16 und 17:
14 Heike Andermann schaftlerInnen e
Seite 18 und 19:
16 Heike Andermann tung beibehalten
Seite 20 und 21:
18 Heike Andermann NBII). 26 Für d
Seite 23 und 24:
Qualitätssicherung und Nutzung von
Seite 25 und 26:
Seite 27 und 28:
Seite 29 und 30:
Seite 31 und 32:
Seite 33 und 34:
Seite 35 und 36:
Seite 37 und 38:
Seite 39 und 40:
vascoda Das gemeinsame Portal von I
Seite 41 und 42:
vascoda - Das gemeinsame Portal von
Seite 43 und 44:
Seite 45 und 46:
Seite 47:
Seite 50 und 51:
48 Klaus Hahn Abstract The advancem
Seite 52 und 53:
50 Klaus Hahn her“ [6]). So wäre
Seite 54 und 55:
52 Klaus Hahn men auch als Begriff
Seite 56 und 57:
54 Klaus Hahn Arbeiten notwendig (>
Seite 58 und 59:
56 Klaus Hahn Fazit Zur effektiven
Seite 61 und 62:
Unterstützung kooperativer Verfahr
Seite 63 und 64:
Seite 65 und 66:
Seite 67 und 68:
Seite 69 und 70:
Seite 71 und 72:
Seite 73 und 74:
PhysNet und seine Spiegel - Das Pro
Seite 75 und 76:
Seite 77 und 78:
Seite 79 und 80:
Seite 81 und 82:
Seite 83 und 84:
Seite 85 und 86:
Online-Hochschulschriften für die
Seite 87 und 88:
Seite 89 und 90:
Seite 91 und 92:
Seite 93 und 94:
Seite 95:
Seite 98 und 99:
96 Rudi Schmiede, Stephan Körnig s
Seite 100 und 101:
98 Rudi Schmiede, Stephan Körnig s
Seite 102 und 103:
100 Rudi Schmiede, Stephan Körnig
Seite 104 und 105:
Seite 106 und 107:
Seite 108 und 109:
Seite 110 und 111:
108 Jutta von Maurice strument lief
Seite 112 und 113:
110 Jutta von Maurice bare Kopien v
Seite 114 und 115:
112 Jutta von Maurice einschließli
Seite 116 und 117:
114 Jutta von Maurice hebungen. Dem
Seite 118 und 119: 116 Jutta von Maurice http://www.df
Seite 121 und 122: Maßnahmen zur Förderung der Infor
Seite 133 und 134: LIMES - A System for a Distributed
Seite 139: LIMES - A System for a Distributed
Seite 142 und 143: 140 Frank Oldenettel, Michael Malac
Seite 160 und 161: 158 Judith Plümer prints werden du
Seite 162 und 163: 160 Judith Plümer modification, th
Seite 164 und 165: 162 Judith Plümer The summarizers
Seite 166 und 167: 164 Judith Plümer based on a few p
Seite 170 und 171: 168 Judith Plümer Contact Judith P
Seite 172 und 173: 170 Dennis Reil für traditionelle
Seite 174 und 175: 172 Dennis Reil gartner (2002) ist
Seite 176 und 177: 174 Dennis Reil können. Wichtig f
Seite 178 und 179: 176 Dennis Reil nahezu identisch, s
Seite 180 und 181: 178 Dennis Reil Insgesamt kann also
Seite 183 und 184: Reflections on the Value Chain of S
Seite 193: Reflections on the Value Chain of S
Seite 196 und 197: 194 Natascha Schumann, Wolfgang Mei
Seite 207 und 208: ViFaPhys - Virtuelle Fachbibliothek
Seite 215 und 216: Weiterentwicklung von digitalen Bib
Seite 217 und 218: Weiterentwicklung von digitalen Bib
Seite 219 und 220:
Weiterentwicklung von digitalen Bib
Seite 221 und 222:
Seite 223 und 224:
Seite 225 und 226:
Seite 227:
Seite 230 und 231:
228 Markus Kalb, Günther Specht ve
Seite 232 und 233:
230 Markus Kalb, Günther Specht Di
Seite 234 und 235:
232 Markus Kalb, Günther Specht Da
Seite 236 und 237:
234 Markus Kalb, Günther Specht 4.
Seite 238 und 239:
236 Markus Kalb, Günther Specht pe
Seite 240 und 241:
238 Markus Kalb, Günther Specht [O
Seite 242 und 243:
240 Maximilian Stempfhuber � Info
Seite 244 und 245:
242 Maximilian Stempfhuber Weiterle
Seite 246 und 247:
244 Maximilian Stempfhuber Zum zwei
Seite 248 und 249:
246 Maximilian Stempfhuber Abbildun
Seite 251 und 252:
Das didaktische Metadatensystem DML
Seite 253 und 254:
Seite 255 und 256:
Seite 257 und 258:
Seite 259 und 260:
Seite 261 und 262:
Seite 263 und 264:
Seite 265 und 266:
Seite 267 und 268:
Seite 269 und 270:
The C 2 M project: a wrapper genera
Seite 271 und 272:
Seite 273 und 274:
Seite 275 und 276:
Seite 277 und 278:
Seite 279 und 280:
Seite 281 und 282:
Seite 283 und 284:
Seite 285 und 286:
Analyse der Qualität der multimedi
Seite 287 und 288:
Seite 289 und 290:
Seite 291 und 292:
Seite 293 und 294:
Seite 296:
Ziele des erstmals von der Initiati
Alle anzeigen

Sharing Knowledge: Scientific Communication - SSOAR

Erfolgreiche ePaper selbst erstellen

Template löschen?

Als Template speichern?