Sharing Knowledge: Scientific Communication - SSOAR

Weitere Magazine

Empfehlungen

Info

278 Paul van der Vet, Eelco Mossel ct-first-line -> “File generated by C2M” end-of-line which puts the string at the right-hand side on the first line of the target file and ends it with an end-of-line character. Next, at the second line called ct-numbers-line, the numbers of atom lines and bond lines have to be filled in. In the native representation, the numbers of atoms and bonds are not present explicitly. Therefore we have to establish them in another way. The C2M language has the construct: fl-text(Supersymbol, Symbol, NrOccs) that is a canned version of the grammar rule Supersymbol -> Symbol+ except that the number of occurrences of Symbol is fixed to be NrOccs. This can be either an explicit number or a grammar symbol that writes that number at the appropriate place. In the latter case, there are two occurrences of the NrOccs symbol that should be at the right-hand side of the same grammar rule. Using this construct, we may choose to write the entire CT file using the rule: ct-file -> “File generated by C2M” end-of-line space space nr-atom-lines space space nr-bond-lines end-of-line fl-text(ct-atom-lines, ct-atom-line, nr-atom-lines) fl-text(ct-bond-lines, ct-bond-line, nr-bond-lines) which will write a correct CT file provided there are correct rules for ct-atomline and ct-bond-line elsewhere in the grammar for reading. 4 Implementation The current version of C2M is implemented in Prolog. 16 We believe this has considerably eased the development of the system, but we also want to stress that the same functionality can be realised in any other programming language. Indeed, sharing and reuse of C2M specifications should be independent of the programming language or languages used. The implementation has been inspired by work in natural-language processing. For example, the two read steps are syntactic and semantic analysis, precisely as in many natural-language understanding systems. For syntactic analysis, we use the parser generator built into Prolog. Semantic analysis is based on knowledge-based system technology. The converter core of C2M incorporates a 16 L. Sterling and E. Shapiro, The art of Prolog, Cambridge MA: MIT Press, 1994 (2nd edition); I. Bratko, Prolog programming for artificial intelligence, Harlow: Addison-Wesley, 2001 (3rd edition).
The C 2 M project: a wrapper generator for chemistry and biology 279 special-purpose inference engine that uses the semantic parts of a file format specification as knowledge base. A few points in the implementation merit attention. Using grammars in specification files to generate parsers is a very general way to handle the syntax of formats. Because tokens can be delimited by any character, a separate tokenisation step is not foreseen. Because the parser analyses the source file on a character-by-character basis, C2M can handle any type of file but conversion will proceed slower than in the presence of a tokenisation step. Even control codes can be defined as characters and handled appropriately. Molecular structure files for not too large molecules (say, with 1000 atoms or less) are processed within a second. Preliminary experiments have shown that the scaling behaviour of the system approaches linear behaviour. The C2M compiler in fact converts a specification file into a Prolog file. In other words, the compile task can in principle also be done by the C2M converter, given the appropriate specification files. One reason not to do so is efficiency. In specification files we do know what the token delimiters are, and therefore we can insert a tokenisation step before the parsing step. Also, the C2M compiler optimises to some extent. Code optimisation is beyond the capabilities of a format converter such as C2M. Finally, the Prolog implementation we used is Quintus Prolog 17 because it is robust and very fast. Although most constructs are standard Prolog, 18 we sometimes made use of Quintus built-ins because they are more efficient. Quintus Prolog is proprietary software. We are currently contemplating to implement future versions in Java. 5 Conclusion We have given a very cursory overview of the C2M language. The language is designed with ease of use as primary concern, where “ease of use” has been operationalised mainly as “familiar” in a number of ways. User surveys will have to bear out whether we succeeded in this respect. A wide scope has been another design criterion. It has been operationalised by analysing foreign files at the level of individual characters. C2M, however, does not aim to make existing software superfluous. For example, the provision of extensive options for calculations has not been considered because there is well-known and widely available software that can do this. A desktop system built around C2M will perform calculations be having C2M convert some intermediate result into a query to a calculation package, and having C2M convert the package’s output into a suitable form for further processing. 17 http://www.sics.se/isl/quintus/ 18 P. Deransart, A.A. Ed-Dbali, and L. Cervoni, Prolog: the standard, Berlin: Springer, 1996.
Seite 1:
Sharing Knowledge: Scientific Commu
Seite 4 und 5:
Tagungsberichte Herausgegeben vom I
Seite 6 und 7:
Die Deutsche Bibliothek - CIP-Einhe
Seite 8 und 9:
6 Inhalt Infrastrukturen für innov
Seite 11:
Vorwort Zur neunten Frühjahrstagun
Seite 14 und 15:
12 Heike Andermann ted, in 1994-199
Seite 16 und 17:
14 Heike Andermann schaftlerInnen e
Seite 18 und 19:
16 Heike Andermann tung beibehalten
Seite 20 und 21:
18 Heike Andermann NBII). 26 Für d
Seite 23 und 24:
Qualitätssicherung und Nutzung von
Seite 25 und 26:
Seite 27 und 28:
Seite 29 und 30:
Seite 31 und 32:
Seite 33 und 34:
Seite 35 und 36:
Seite 37 und 38:
Seite 39 und 40:
vascoda Das gemeinsame Portal von I
Seite 41 und 42:
vascoda - Das gemeinsame Portal von
Seite 43 und 44:
Seite 45 und 46:
Seite 47:
Seite 50 und 51:
48 Klaus Hahn Abstract The advancem
Seite 52 und 53:
50 Klaus Hahn her“ [6]). So wäre
Seite 54 und 55:
52 Klaus Hahn men auch als Begriff
Seite 56 und 57:
54 Klaus Hahn Arbeiten notwendig (>
Seite 58 und 59:
56 Klaus Hahn Fazit Zur effektiven
Seite 61 und 62:
Unterstützung kooperativer Verfahr
Seite 63 und 64:
Seite 65 und 66:
Seite 67 und 68:
Seite 69 und 70:
Seite 71 und 72:
Seite 73 und 74:
PhysNet und seine Spiegel - Das Pro
Seite 75 und 76:
Seite 77 und 78:
Seite 79 und 80:
Seite 81 und 82:
Seite 83 und 84:
Seite 85 und 86:
Online-Hochschulschriften für die
Seite 87 und 88:
Seite 89 und 90:
Seite 91 und 92:
Seite 93 und 94:
Seite 95:
Seite 98 und 99:
96 Rudi Schmiede, Stephan Körnig s
Seite 100 und 101:
98 Rudi Schmiede, Stephan Körnig s
Seite 102 und 103:
100 Rudi Schmiede, Stephan Körnig
Seite 104 und 105:
Seite 106 und 107:
Seite 108 und 109:
Seite 110 und 111:
108 Jutta von Maurice strument lief
Seite 112 und 113:
110 Jutta von Maurice bare Kopien v
Seite 114 und 115:
112 Jutta von Maurice einschließli
Seite 116 und 117:
114 Jutta von Maurice hebungen. Dem
Seite 118 und 119:
116 Jutta von Maurice http://www.df
Seite 121 und 122:
Maßnahmen zur Förderung der Infor
Seite 123 und 124:
Seite 125 und 126:
Seite 127 und 128:
Seite 129 und 130:
Seite 131 und 132:
Seite 133 und 134:
LIMES - A System for a Distributed
Seite 135 und 136:
Seite 137 und 138:
Seite 139:
Seite 142 und 143:
140 Frank Oldenettel, Michael Malac
Seite 144 und 145:
Seite 146 und 147:
Seite 148 und 149:
Seite 150 und 151:
Seite 152 und 153:
Seite 154 und 155:
Seite 156 und 157:
Seite 158 und 159:
Seite 160 und 161:
158 Judith Plümer prints werden du
Seite 162 und 163:
160 Judith Plümer modification, th
Seite 164 und 165:
162 Judith Plümer The summarizers
Seite 166 und 167:
164 Judith Plümer based on a few p
Seite 168 und 169:
166 Judith Plümer Migration from H
Seite 170 und 171:
168 Judith Plümer Contact Judith P
Seite 172 und 173:
170 Dennis Reil für traditionelle
Seite 174 und 175:
172 Dennis Reil gartner (2002) ist
Seite 176 und 177:
174 Dennis Reil können. Wichtig f
Seite 178 und 179:
176 Dennis Reil nahezu identisch, s
Seite 180 und 181:
178 Dennis Reil Insgesamt kann also
Seite 183 und 184:
Reflections on the Value Chain of S
Seite 185 und 186:
Seite 187 und 188:
Seite 189 und 190:
Seite 191 und 192:
Seite 193:
Seite 196 und 197:
194 Natascha Schumann, Wolfgang Mei
Seite 198 und 199:
Seite 200 und 201:
Seite 202 und 203:
Seite 204 und 205:
Seite 207 und 208:
ViFaPhys - Virtuelle Fachbibliothek
Seite 209 und 210:
Seite 211 und 212:
Seite 213 und 214:
Seite 215 und 216:
Weiterentwicklung von digitalen Bib
Seite 217 und 218:
Seite 219 und 220:
Seite 221 und 222:
Seite 223 und 224:
Seite 225 und 226:
Seite 227:
Seite 230 und 231: 228 Markus Kalb, Günther Specht ve
Seite 232 und 233: 230 Markus Kalb, Günther Specht Di
Seite 234 und 235: 232 Markus Kalb, Günther Specht Da
Seite 236 und 237: 234 Markus Kalb, Günther Specht 4.
Seite 238 und 239: 236 Markus Kalb, Günther Specht pe
Seite 240 und 241: 238 Markus Kalb, Günther Specht [O
Seite 242 und 243: 240 Maximilian Stempfhuber � Info
Seite 244 und 245: 242 Maximilian Stempfhuber Weiterle
Seite 246 und 247: 244 Maximilian Stempfhuber Zum zwei
Seite 248 und 249: 246 Maximilian Stempfhuber Abbildun
Seite 251 und 252: Das didaktische Metadatensystem DML
Seite 269 und 270: The C 2 M project: a wrapper genera
Seite 279: The C 2 M project: a wrapper genera
Seite 285 und 286: Analyse der Qualität der multimedi
Seite 296: Ziele des erstmals von der Initiati
Alle anzeigen

Sharing Knowledge: Scientific Communication - SSOAR

Erfolgreiche ePaper selbst erstellen

Template löschen?

Als Template speichern?