JST Vol. 21 (1) Jan. 2013 - Pertanika Journal - Universiti Putra ...

Yogan Jaya Kumar, Naomie Salim, Ahmed Hamza Osman and Albaraa Abuobieda

“Contradiction”, “Description”, and “Historical background”. Table 1 shows some examples of sentence pairs that hold a CST relationship. Full descriptions of the CST relations are given in Radev (2000).

CST is closely related to Rhetorical Structure Theory (RST) (Taboada & Mann, 2006). The difference between the two theories is that RST aims to capture the rhetorical relations between spans of adjacent text units, while CST describes the rhetorical relations across topically related documents. In topically related documents, especially news articles, the information content is closely connected even when the news stories come from various sources. As the descriptions of the CST relations in Table 1 show, these types of relations are essential for analysing redundancy, complementarity and contradiction among different information sources. Thus, the ability to automatically identify the types of CST relationship is useful for tasks related to multi-document analysis.

A number of research works have addressed the benefits of CST for the summarization task (see, for instance, Zhang et al., 2002; Jorge et al., 2010). Nonetheless, a major limitation of these works is that the CST relationships need to be manually annotated by human experts. Human annotation is not only expensive but also time-consuming.

There have been efforts to learn the CST relationships in texts. Zhang et al. (2003) used boosting, a classification algorithm, to identify the presence of CST relationships between sentences. It is an adaptive algorithm that iteratively trains weak classifiers, each informed by the errors of the previous ones, and adds them to a final strong classifier. The authors experimented with the CST-annotated article clusters that make up the CSTBank corpus. Lexical, syntactic and semantic features were used for data representation.
Their classifier was able to identify sentence pairs with no relationship very well, but performed rather poorly in classifying the other types of CST relationship.

Fig.1: CST general schema (Radev, 2000)

Pertanika J. Sci. & Technol. 21 (1): 283 - 298 (2013)
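The boosting idea described above for Zhang et al. (2003) can be illustrated with a minimal AdaBoost sketch. This is an assumption-laden illustration and not their implementation: AdaBoost is only one common boosting variant, the weak learner here is a one-feature threshold rule (a decision stump), and the single feature (a hypothetical sentence-pair similarity score) and the toy labels (+1 for "some CST relationship", -1 for "no relationship") are invented for the example.

```python
import math


def train_stump(xs, ys, weights):
    """Return (weighted_error, threshold, polarity) of the best 1-D stump."""
    best = None
    for threshold in sorted(set(xs)):
        for polarity in (1, -1):
            preds = [polarity if x >= threshold else -polarity for x in xs]
            err = sum(w for w, p, y in zip(weights, preds, ys) if p != y)
            if best is None or err < best[0]:
                best = (err, threshold, polarity)
    return best


def stump_predict(x, threshold, polarity):
    """Predict +1 or -1 with a single threshold rule."""
    return polarity if x >= threshold else -polarity


def adaboost(xs, ys, rounds=5):
    """Train an ensemble of (alpha, threshold, polarity) weak classifiers."""
    n = len(xs)
    weights = [1.0 / n] * n  # start with uniform example weights
    ensemble = []
    for _ in range(rounds):
        err, threshold, polarity = train_stump(xs, ys, weights)
        err = min(max(err, 1e-10), 1 - 1e-10)  # avoid log(0) and division by zero
        alpha = 0.5 * math.log((1 - err) / err)  # vote of this weak classifier
        ensemble.append((alpha, threshold, polarity))
        # Increase the weight of misclassified examples, decrease the rest.
        weights = [
            w * math.exp(-alpha * y * stump_predict(x, threshold, polarity))
            for w, x, y in zip(weights, xs, ys)
        ]
        total = sum(weights)
        weights = [w / total for w in weights]
    return ensemble


def predict(ensemble, x):
    """Final strong classifier: sign of the weighted vote of all stumps."""
    score = sum(a * stump_predict(x, t, p) for a, t, p in ensemble)
    return 1 if score >= 0 else -1


# Toy data: hypothetical similarity scores and invented labels.
xs = [0.05, 0.10, 0.20, 0.60, 0.80, 0.90]
ys = [-1, -1, -1, 1, 1, 1]
model = adaboost(xs, ys, rounds=3)
print([predict(model, x) for x in xs])  # [-1, -1, -1, 1, 1, 1]
```

The reweighting step is what makes the procedure adaptive: examples that the current weak classifier gets wrong carry more weight when the next one is trained.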
Using SVMs for Classification of Cross-Document Relationships

TABLE 1: Some examples of the CST relationship between sentences (source: Zhang et al., 2002)

Relationship | Description | Text span 1 (S1) | Text span 2 (S2)
Identity | The same text appears in more than one location | Tony Blair was elected for a second term today. | Tony Blair was elected for a second term today.
Equivalence | Two text spans have the same information content | Derek Bell is experiencing resurgence in his career. | Derek Bell is having a “comeback year.”
Translation | The same information content in different languages | Shouts of “Viva la revolucion!” echoed through the night. | The rebels could be heard shouting, “Long live the revolution”.
Subsumption | S1 contains all information in S2, plus additional information not in S2 | With 3 wins this year, Green Bay has the best record in the NFL. | Green Bay has 3 wins this year.
Contradiction | Conflicting information | There were 122 people on the downed plane. | 126 people were aboard the plane.
Historical Background | S1 gives historical context to information in S2 | This was the fourth time a member of the Royal Family has gotten divorced. | The Duke of Windsor was divorced from the Duchess of Windsor yesterday.

In another related work, Miyabe et al. (2008) investigated the identification of CST relationship types using cluster-wise classification with an SVM classifier. They used a Japanese cross-document relation corpus annotated with CST relationships. The authors proposed using the detected “Equivalence” relations to address the task of “Transition” identification. In particular, similarity through the variable noun phrases was used for transition identification. They obtained an F-measure of 75.50% for equivalence and 45.64% for transition. However, their approach is limited to these two relations.

Closely related to our work is the approach by Zahri and Fukumoto (2011).
The authors determined five types of CST relation between sentences using SVMs. They computed lexical features between sentence pairs using the dataset from the CSTBank corpus, and then used the identified CST relations to determine the directionality between sentences for PageRank (Erkan & Radev, 2004) computation. However, no experimental results were reported specifically on the performance of their CST relationship classification. Such results are essential because classification performance directly affects the final results of the system.

METHODS

Relying on manually annotated text for CST relationship identification consumes a lot of time and resources. Thus, it is favourable to have a system that can automatically identify the existence of CST relations between pairs of sentences. At this point, however, we consider only four types of CST relations, namely “Identity”, “Overlap”, “Subsumption”, and “Description”. More details of these relations are given in Table 2, and further details with examples can be found in Zhang et al. (2002).
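To make the idea of lexical features between sentence pairs concrete, the following is a minimal sketch of how a pair of sentences might be turned into a small feature vector before being passed to a classifier such as an SVM. The feature set (cosine similarity, word overlap, coverage of S2 by S1, and length difference), the tokenizer, and all function names are assumptions chosen for illustration; the features actually used in the works cited above are not specified in this section.

```python
import math
import re
from collections import Counter


def tokenize(sentence):
    """Lowercase the sentence and split it into alphanumeric tokens."""
    return re.findall(r"[a-z0-9]+", sentence.lower())


def pair_features(s1, s2):
    """Compute simple lexical features for the sentence pair (S1, S2)."""
    t1, t2 = tokenize(s1), tokenize(s2)
    c1, c2 = Counter(t1), Counter(t2)

    # Cosine similarity between bag-of-words count vectors.
    dot = sum(c1[w] * c2[w] for w in c1)
    norm = math.sqrt(sum(v * v for v in c1.values())) * math.sqrt(
        sum(v * v for v in c2.values())
    )
    cosine = dot / norm if norm else 0.0

    # Jaccard word overlap and the share of S2's words that S1 also contains.
    set1, set2 = set(t1), set(t2)
    shared = set1 & set2
    union = set1 | set2
    overlap = len(shared) / len(union) if union else 0.0
    s2_coverage = len(shared) / len(set2) if set2 else 0.0

    return {
        "cosine": cosine,
        "overlap": overlap,
        "s2_coverage": s2_coverage,
        "length_diff": len(t1) - len(t2),  # in tokens, S1 minus S2
    }


# Sentence pairs taken from Table 1.
identity = pair_features(
    "Tony Blair was elected for a second term today.",
    "Tony Blair was elected for a second term today.",
)
subsumption = pair_features(
    "With 3 wins this year, Green Bay has the best record in the NFL.",
    "Green Bay has 3 wins this year.",
)
print(identity)
print(subsumption)
```

On these two pairs the features behave as the relation descriptions suggest: the "Identity" pair has full overlap and no length difference, while for the "Subsumption" pair every word of S2 is covered by S1 and S1 is longer. In a full system, vectors of this kind would serve as the input representation for training the classifier on annotated pairs.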