Automatic Gathering of Newspaper Articles on Internet Abuse from ...
Automatic Gathering of Newspaper Articles on Internet Abuse from ...
Automatic Gathering of Newspaper Articles on Internet Abuse from ...
You also want an ePaper? Increase the reach of your titles
YUMPU automatically turns print PDFs into web optimized ePapers that Google loves.
Possibilities for Further Work<br />
● Improve the prototype (better cleaning, avoid duplicates, …)<br />
● Add more sources and languages<br />
● Apply to further domains <str<strong>on</strong>g>of</str<strong>on</strong>g> interest<br />
● Assist users in specifying their interest pr<str<strong>on</strong>g>of</str<strong>on</strong>g>iles<br />
● Further document analysis: Extracti<strong>on</strong> <str<strong>on</strong>g>of</str<strong>on</strong>g> informati<strong>on</strong> such as<br />
● names <str<strong>on</strong>g>of</str<strong>on</strong>g> people and organisati<strong>on</strong>s<br />
● geographical references, etc.<br />
● Add document similarity measure to database<br />
● Cross-language keyword assignment and document comparis<strong>on</strong><br />
● Visualisati<strong>on</strong> <str<strong>on</strong>g>of</str<strong>on</strong>g> the document collecti<strong>on</strong> or subsets <str<strong>on</strong>g>of</str<strong>on</strong>g> it,<br />
using document maps