24.12.2013 Views

Slides - University of Washington


University of Washington

Ling472 Introduction to Computational Linguistics

Lecture 1: Introduction

Primitive tokenization

$ cat moby_dick.html |          # echo the text
  tr '[:upper:]' '[:lower:]' |  # convert to lower case
  tr ' ' '\n' |                 # put each word on a line
  grep -v '^$' |                # get rid of blank lines
  grep -v '
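The visible stages of this pipeline can be tried on a short string rather than the whole file. What follows is a minimal sketch, assuming a POSIX shell with `tr` and `grep` available. The function name `tokenize` and the sample sentence are illustrative and do not come from the slides. The final `grep` stage is left out because its pattern is cut off in the source.

```shell
# tokenize: hypothetical helper wrapping the stages that are visible on the slide.
# Reads text on stdin and writes one lower-cased token per line.
tokenize() {
  tr '[:upper:]' '[:lower:]' |  # convert to lower case
  tr ' ' '\n' |                 # put each word on its own line
  grep -v '^$'                  # drop the blank lines left by repeated spaces
}

# Sample input with a doubled space, to show why the blank-line filter is needed.
printf 'Call me  Ishmael.\n' | tokenize
```

Splitting on spaces alone leaves punctuation attached to the neighbouring word, so the last token here is `ishmael.` rather than `ishmael`. This is one reason the approach counts as primitive.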
