import nltk from nltk. The rules are pretty simple. Start studying Using Punctuation #1. first, last - the range of elements to process value - the value of elements to remove policy - the execution policy to use. If I use nltk. Tokenization is breaking the sentence into words and punctuation, and it is the first step to processing text.