Corpus Linguistics with Python#

This section includes topics on corpus linguistics with Python. In particular, it will demonstrate how to perform several important corpus analyses using the Python language.

Topics include:

  • Web crawling

  • Reading structured corpora data

  • Concordance analysis

  • Frequency lists

  • Collocations

  • N-gram analysis

  • Vectorization and bag-of-words

  • Segmentation, Tokenization, and Parsing

  • Pattern/Construction Extraction