Blog Archives

HTML Extraction Party

Tagged with: , , , , , , ,
Posted in

We’re having a party! Of sorts – an afternoon tutorial on how to extract texts from HTML (web) documents. We’ll be using a single case study, provided by Stephen Wittek, but the principles will be transferable to any HTML-based texts