Tonight we had a full crew in our swanky VCET meeting room, first tackling our PDF parser and trying to get it to extract the text of a specified table of contents entry. We decided that there’s enough variation in companies’ 10-K formatting to warrant re-evaluating our approach to parsing 10-Ks. As mentioned in our winter 2015 roundup, we’ll regroup and approach this soon in a more modern, API-esque way.
New member Doug contributed a lot of interesting ideas and energy. We built an SQLite database from a spreadsheet he’s been filling in, setting the stage for next week, when we’ll start building an MVP of his SMS service for child literacy, using Twilio’s API. Super-exciting!
In the coming weeks we’ll be making our way through book two of Real Python, during which time we’ll start building little web apps, including but not limited to a 10-K interface and a site for Doug’s literacy TaaS (text as a service?).