1. Work Flow:
A: There are two options:
1. Executing the py script using cron periodically.
2. Executing the script manually as & when required.
2 more spreadsheets containing list of startups & List of VC firms will be derived from the main spreadsheet.
These spreadsheets will be filled either by scrapy or manually (How to use scrapy isn’t established yet, options may need to be
explored)
Schema/DB diagram is prepared.
Collection1- Angels- email primary key
Collection2- VC Firm- Firm name Primary key
Collection3- Startup- Name Primary key
Collection 2 & collection3 will be linked to Collection1.
As we are using mongo, it will give us flexibility to modify the schema if required without much problem.
ColumbiaAngels
Website
Google Form Spreadsheet Mongodb
A