This is the evolving architecture of ContentMine (contentmine.org). It includes an overview (slide #2) showing getpapers, quickscrape, norma and ami.
The key container is the CTree, and the architecture shows where components are added to it or transformed within it.
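The idea of a container that successive tools add to or transform can be sketched roughly as below. This is only an illustration under stated assumptions: the CTree is modeled as a plain directory, and the stage functions (fetch, normalize, analyze) and file names (raw.xml, normalized.html, results.txt) are hypothetical stand-ins, not the actual getpapers, quickscrape, norma or ami interfaces or the real CTree layout.

```python
import tempfile
from pathlib import Path

# All stage and file names below are hypothetical, chosen for illustration;
# they do not reflect the real ContentMine tools or CTree file layout.

def fetch(ctree: Path) -> None:
    """Stand-in for a download stage: adds a raw document to the container."""
    (ctree / "raw.xml").write_text("<article><p>Hello CTree</p></article>")

def normalize(ctree: Path) -> None:
    """Stand-in for a normalization stage: derives a new file from the raw one."""
    raw = (ctree / "raw.xml").read_text()
    (ctree / "normalized.html").write_text(raw.replace("article", "html"))

def analyze(ctree: Path) -> None:
    """Stand-in for an analysis stage: adds a results file to the container."""
    text = (ctree / "normalized.html").read_text()
    (ctree / "results.txt").write_text(f"CTree mentions: {text.count('CTree')}\n")

def run_pipeline(ctree: Path) -> list[str]:
    """Run each stage in order against one container directory."""
    ctree.mkdir(parents=True, exist_ok=True)
    for stage in (fetch, normalize, analyze):
        stage(ctree)
    # Return the names of the components now held in the container.
    return sorted(p.name for p in ctree.iterdir())

if __name__ == "__main__":
    with tempfile.TemporaryDirectory() as tmp:
        print(run_pipeline(Path(tmp) / "paper_0001"))
```

Each stage only reads from and writes to the container directory, so stages can be rerun or swapped without knowing about one another.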
These slides are dated and may be out of date with respect to the code. Some diagrams are autogenerated from *.dot files.
Please use http://discuss.contentmine.org/c/software as the main source of up-to-date info. Feel free to ask questions, offer help, critique, etc.
All software is Open (BSD, Apache2).