Overall
- Major rewrite of NLP pipeline
- Many miscellaneous improvements and fixes
NLP
- Many performance and speed improvements to NER
- Rewritten, ~2x faster dependency parser
- Rewritten hierarchical cross-document coreference
- Rewritten universal schema relation extraction model and epistemological database
- Faster tokenization with JFlex (50x faster, ~500k tokens/second...