Exploring Natural Language Processing in Ruby - Tokyo Rubyist Meetup (April 9th, 2015)
This presentation will cover 3 natural language processing gems I’ve released over the past year:
* Pragmatic Segmenter (a sentence boundary detection gem)
* Chat Correct (a gem for English teachers/students that provides error analysis when an incorrect sentence is diffed with a correct sentence)
* Word Count Analyzer (a gem that analyzes a string for potential “word count gray areas” which cause tools to report different word counts)
The talk will cover various aspects of building these gems including working from first principles, testing edge cases, and getting comfortable with regular expressions. I’ll also introduce a project that is currently in-progress - a new algorithm for parallel text alignment and some of the related challenges with building it.