LexPredict Open Sources The 1910 Version of Black’s Law – The World’s Most Well Known Legal Dictionary is Now a Data Object
July 24, 2017
Chicago, IL USA
For Immediate Release
At their core, many academic and commercial
applications of natural language processing
and machine learning can benefit from a
controlled lexicon of expert-selected terms
(i.e., a dictionary). This is especially true of
highly technical language, such as legal text.
However, after a search of the existing
landscape, we were unable to find a high-quality
open source or freely available legal
dictionary. Instead, the best existing versions,
when available, are subject to some form of
restrictive licensing conditions.
Thus, in support of both the legal
profession and a range of legal
technology providers and solutions, we are announcing another step in the broader open source
plan that we outlined earlier this month. Namely, we are making available on GitHub the 1910
version of Black’s Law (i.e., Black’s Law 2nd Edition) as a structured data object. This early version
of arguably the premier legal dictionary is made available under the open source GPL license, which
should allow both researchers and commercial providers to operate with limited restrictions.
Our open-source document analytics platform, ContraxSuite, is just one of the projects that will
benefit. We expect many other researchers and companies to both benefit and contribute.
The current contents are organized by
language, locale, and topic, and can be
accessed in CSV, JSON, or Excel format. We
anticipate that the contents and
organization of this legal dictionary
repository will evolve and expand as the
community participates and our products
evolve. Our vision is to work towards multi-domain, multilingual, and cross-lingual
resources for legal and regulatory text. The current version is still being actively cleaned and
corrected, and we anticipate that this refinement process alone may take several weeks.
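For readers who want to try a dictionary file as a controlled lexicon, the following is a minimal Python sketch rather than a description of the repository's actual layout. The column names (`term`, `definition`), the sample rows, and the helper names `load_lexicon` and `find_terms` are all assumptions made for illustration; check the published files for the real schema.

```python
import csv
import io
import re

# Illustrative stand-in for a CSV export. The column names and rows are
# assumptions for this sketch, not content taken from the repository.
SAMPLE_CSV = """term,definition
bailment,placeholder definition text
estoppel,placeholder definition text
tort,placeholder definition text
"""


def load_lexicon(handle):
    """Read term/definition rows into a dict keyed by lowercased term."""
    reader = csv.DictReader(handle)
    return {row["term"].strip().lower(): row["definition"].strip() for row in reader}


def find_terms(text, lexicon):
    """Return lexicon terms found in text as whole words, in order of first appearance."""
    if not lexicon:
        return []
    # Try longer terms first so multi-word entries win over shorter ones.
    alternatives = sorted(lexicon, key=len, reverse=True)
    pattern = re.compile(
        r"\b(" + "|".join(re.escape(t) for t in alternatives) + r")\b",
        re.IGNORECASE,
    )
    seen = []
    for match in pattern.finditer(text):
        term = match.group(1).lower()
        if term not in seen:
            seen.append(term)
    return seen


lexicon = load_lexicon(io.StringIO(SAMPLE_CSV))
matches = find_terms("The doctrine of estoppel barred the tort claim.", lexicon)
print(matches)
```

To run this against a downloaded file, replace the `io.StringIO(SAMPLE_CSV)` argument with an opened file handle, for example `open(path, newline="", encoding="utf-8")`, and adjust the column names to match the file.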
III. Final Thought
We believe that the absence of such high-quality linguistic
inputs represents a significant bottleneck in the academic
and commercial space. We hope this offering will
help. Please check back for more as we continue down
the path to #OpenSourceLegal.