Database support

03 Aug 2020

Since a lot of the data retrieved from the API comes in JSON format, and NLTK outputs are also in JSON, it stands to reason to use a JSON Documents based database to store the partial results and help to process and analyze the data, something on the line of MongoDB or CouchDB; but these two feels like overkill, so TinyDB looks more appropriate for the task at hand, installing it is as simple as:

pip install tinydb

At this point, it was necessary to do significant modifications to retrieve the whole text of the selected version of the bible, store the data inside TinyDB, and also retrieve the whole text of a Chapter. All the associated code and the resulting databases are available at the following GitHub repo: https://github.com/murillocjr/theophilusnlp

Previous	Home	Next
Tokenizing with NLTK	θεόφιλος Journey	Milestone 1

jorge@home:~$

Archive

About

Source Code

Database support