diff --git a/todo b/todo new file mode 100644 index 0000000..2c7e8cc --- /dev/null +++ b/todo @@ -0,0 +1,6 @@ +[ ] Refactor website table to generic document table (maybe using URN instead of URL?) +[ ] Define tokens table FKed to document table +[ ] Refactor index.py to tokenize input into tokens table +[ ] Define N-Grams table +[ ] Add N-Gram generation to index.py +