From 8605ee6b2c8ed8b6da9f7a4bd3d1427f48673878 Mon Sep 17 00:00:00 2001 From: rmgr Date: Sat, 2 Mar 2024 19:58:10 +1030 Subject: [PATCH] Add todo file --- todo | 6 ++++++ 1 file changed, 6 insertions(+) create mode 100644 todo diff --git a/todo b/todo new file mode 100644 index 0000000..2c7e8cc --- /dev/null +++ b/todo @@ -0,0 +1,6 @@ +[ ] Refactor website table to generic document table (maybe using URN instead of URL?) +[ ] Define tokens table FKed to document table +[ ] Refactor index.py to tokenize input into tokens table +[ ] Define N-Grams table +[ ] Add N-Gram generation to index.py +