- https://stackoverflow.com/questions/25665114/understanding-interactive-brokers-tick-events
- https://epchan.blogspot.com/2015/04/beware-of-low-frequency-data.html
Download data using the modules in src.downloaders etc.
ticker_name_mapping requires an internet connection!!
-
python -m src.preprocessing.news_preprocessing --ticker_name_mapping
-
python -m src.preprocessing.news_preprocessing --process_body
-
python -m src.preprocessing.data_merger initial_merge
-
python -m src.preprocessing.data_merger merge_overnight_news
-
python -m src.preprocessing.data_merger merge_daily_indicators
IF anything stripper should be applied AFTER initial merge... otherwise we would have to merge again...
-
python -m src.preprocessing.news_preprocessing --stripper
-
python -m src.preprocessing.adding_jaccard
-
python -m src.preprocessing.make_learning_dataset
#! MLM Trainer should have access only to parsed_body
(mayber merged_msg
in future?) of the training set.
- python -m src.preprocessing.tokenizer
- python -m src.model.mlm_train
See python -m src.model.training -h
for more info.
python -m src.evaluation.predictions
Keep alive by pasting the following in the the developer console:
function ClickConnect(){
console.log("Working");
document.querySelector("colab-connect-button").click()
}
setInterval(ClickConnect,60000)
Add to the start of notebook for correct configuration
google_colab = True
if google_colab:
from google.colab import drive
import os
drive.mount('/content/drive')
cwd="/content/drive/MyDrive/NewsTrading/trading_bot"
%cd /content/drive/MyDrive/NewsTrading/trading_bot
%pip install -r requirements_clean.txt
os.environ["TRADING_BOT_CONFIG_PATH"] = "src/config_gcs.yaml"