Releases: KarelZe/thesis

Changes between 8 May - 14 May

14 May 06:01
5d4efd6

Took Friday and Saturday off to get uni-related work done.

What's Changed

Empirical Study ⚗️

  • Clean up of outdated files ♻️ by @KarelZe in #355
  • Implemented correct feature importance measures (WIP) by @KarelZe in #322
    • includes SAGE values with zero-one loss and permutation in groups. Also, opened an issue iancovert/sage#18 to discuss the idea and implementation with the authors. (WIP)
    • includes visualizations of categorical embeddings with highly promising results.
    • includes a new approach to calculate attention maps (Chefer et al.) (WIP)
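
For readers unfamiliar with "permutation in groups", the idea can be sketched as follows. This is a minimal illustration of plain grouped permutation importance scored with the zero-one loss. It is not SAGE and not the code from #322; the function names, the toy model, and the feature groups (`price`, `noise`) are hypothetical and chosen only for the example.

```python
import random
from typing import Callable, Dict, List, Sequence

Row = List[float]


def zero_one_loss(y_true: Sequence[int], y_pred: Sequence[int]) -> float:
    """Fraction of misclassified samples."""
    return sum(t != p for t, p in zip(y_true, y_pred)) / len(y_true)


def grouped_permutation_importance(
    predict: Callable[[List[Row]], List[int]],
    X: List[Row],
    y: Sequence[int],
    groups: Dict[str, List[int]],
    n_repeats: int = 5,
    seed: int = 0,
) -> Dict[str, float]:
    """Mean increase in zero-one loss when all columns of a group are
    permuted together, i.e. with the same row shuffle for every column."""
    rng = random.Random(seed)
    baseline = zero_one_loss(y, predict(X))
    importances: Dict[str, float] = {}
    for name, cols in groups.items():
        increases = []
        for _ in range(n_repeats):
            order = list(range(len(X)))
            rng.shuffle(order)
            X_perm = [row[:] for row in X]  # copy so X stays untouched
            for i, src in enumerate(order):
                for c in cols:
                    X_perm[i][c] = X[src][c]
            increases.append(zero_one_loss(y, predict(X_perm)) - baseline)
        importances[name] = sum(increases) / n_repeats
    return importances


# Toy data (hypothetical): the label depends on columns 0 and 1 only.
data_rng = random.Random(1)
X = [[data_rng.uniform(-1, 1) for _ in range(3)] for _ in range(200)]
y = [1 if row[0] + row[1] > 0 else -1 for row in X]


def predict(rows: List[Row]) -> List[int]:
    # Stand-in for a fitted classifier; it ignores column 2 entirely.
    return [1 if r[0] + r[1] > 0 else -1 for r in rows]


importances = grouped_permutation_importance(
    predict, X, y, groups={"price": [0, 1], "noise": [2]}
)
print(importances)  # "noise" is 0.0 because the toy model never reads column 2
```

Permuting a group with a single shared shuffle keeps the relationship between the features inside the group intact while breaking their link to the label, which is the usual motivation for grouping related features.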

Writing 📖

  • Add paragraphs on label smoothing, lr warmup, optimizer, and viz🤖 by @KarelZe in #350
  • Add results hyperparameter search gradient-boosting 😺 by @KarelZe in #352
  • Rewrite chapter Hyperparameter Search with updated results 🗺️ by @KarelZe in #354
  • Chapter on the selection of supervised methods👩‍🎓 (WIP) by @KarelZe in #353

Other Changes

Outlook 🔭

See https://github.com/users/KarelZe/projects/1/views/4.

Full Changelog: 23-19...23-20

Changes between 1 May - 7 May

08 May 05:00
8caafe7

What's Changed

Empirical Study ⚗️

  • Implement Pre-Training🛝 (WIP) by @KarelZe in #343
  • Implement and Study Feature Importances🪄 (WIP) by @KarelZe in #322
    • identified that random feature permutation won't work as expected

Writing 📖

  • Extend chapter on hyperparameter tuning, training of supervised / semi-supervised methods 📖 by @KarelZe in #342
    • includes new insights in the training configuration of models
    • includes new insights on the hyperparameters and their necessity
    • identified minor mistakes that led to strongly fluctuating errors

Other Changes

Outlook 🔭

See https://github.com/users/KarelZe/projects/1/views/4.

Full Changelog: 23-18...23-19

Changes between 24 April - 30 April

01 May 20:44
9f40b2d

What's Changed

Empirical Study ⚗️

Writing 📖

  • Rewrite chapter on Pre-Training and Rewrite selection of semi-supervised methods🤖 by @KarelZe in #316
  • Add improved visualizations 🖼️ by @KarelZe in #318
  • Edit in review comments👩‍🎓 by @KarelZe in #319
  • Various writing improvements📖 by @KarelZe in #324
  • Discussion on computational demand + smaller fixes🏭 by @KarelZe in #338
  • Add in missing page numbers🔢 by @KarelZe in #340

Other Changes

Outlook 🔭

See https://github.com/users/KarelZe/projects/1/views/4.

Full Changelog: 23-17...23-18

Changes between 17 April - 23 April

23 Apr 19:26
abe93fc

What's Changed

Empirical Study ⚗️

  • Started with EDA on unlabelled data. Still have to make sense of the results.
  • Continued working on the invalid gradient problem. Haven't yet figured out how to reproduce it reliably.

Writing 📖

  1. Reworked chapter on token embeddings
  2. Reworked chapter on FT-Transformer
  3. Reworked chapter on decision trees
  4. Shortened several chapters
  5. Added chapter on Attention Mechanism
  6. Added chapter on Gradient Boosting Procedure
  7. Added discussion on Selection of Semi-Supervised Approaches
  8. Added chapter on Pre-Training of Transformers
  9. Various other improvements: notation, viz, typos, 🇺🇸 / 🇬🇧 dialect, etc.

Other Changes

Outlook 🔭

See https://github.com/users/KarelZe/projects/1/views/4.

Full Changelog: 23-16...23-17

Changes between 10 April - 16 April

16 Apr 19:47
b9ee21c

What's Changed

Empirical Study ⚗️

  • Implement proper training setup for transformers🤖 by @KarelZe in #292
  • Remove TabTransformer🤖 by @KarelZe in #305

Writing 📖

Other Changes

Outlook

Full Changelog: 23-15...23-16

Changes between 3 April - 9 April

09 Apr 12:17
1cac0b4

What's Changed

Empirical Study ⚗️

  • Fix totals in tables📊 by @KarelZe in #276
  • Add retraining / semi-supervised mode to gradient boosting😺 by @KarelZe in #278
  • Create summary statistics classical trade classification rules📊 by @KarelZe in #279
  • Code review of data preparation notebooks😈 by @KarelZe in #280
  • Run studies for SelfTrainingClassifier🅾️ by @KarelZe in #249
  • Fix statistical tests in effective spread calculation🌄 by @KarelZe in #281
  • Add transfer learning results🔄️ by @KarelZe in #285
  • Select benchmark on validation set🔧 by @KarelZe in #291
  • Delete references to Docker⚓ by @KarelZe in #294

Writing 📖

  • Chapter on evaluation metric🪙 by @KarelZe in #216
  • Delete outdated files and add questions for meeting❌ by @KarelZe in #283
  • Chapter on Semi-Supervised Learning🦯 by @KarelZe in #284
  • Various improvements: evaluation metric, hyperparameter tuning, and application study🎩 by @KarelZe in #286

Other Changes

Full Changelog: 23-14...23-15

Changes between 27 March and 2 April

02 Apr 17:40
2477438

What's Changed

Empirical Study ⚗️

  • Allow unclassified in ClassicalClassifier🏦 by @KarelZe in #219
  • Implement Self-Training for CatBoost⭕ by @KarelZe in #215
  • Extend result generation🏁 by @KarelZe in #228
  • Improve Result Tables🖨️ by @KarelZe in #234
  • Fix midpoint/spread in ClassicalClassifier🐞 by @KarelZe in #235
  • Improve feature engineering notebook🤏 by @KarelZe in #236
  • Remove zero imputation from feature set mode none🐞 by @KarelZe in #239
  • Generate ISE / CBOE supervised results of Gradient Boosting🐈 by @KarelZe in #243
  • Improvement of resumable studies and SelfTrainingClassifier🅾️ by @KarelZe in #246 and in #224
  • Run studies for SelfTrainingClassifier🅾️ (WIP) by @KarelZe in #249
  • Add visualizations of hyperparameter search space and fix minor typos🌔 by @KarelZe in #248

Writing 📖

  • Chapter on Feature Engineering🪄 by @KarelZe in #212
  • Update chapter on dataset/results 📑 by @KarelZe in #237
  • Run studies for SelfTrainingClassifier🅾️ (WIP) by @KarelZe in #249
  • Add chapter on random feature permutation🔀 (WIP) by @KarelZe in #217

Other Changes

Outlook 🔭

See https://github.com/users/KarelZe/projects/1/views/4.

Full Changelog: 23-13...23-14

Changes between 20 March and 26 March

26 Mar 20:13
f4de02a

Picked work up again on Thursday.

What's Changed

Empirical Study ⚗️

Writing 📖

Other Changes

Outlook 🛩️

https://github.com/users/KarelZe/projects/1/views/4?filterQuery=status%3A%22In+Progress%22%2C%22todo%22+

Full Changelog: 23-12...23-13

Changes between 13 March and 19 March

19 Mar 18:56
22640ef

Didn't work 100% on the thesis. Spent most of the time on exam prep.

BwHPC Cluster is down until Friday. Thus, I will spend my time after the exam on writing ✏️.

What's Changed

Empirical Study ⚗️

  • Generate results for classical classifier + effective spread👸 by @KarelZe in #200
  • Automatic generation of results tables🏇 by @KarelZe in #201
  • Automatic result / viz generation for gradient boosting🙀 by @KarelZe in #203
  • Add ROC / Recall curves to notebooks🦉 by @KarelZe in #204
  • Extended pipeline for result generation🛕 by @KarelZe in #202
  • Gathered some ideas on how to retrieve the feature importances / need to correct probabilities.

Outlook🎒

  • exam prep (Mo - Wed)
  • write the chapter on data preprocessing incl. viz
  • shorten / rewrite the chapter on feature engineering
  • prewrite the sub-chapter on random feature permutation. Make sure it is the best possible choice.
  • create prototype for grouped random feature permutation
  • review and test #137

Full Changelog: 23-11...23-12

Changes between 6 March and 12 March

13 Mar 08:01
91d7063

Didn't work 100% on the thesis. Spent some time on exam prep.

What's Changed

Writing 📖

Other Changes

Outlook🎒

  • finish remaining tasks from last week
  • exam prep

Full Changelog: 23-10...23-11