Pinned · Published in TDS Archive
Generalized Attention Mechanism: BigBird’s Theoretical Foundation and General Transformers Models
As a little background, BigBird is a model released by Google Research relatively recently that can handle much lengthier sequences of…
Dec 19, 2021
Published in TDS Archive
[New Hugging Face Feature] Constrained Beam Search with 🤗 Transformers
A new Hugging Face feature allows you to customize and guide your language model outputs (like forcing a certain sequence within the output).
Mar 22, 2022
Published in Explained Relentlessly: Deep Learning & Natural Language Processing
Tutorial: Storing Large A.I. Models with ‘gdrive’ (don’t use Git LFS)
Git LFS is a short-term solution that shouldn’t be used for your AI projects. Learn how to use gdrive to save & export your models.
Jul 3, 2021
Published in Explained Relentlessly: Deep Learning & Natural Language Processing
ANCE Contrastive Learning for Dense Retrieval: Sampling Negative Examples & The Variation…
Why negative examples are so important in training Neu-IR models and how ANCE obtains the mathematically optimized distribution.
Jun 25, 2021
How to Set up code-server daemon on Ubuntu: Root Access, Docker, systemd, NGINX Reverse-Proxy, and…
Publicly expose code-server with systemctl and NGINX, or set it up with Docker. How to use NGINX as a reverse proxy to whitelist IPs.
Jun 23, 2021
Published in Explained Relentlessly: Deep Learning & Natural Language Processing
Retrieval-Augmented Generation (RAG): Control Your Model’s Knowledge and… Hallucinations!
Large LMs are really good today. But they’re known to hallucinate. Maybe we should let them cheat a little with Wikipedia.
Jun 20, 2021
Published in Explained Relentlessly: Deep Learning & Natural Language Processing
[Part I] Predicting on Text Pairs with Transformers: Cross-Encoding with BERT
[Code Implementation] Perform SOTA sentence-pair regression using BERT’s cross-encoding with SequenceClassification.
Jun 18, 2021
Published in Explained Relentlessly: Deep Learning & Natural Language Processing
Autoencoders & Power of Mathematical Optimizations
I don’t know if many people agree with this sentiment, but I was pretty shocked when I learned how A.I. worked the first time. Growing up, I…
Nov 23, 2018
Published in Explained Relentlessly: Deep Learning & Natural Language Processing
Overfitting (What They Are & Train, Validation, Test & Regularization)
I’ve taken a bit of a hiatus because I’ve been busy with school and other things, and I’ve been spending time learning more ML/DL stuff…
Apr 15, 2018
Published in Explained Relentlessly: Deep Learning & Natural Language Processing
Document Classification Part 4: Variations of This Approach (Malware Detection Using Document…
In this article, I will explain an application of the concepts so far that goes beyond just categorizing text documents.
Mar 22, 2018