NLP Interview Refresher Repository

I created this repository for revision purposes, specifically for an upcoming NLP interview. As part of a skill test for a company, I will be performing BERT sentiment classification on a restaurant review dataset.

There are numerous tutorials available on Kaggle for this challenge, dating back to 2020. Some of them explain Transformer-related concepts, but a significant number do not.

The primary goal of this repository is to fill in knowledge gaps and provide learners with a comprehensive perspective on BERT, Transformers, and the Hugging Face library, which are frequently discussed concepts in NLP interviews.

The study notes reside in the vault directory and are meant to be opened with Obsidian. This setup minimizes friction when introducing new concepts: you can open a side note, read it, close it, and return to the main note. During subsequent reviews, familiarity with the side notes lets you skip content you have already grasped. This approach offers these benefits without disrupting traditional learning methods. The content of the vault is high-level, which I found suitable for interview revision. However, where possible, I leave pointers in case readers want to go down the rabbit hole.

The ML code is in the codes directory. Feel free to play around with it, explore the repository, and leverage it for your own NLP interview preparation!

TODO

codes: demo tokenizers and the [CLS] token, based on huggingface/transformers#7540 and https://discuss.huggingface.co/t/how-to-get-cls-embeddings-from-bertfortokenclassification-model/9276/2, in codes/notebooks/BertModel outputs and CLS token.ipynb

vault: training details: optimizer and scheduler
experiment agenda and implementation: MLOps
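
The first TODO item concerns how tokenizer output relates to the [CLS] position in BertModel outputs. Until that notebook exists, here is a rough, dependency-free sketch of the idea. It is not the repository's code: the toy vocabulary, the helper names, and the random stand-in vectors are all assumptions made for illustration, and no real BERT weights are involved. It shows WordPiece-style greedy longest-match splitting, wrapping the sequence in [CLS] and [SEP], and reading the vector at position 0.

```python
import random

# Hypothetical toy vocabulary; a real BERT vocabulary has roughly 30k entries.
TOY_VOCAB = {"[CLS]", "[SEP]", "[UNK]", "the", "food", "was",
             "taste", "##less", "great"}


def wordpiece(word, vocab):
    """Split one word into sub-tokens by greedy longest-match-first."""
    pieces, start = [], 0
    while start < len(word):
        end, piece = len(word), None
        # Shrink the candidate from the right until it is in the vocabulary.
        while end > start:
            candidate = word[start:end]
            if start > 0:
                candidate = "##" + candidate  # continuation-piece marker
            if candidate in vocab:
                piece = candidate
                break
            end -= 1
        if piece is None:
            return ["[UNK]"]  # no split exists, so the whole word is unknown
        pieces.append(piece)
        start = end
    return pieces


def tokenize(text, vocab=TOY_VOCAB):
    """Lowercase, split on whitespace, and wrap the pieces in [CLS] ... [SEP]."""
    tokens = ["[CLS]"]
    for word in text.lower().split():
        tokens.extend(wordpiece(word, vocab))
    tokens.append("[SEP]")
    return tokens


def fake_hidden_states(tokens, hidden_size=4, seed=0):
    """Stand-in for an encoder's last hidden states: one random vector per token."""
    rng = random.Random(seed)
    return [[rng.uniform(-1, 1) for _ in range(hidden_size)] for _ in tokens]


tokens = tokenize("The food was tasteless")
hidden = fake_hidden_states(tokens)
cls_embedding = hidden[0]  # [CLS] is always the first position
print(tokens)
print(len(cls_embedding))
```

With Hugging Face transformers, the analogous step is usually written as outputs.last_hidden_state[:, 0], which selects the first position of every sequence in the batch. A sentiment classifier would then feed that vector to a small classification head.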


xtfocus/aimesoft_interview

Folders and files

Latest commit

History

Repository files navigation

NLP Interview Refresher Repository

TODO

About

Resources

Stars

Watchers

Forks

Releases

Packages 0

Languages

Packages