Skip to content
My Knowledge Base
ViTSTR
Initializing search
nithish96/nitish96.github.io
My Knowledge Base
nithish96/nitish96.github.io
Home
Books
Books
📚 Overview
Building Applications with AI agents
Building Applications with AI agents
Introduction
Chapter 4: Tool Use
Chapter 5: Orchestration
Chapter 6: Knowledge and Memory
Chapter 7. Learning in Agentic Systems
Chapter 8. From One Agent to Many
LLM From Scratch
LLM From Scratch
Introduction
2. Working with Text Data
3. Coding Attention Mechanisms
4. Implementing a GPT Model from Scratch
5. Pretraining on Unlabeled Data
6. Fine-tuning for Classification
7. Fine-Tuning to Follow Instructions
Understanding Deep Learning
Understanding Deep Learning
00. Introduction
05. Loss Functions
12. Transformers
Computer Vision
Computer Vision
Concepts
Concepts
Attention
CTC Decoding
Text Detection
Text Detection
CRAFT
DBNet
Text Recognition
Text Recognition
ViTSTR
Transformers
Transformers
Data efficient Image Transformer (DeiT)
Detection Transformer
Swin Transformer
Vision Transformer
Languages
Languages
Python
Python
Multiprocessing
Pandas
Threading
MLOps
MLOps
Docker
Kubernetes
NLP
NLP
Courses
Courses
Standford
LLM
LLM
LLAMA
LLAMA
3. PreTraining
4. Post Training
6. Inference
7. Vision
8. Speech
ViTSTR
¶
Vision Transformer for Scene Text Recognition
Back to top