Skip to content

Nithish Duvvuru

ViTSTR

Initializing search

nithish96/nitish96.github.io

Nithish Duvvuru

nithish96/nitish96.github.io

Home
Books
Books
- LLM From Scratch
  LLM From Scratch
- UDL
  UDL
- Transformers
  Transformers
Computer Vision
Computer Vision
- Concepts
  Concepts
  - Attention
  - CTC Decoding
- Text Detection
  Text Detection
  - CRAFT
  - DBNet
- Text Recognition
  Text Recognition
  - ViTSTR
- Transformers
  Transformers
Languages
Languages
- Python
  Python
MLOps
MLOps
- Docker
- Kubernetes
NLP
NLP
- Courses
  Courses
  - Standford
- LLM
  LLM
  - LLAMA
    LLAMA
    
    3. PreTraining
    
    4. Post Training
    
    6. Inference
    
    7. Vision
    
    8. Speech

ViTSTR

Vision Transformer for Scene Text Recognition

ViTSTR Architecture

Data efficient Image Transformer (DeiT)

Made with Material for MkDocs