Skip to main content

References

This chapter provides a curated list of links to official documentation, key research papers, and other resources for further study.


1. Core Technologies


2. Key Research Papers & Blog Posts

Vision-Language-Action (VLA) Models

Vision Transformers (ViT)

  • An Image is Worth 16x16 Words: Transformers for Image Recognition at Scale

3. Community Resources