AI & ML interests
Google ❤️ Open Source AI
Papers
FIT: A Large-Scale Dataset for Fit-Aware Virtual Try-On
CoGate-LSTM: Prototype-Guided Feature-Space Gating for Mitigating Gradient Dilution in Imbalanced Toxic Comment Classification
-
MedGemma - Radiology Explainer Demo
🩺 241 • Radiology Image & Report Explainer Demo. Built with MedGemma
-
Appoint Ready - MedGemma Demo
📋 200 • Simulated Pre-visit Intake Demo built using MedGemma
-
Radiology Learning Companion
🏃 27 • A demo showcasing a medical learning experience with CXR images
-
EHR Navigator Agent With MedGemma
🩺 53 • Search and navigate electronic health records
-
VideoPrism: A Foundational Visual Encoder for Video Understanding
Paper • 2402.13217 • Published • 40 -
google/videoprism-base-f16r288
Video Classification • Updated • 12.5k • 101 -
google/videoprism-large-f8r288
Video Classification • Updated • 372 • 20 -
google/videoprism-lvt-base-f16r288
Video Classification • Updated • 14.5k • 12
-
Path Foundation Demo
🔬 45 • Browse pathology image library
-
CXR Foundation Demo
🩻 22 • Demo usage of the CXR Foundation model embeddings
-
google/gemma-3-4b-it-qat-q4_0-gguf
Image-Text-to-Text • 4B • Updated • 11.6k • 254 -
google/gemma-3-4b-pt-qat-q4_0-gguf
Image-Text-to-Text • 4B • Updated • 465 • 26 -
google/gemma-3-1b-it-qat-q4_0-gguf
Text Generation • 1.0B • Updated • 1.12k • 124 -
google/gemma-3-1b-pt-qat-q4_0-gguf
Text Generation • 1.0B • Updated • 83 • 14
-
google-t5/t5-base
Translation • Updated • 1.4M • 773 -
google-t5/t5-small
Translation • 60.5M • Updated • 2.12M • 541 -
google-t5/t5-large
Translation • 0.7B • Updated • 547k • 256 -
Exploring the Limits of Transfer Learning with a Unified Text-to-Text Transformer
Paper • 1910.10683 • Published • 18
-
google/siglip-so400m-patch14-384
Zero-Shot Image Classification • 0.9B • Updated • 2.15M • 670 -
google/siglip-so400m-patch14-224
Zero-Shot Image Classification • 0.9B • Updated • 25k • 58 -
google/siglip-so400m-patch16-256-i18n
Zero-Shot Image Classification • 1B • Updated • 425 • 31 -
google/siglip-base-patch16-256-multilingual
Zero-Shot Image Classification • 0.4B • Updated • 21.7k • 53
-
Compare Siglip1 Siglip2
🚀 54 • Compare SigLIP1 and SigLIP2 on zero-shot classification
-
SigLIP 2: Multilingual Vision-Language Encoders with Improved Semantic Understanding, Localization, and Dense Features
Paper • 2502.14786 • Published • 164 -
google/siglip2-base-patch16-224
Zero-Shot Image Classification • Updated • 607k • 93 -
google/siglip2-base-patch16-256
Zero-Shot Image Classification • 0.4B • Updated • 93.6k • 8
-
PaliGemma 2: A Family of Versatile VLMs for Transfer
Paper • 2412.03555 • Published • 135 -
google/paligemma2-3b-pt-224
Image-Text-to-Text • Updated • 19.8k • 168 -
google/paligemma2-3b-pt-448
Image-Text-to-Text • 3B • Updated • 29.6k • 47 -
google/paligemma2-3b-pt-896
Image-Text-to-Text • 3B • Updated • 952 • 26
-
google/paligemma-3b-ft-ai2d-224-jax
Image-Text-to-Text • Updated • 2 • 5 -
google/paligemma-3b-ft-ai2d-448-jax
Image-Text-to-Text • Updated • 3 • 1 -
google/paligemma-3b-ft-aokvqa-da-224-jax
Image-Text-to-Text • Updated • 2 • 1 -
google/paligemma-3b-ft-aokvqa-da-448-jax
Image-Text-to-Text • Updated • 2 • 1
-
google/timesfm-1.0-200m
Time Series Forecasting • Updated • 1.74k • 807 -
google/timesfm-1.0-200m-pytorch
Time Series Forecasting • Updated • 11.8k • 31 -
google/timesfm-2.0-500m-jax
Time Series Forecasting • Updated • 45 • 20 -
google/timesfm-2.0-500m-pytorch
Time Series Forecasting • 0.5B • Updated • 32.8k • 251