Searching
Immich uses Postgres as its search database for both metadata and contextual CLIP search.
Contextual CLIP search is powered by the pgvecto.rs extension, utilizing machine learning models like CLIP to provide relevant search results. This allows for freeform searches without requiring specific keywords in the image or video metadata.
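To illustrate the idea, here is a minimal Python sketch of embedding-based search. The three-dimensional vectors are made-up stand-ins for real CLIP embeddings (hypothetical values, not model output), and `search` is an illustrative helper that ranks by plain cosine similarity. It is not Immich's implementation, where the vector lookup is handled inside the database.

```python
import math


def cosine_similarity(a, b):
    """Cosine of the angle between two equal-length vectors."""
    dot = sum(x * y for x, y in zip(a, b))
    norm_a = math.sqrt(sum(x * x for x in a))
    norm_b = math.sqrt(sum(y * y for y in b))
    return dot / (norm_a * norm_b)


# Hypothetical image embeddings; a real CLIP model emits hundreds of dimensions.
image_embeddings = {
    "beach.jpg": [0.9, 0.1, 0.0],
    "dog.jpg": [0.1, 0.9, 0.1],
    "city.jpg": [0.0, 0.2, 0.9],
}


def search(query_embedding, embeddings, limit=2):
    """Return asset names ordered by similarity to the query, best first."""
    ranked = sorted(
        embeddings,
        key=lambda name: cosine_similarity(query_embedding, embeddings[name]),
        reverse=True,
    )
    return ranked[:limit]


# Hypothetical embedding of a text query such as "a dog playing".
query = [0.2, 0.8, 0.1]
results = search(query, image_embeddings)
print(results)
```

Because the query text and the images are embedded into the same vector space, the closest images can match the query even when none of its words appear in their metadata.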
Advanced Search Filters
In addition, Immich offers advanced search functionality, allowing you to find specific content using customizable search filters. These filters include location, one or more faces, specific albums, and more. You can try out the search filters on the Demo site.
Smart search lets you filter by:
- People
- Location
  - Country
  - State
  - City
- Camera
  - Make
  - Model
- Date range
- File name or extension
- Media type
  - Image (including live/motion photos)
  - Video
  - All
- Condition
  - Not in any album
  - Archived
  - Favorited
- Rating
[Screenshots: search examples on Computer and Mobile]
Configuration
Navigate to Administration > Settings > Machine Learning Settings > Smart Search to see the available options.
CLIP models
More powerful models can give more accurate search results, but they are slower and can require more server resources. Check the per-language tables below to see how the models compare in memory usage, speed, and quality.
Once you've chosen a model, follow these steps:
1. Copy the name of the model (e.g. ViT-B-16-SigLIP__webli)
2. Go to the Smart Search settings
3. Paste the model name into the Model Name section
4. Save the settings
5. Go to the Job Status page
6. Click "All" next to "Smart Search" to begin re-processing your assets with the new model
7. (Optional) Confirm that the logs for the server and machine learning service don't have relevant errors
In rare instances, changing the model might leave bits of the old model's incompatible data in the database, causing errors when processing Smart Search jobs. If you notice errors like this in the logs, you can change the model back to the previous one and save, then repeat steps 3-7.
Please note that the memory and execution time values are only estimates: actual usage will vary depending on many factors. As such, they are mainly intended as a way to compare the relative tradeoffs of each model.
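As a sketch of how the figures can be used for this kind of comparison, the Python snippet below picks the highest-recall model that fits a memory and latency budget. The rows are copied from the English table further down; the helper name `best_model_within_budget` and the budget values are illustrative assumptions, not part of Immich.

```python
from typing import NamedTuple, Optional


class Model(NamedTuple):
    name: str
    memory_mib: float
    time_ms: float
    recall: float


# A few rows copied from the English table.
ENGLISH = [
    Model("ViT-H-14-378-quickgelu__dfn5b", 5049, 108.4, 75.73),
    Model("ViT-B-16-SigLIP2__webli", 3038, 5.81, 75.19),
    Model("ViT-L-14-quickgelu__dfn2b", 2212, 20.49, 72.78),
    Model("ViT-B-16-SigLIP-384__webli", 1128, 13.53, 72.45),
    Model("ViT-B-16-SigLIP__webli", 1081, 5.77, 71.64),
    Model("ViT-B-32__openai", 1004, 2.26, 59.37),
]


def best_model_within_budget(
    models, max_memory_mib: float, max_time_ms: float
) -> Optional[Model]:
    """Return the highest-recall model within both limits, or None if none fit."""
    candidates = [
        m
        for m in models
        if m.memory_mib <= max_memory_mib and m.time_ms <= max_time_ms
    ]
    return max(candidates, key=lambda m: m.recall, default=None)


# Hypothetical budget: 2 GiB of memory and 10 ms per pass.
choice = best_model_within_budget(ENGLISH, max_memory_mib=2048, max_time_ms=10)
print(choice.name if choice else "no model fits the budget")
```

Since the table values are estimates, it is sensible to leave some headroom in the budget rather than pick a model that only just fits.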
Reference
Memory and execution time estimates were obtained without acceleration on a 7800X3D processor running bare-metal Linux. All testing and evaluation was done at f32 precision (the default in Immich).
Execution Time (ms): After warming up the model with one pass, the mean execution time of 100 passes with the same input.
Memory (MiB): The peak RSS usage of the process after performing the above timing benchmark. This does not include image decoding, concurrent processing, the web server, etc., which are relatively constant factors.
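The execution-time measurement described above can be sketched in Python as follows. This is an assumed reconstruction of the stated method, not the script that produced the tables, and `run_inference` is a hypothetical stand-in for a single model pass.

```python
import time


def mean_execution_time_ms(run_pass, passes=100):
    """Warm up with one pass, then return the mean wall-clock time per pass in ms."""
    run_pass()  # warm-up pass, excluded from the measurement
    start = time.perf_counter()
    for _ in range(passes):
        run_pass()
    elapsed = time.perf_counter() - start
    return elapsed / passes * 1000.0


calls = 0


def run_inference():
    """Hypothetical stand-in workload; a real benchmark would run the model here."""
    global calls
    calls += 1
    sum(i * i for i in range(1000))


mean_ms = mean_execution_time_ms(run_inference)
print(f"{mean_ms:.3f} ms per pass")
```

Running the same input repeatedly after a warm-up pass keeps one-time costs such as model loading out of the average.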
Recall (%): The average of the recall@1, recall@5 and recall@10 results for zero-shot image retrieval, evaluated on Crossmodal-3600.
Pareto Optimal: Whether the model is not completely outclassed by another model. Try to use models that are optimal for the languages relevant to you. Specifically, for a given model and language, if there's another model that's better for that language in at least one respect (memory usage, execution time, recall) while being at least as good for that language in every other way, then the model is not optimal for that language.
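The Pareto check described above can be sketched in Python. The helper names `dominates` and `pareto_optimal` are illustrative, and the sample rows are copied from the English table below; this is a sketch of the stated definition, not the script used to generate the tables.

```python
from typing import NamedTuple


class Model(NamedTuple):
    name: str
    memory_mib: float
    time_ms: float
    recall: float


def dominates(a: Model, b: Model) -> bool:
    """True if `a` is at least as good as `b` in every respect and better in one."""
    at_least_as_good = (
        a.memory_mib <= b.memory_mib
        and a.time_ms <= b.time_ms
        and a.recall >= b.recall
    )
    strictly_better = (
        a.memory_mib < b.memory_mib
        or a.time_ms < b.time_ms
        or a.recall > b.recall
    )
    return at_least_as_good and strictly_better


def pareto_optimal(models):
    """Keep the models that no other model dominates, preserving input order."""
    return [m for m in models if not any(dominates(o, m) for o in models)]


# A few rows copied from the English table.
sample = [
    Model("ViT-B-16-SigLIP2__webli", 3038, 5.81, 75.19),
    Model("ViT-B-16-SigLIP-i18n-256__webli", 3029, 6.87, 69.38),
    Model("ViT-B-16-SigLIP__webli", 1081, 5.77, 71.64),
    Model("ViT-B-16-SigLIP-256__webli", 1102, 7.11, 71.63),
    Model("ViT-B-32__openai", 1004, 2.26, 59.37),
]

optimal = pareto_optimal(sample)
print([m.name for m in optimal])
```

In this sample, ViT-B-16-SigLIP__webli uses less memory, runs faster, and has higher recall than both ViT-B-16-SigLIP-256__webli and ViT-B-16-SigLIP-i18n-256__webli, so those two are filtered out.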
English
Model | Memory (MiB) | Execution Time (ms) | Recall (%) | Pareto Optimal |
---|---|---|---|---|
ViT-H-14-378-quickgelu__dfn5b | 5049 | 108.4 | 75.73 | ✅ |
ViT-L-16-SigLIP2-512__webli | 3358 | 92.59 | 75.44 | ✅ |
ViT-B-16-SigLIP2__webli | 3038 | 5.81 | 75.19 | ✅ |
ViT-SO400M-16-SigLIP2-384__webli | 3854 | 56.57 | 75.09 | ❌ |
ViT-SO400M-16-SigLIP2-512__webli | 4050 | 107.67 | 75.07 | ❌ |
ViT-H-14-quickgelu__dfn5b | 4701 | 38.74 | 75.01 | ❌ |
ViT-L-16-SigLIP2-384__webli | 3057 | 51.7 | 74.92 | ❌ |
ViT-gopt-16-SigLIP2-384__webli | 6585 | 146.84 | 74.9 | ❌ |
ViT-SO400M-14-SigLIP2-378__webli | 3940 | 72.25 | 74.87 | ❌ |
ViT-SO400M-16-SigLIP2-256__webli | 3611 | 27.84 | 74.87 | ❌ |
ViT-SO400M-14-SigLIP2__webli | 3622 | 27.63 | 74.77 | ❌ |
ViT-gopt-16-SigLIP2-256__webli | 6475 | 64.51 | 74.28 | ❌ |
ViT-L-16-SigLIP2-256__webli | 2830 | 23.77 | 74.26 | ✅ |
ViT-B-32-SigLIP2-256__webli | 3061 | 3.31 | 73.15 | ✅ |
ViT-L-14-quickgelu__dfn2b | 2212 | 20.49 | 72.78 | ✅ |
ViT-SO400M-14-SigLIP-384__webli | 4417 | 72.19 | 72.58 | ❌ |
ViT-L-16-SigLIP-384__webli | 3396 | 47.6 | 72.57 | ❌ |
ViT-B-16-SigLIP-512__webli | 1828 | 26.17 | 72.47 | ✅ |
ViT-B-16-SigLIP-384__webli | 1128 | 13.53 | 72.45 | ✅ |
ViT-L-16-SigLIP-256__webli | 3160 | 23.84 | 72.44 | ❌ |
nllb-clip-large-siglip__mrl | 4248 | 75.44 | 72.37 | ❌ |
ViT-B-16-SigLIP__webli | 1081 | 5.77 | 71.64 | ✅ |
ViT-B-16-SigLIP-256__webli | 1102 | 7.11 | 71.63 | ❌ |
XLM-Roberta-Large-ViT-H-14__frozen_laion5b_s13b_b90k | 4014 | 39.14 | 71.45 | ❌ |
ViT-H-14__laion2b-s32b-b79k | 4676 | 39.06 | 71.33 | ❌ |
nllb-clip-large-siglip__v1 | 4226 | 75.05 | 71.19 | ❌ |
ViT-L-14__laion2b-s32b-b82k | 2233 | 20.56 | 69.86 | ❌ |
nllb-clip-base-siglip__mrl | 4696 | 16.95 | 69.66 | ❌ |
ViT-B-16-SigLIP-i18n-256__webli | 3029 | 6.87 | 69.38 | ❌ |
ViT-B-32__laion2b-s34b-b79k | 1001 | 2.29 | 68.78 | ✅ |
ViT-L-14__laion400m_e31 | 2183 | 19.87 | 68.53 | ❌ |
ViT-B-16-plus-240__laion400m_e32 | 1246 | 6.95 | 68.53 | ❌ |
ViT-B-16-plus-240__laion400m_e31 | 1263 | 6.94 | 68.53 | ❌ |
ViT-L-14__laion400m_e32 | 2218 | 19.73 | 68.51 | ❌ |
nllb-clip-base-siglip__v1 | 4675 | 15.17 | 68.41 | ❌ |
ViT-B-32__laion2b_e16 | 1004 | 2.38 | 68.41 | ❌ |
XLM-Roberta-Base-ViT-B-32__laion5b_s13b_b90k | 3030 | 3.2 | 68.33 | ❌ |
ViT-B-16__laion400m_e31 | 991 | 5.04 | 66.96 | ✅ |
ViT-B-16__laion400m_e32 | 975 | 4.98 | 66.95 | ✅ |
ViT-B-32__laion400m_e31 | 999 | 2.28 | 65.65 | ✅ |
ViT-B-32__laion400m_e32 | 1003 | 2.35 | 65.49 | ❌ |
ViT-L-14__openai | 2212 | 19.91 | 60.12 | ❌ |
ViT-B-32__openai | 1004 | 2.26 | 59.37 | ✅ |
RN50x64__openai | 5079 | 48.79 | 59.36 | ❌ |
RN50x16__openai | 2221 | 15.87 | 59.17 | ❌ |
ViT-L-14-336__openai | 2616 | 43.45 | 59.09 | ❌ |
RN50__openai | 913 | 2.39 | 58.32 | ✅ |
ViT-B-16__openai | 985 | 5.03 | 58.27 | ❌ |
RN50x4__openai | 1416 | 5.85 | 57.88 | ❌ |
RN50__cc12m | 914 | 2.37 | 57.75 | ✅ |
RN101__openai | 1111 | 3.21 | 57.7 | ❌ |
RN101__yfcc15m | 1111 | 3.22 | 50.11 | ❌ |
RN50__yfcc15m | 908 | 2.34 | 48.28 | ✅ |
Arabic
Model | Memory (MiB) | Execution Time (ms) | Recall (%) | Pareto Optimal |
---|---|---|---|---|
nllb-clip-large-siglip__mrl | 4248 | 75.44 | 77.3 | ✅ |
nllb-clip-large-siglip__v1 | 4226 | 75.05 | 76.44 | ✅ |
nllb-clip-base-siglip__mrl | 4696 | 16.95 | 74.03 | ✅ |
nllb-clip-base-siglip__v1 | 4675 | 15.17 | 73.19 | ✅ |
ViT-SO400M-16-SigLIP2-384__webli | 3854 | 56.57 | 69.31 | ✅ |
ViT-SO400M-16-SigLIP2-512__webli | 4050 | 107.67 | 69.29 | ❌ |
ViT-SO400M-14-SigLIP2-378__webli | 3940 | 72.25 | 69.29 | ❌ |
ViT-SO400M-16-SigLIP2-256__webli | 3611 | 27.84 | 68.64 | ✅ |
ViT-L-16-SigLIP2-512__webli | 3358 | 92.59 | 68.35 | ✅ |
ViT-L-16-SigLIP2-384__webli | 3057 | 51.7 | 68.25 | ✅ |
ViT-SO400M-14-SigLIP2__webli | 3622 | 27.63 | 68.23 | ✅ |
ViT-gopt-16-SigLIP2-384__webli | 6585 | 146.84 | 67.56 | ❌ |
ViT-gopt-16-SigLIP2-256__webli | 6475 | 64.51 | 67.28 | ❌ |
ViT-L-16-SigLIP2-256__webli | 2830 | 23.77 | 66.89 | ✅ |
XLM-Roberta-Large-ViT-H-14__frozen_laion5b_s13b_b90k | 4014 | 39.14 | 66.52 | ❌ |
ViT-B-16-SigLIP-i18n-256__webli | 3029 | 6.87 | 64.1 | ✅ |
ViT-B-16-SigLIP2__webli | 3038 | 5.81 | 61.71 | ✅ |
ViT-B-32-SigLIP2-256__webli | 3061 | 3.31 | 60.7 | ✅ |
XLM-Roberta-Base-ViT-B-32__laion5b_s13b_b90k | 3030 | 3.2 | 59.66 | ✅ |
Bengali
Model | Memory (MiB) | Execution Time (ms) | Recall (%) | Pareto Optimal |
---|---|---|---|---|
nllb-clip-large-siglip__v1 | 4226 | 75.05 | 76.16 | ✅ |
nllb-clip-large-siglip__mrl | 4248 | 75.44 | 75.83 | ❌ |
nllb-clip-base-siglip__mrl | 4696 | 16.95 | 73.75 | ✅ |
nllb-clip-base-siglip__v1 | 4675 | 15.17 | 73.34 | ✅ |
ViT-B-16-SigLIP-i18n-256__webli | 3029 | 6.87 | 36.43 | ✅ |
ViT-SO400M-14-SigLIP2__webli | 3622 | 27.63 | 26.56 | ❌ |
ViT-SO400M-16-SigLIP2-256__webli | 3611 | 27.84 | 26.54 | ❌ |
ViT-SO400M-16-SigLIP2-384__webli | 3854 | 56.57 | 26.19 | ❌ |
ViT-SO400M-14-SigLIP2-378__webli | 3940 | 72.25 | 26.19 | ❌ |
ViT-SO400M-16-SigLIP2-512__webli | 4050 | 107.67 | 25.92 | ❌ |
ViT-gopt-16-SigLIP2-384__webli | 6585 | 146.84 | 25.15 | ❌ |
ViT-gopt-16-SigLIP2-256__webli | 6475 | 64.51 | 24.18 | ❌ |
ViT-L-16-SigLIP2-384__webli | 3057 | 51.7 | 21.44 | ❌ |
ViT-L-16-SigLIP2-512__webli | 3358 | 92.59 | 21.11 | ❌ |
ViT-L-16-SigLIP2-256__webli | 2830 | 23.77 | 20.94 | ✅ |
Chinese (Simplified)
Model | Memory (MiB) | Execution Time (ms) | Recall (%) | Pareto Optimal |
---|---|---|---|---|
nllb-clip-large-siglip__v1 | 4226 | 75.05 | 77.49 | ✅ |
XLM-Roberta-Large-ViT-H-14__frozen_laion5b_s13b_b90k | 4014 | 39.14 | 77.19 | ✅ |
nllb-clip-large-siglip__mrl | 4248 | 75.44 | 76.98 | ❌ |
nllb-clip-base-siglip__mrl | 4696 | 16.95 | 72.89 | ✅ |
nllb-clip-base-siglip__v1 | 4675 | 15.17 | 72.65 | ✅ |
XLM-Roberta-Base-ViT-B-32__laion5b_s13b_b90k | 3030 | 3.2 | 72.52 | ✅ |
ViT-gopt-16-SigLIP2-384__webli | 6585 | 146.84 | 67.83 | ❌ |
ViT-SO400M-16-SigLIP2-512__webli | 4050 | 107.67 | 67.81 | ❌ |
ViT-gopt-16-SigLIP2-256__webli | 6475 | 64.51 | 67.51 | ❌ |
ViT-L-16-SigLIP2-512__webli | 3358 | 92.59 | 67.39 | ❌ |
ViT-SO400M-16-SigLIP2-384__webli | 3854 | 56.57 | 67.33 | ❌ |
ViT-SO400M-14-SigLIP2-378__webli | 3940 | 72.25 | 67.23 | ❌ |
ViT-L-16-SigLIP2-384__webli | 3057 | 51.7 | 67.05 | ❌ |
ViT-B-16-SigLIP-i18n-256__webli | 3029 | 6.87 | 66.87 | ✅ |
ViT-SO400M-16-SigLIP2-256__webli | 3611 | 27.84 | 66.24 | ❌ |
ViT-L-16-SigLIP2-256__webli | 2830 | 23.77 | 66.1 | ✅ |
ViT-SO400M-14-SigLIP2__webli | 3622 | 27.63 | 65.56 | ❌ |
ViT-B-16-SigLIP2__webli | 3038 | 5.81 | 64.39 | ❌ |
ViT-B-32-SigLIP2-256__webli | 3061 | 3.31 | 62.56 | ❌ |
Croatian
Model | Memory (MiB) | Execution Time (ms) | Recall (%) | Pareto Optimal |
---|---|---|---|---|
nllb-clip-large-siglip__mrl | 4248 | 75.44 | 87.46 | ✅ |
nllb-clip-large-siglip__v1 | 4226 | 75.05 | 87.19 | ✅ |
nllb-clip-base-siglip__mrl | 4696 | 16.95 | 82.98 | ✅ |
nllb-clip-base-siglip__v1 | 4675 | 15.17 | 82.92 | ✅ |
XLM-Roberta-Large-ViT-H-14__frozen_laion5b_s13b_b90k | 4014 | 39.14 | 81.93 | ✅ |
ViT-SO400M-14-SigLIP2-378__webli | 3940 | 72.25 | 73.77 | ✅ |
ViT-SO400M-16-SigLIP2-512__webli | 4050 | 107.67 | 73.21 | ❌ |
ViT-SO400M-16-SigLIP2-384__webli | 3854 | 56.57 | 73.2 | ✅ |
ViT-gopt-16-SigLIP2-256__webli | 6475 | 64.51 | 72.95 | ❌ |
ViT-SO400M-16-SigLIP2-256__webli | 3611 | 27.84 | 72.89 | ✅ |
ViT-gopt-16-SigLIP2-384__webli | 6585 | 146.84 | 72.88 | ❌ |
ViT-SO400M-14-SigLIP2__webli | 3622 | 27.63 | 72.85 | ✅ |
XLM-Roberta-Base-ViT-B-32__laion5b_s13b_b90k | 3030 | 3.2 | 72.69 | ✅ |
ViT-L-16-SigLIP2-512__webli | 3358 | 92.59 | 70.73 | ❌ |
ViT-B-16-SigLIP-i18n-256__webli | 3029 | 6.87 | 70.45 | ✅ |
ViT-L-16-SigLIP2-384__webli | 3057 | 51.7 | 70.43 | ❌ |
ViT-L-16-SigLIP2-256__webli | 2830 | 23.77 | 69.97 | ✅ |
ViT-B-16-SigLIP2__webli | 3038 | 5.81 | 54.31 | ❌ |
ViT-B-32-SigLIP2-256__webli | 3061 | 3.31 | 53.3 | ❌ |
ViT-H-14-378-quickgelu__dfn5b | 5049 | 108.4 | 35.64 | ❌ |
ViT-H-14-quickgelu__dfn5b | 4701 | 38.74 | 35.17 | ❌ |
ViT-L-16-SigLIP-256__webli | 3160 | 23.84 | 33.65 | ❌ |
ViT-L-16-SigLIP-384__webli | 3396 | 47.6 | 33.55 | ❌ |
ViT-B-16-SigLIP-256__webli | 1102 | 7.11 | 20.05 | ✅ |
Cusco Quechua
Model | Memory (MiB) | Execution Time (ms) | Recall (%) | Pareto Optimal |
---|---|---|---|---|
nllb-clip-large-siglip__mrl | 4248 | 75.44 | 38.08 | ✅ |
nllb-clip-large-siglip__v1 | 4226 | 75.05 | 37.87 | ✅ |
nllb-clip-base-siglip__mrl | 4696 | 16.95 | 33.41 | ✅ |
nllb-clip-base-siglip__v1 | 4675 | 15.17 | 33.06 | ✅ |
Czech
Model | Memory (MiB) | Execution Time (ms) | Recall (%) | Pareto Optimal |
---|---|---|---|---|
nllb-clip-large-siglip__mrl | 4248 | 75.44 | 73.76 | ✅ |
nllb-clip-large-siglip__v1 | 4226 | 75.05 | 71.57 | ✅ |
nllb-clip-base-siglip__mrl | 4696 | 16.95 | 69.86 | ✅ |
XLM-Roberta-Large-ViT-H-14__frozen_laion5b_s13b_b90k | 4014 | 39.14 | 67.49 | ✅ |
nllb-clip-base-siglip__v1 | 4675 | 15.17 | 67.15 | ✅ |
ViT-gopt-16-SigLIP2-384__webli | 6585 | 146.84 | 63.62 | ❌ |
ViT-SO400M-14-SigLIP2-378__webli | 3940 | 72.25 | 63.35 | ✅ |
ViT-gopt-16-SigLIP2-256__webli | 6475 | 64.51 | 63.09 | ❌ |
ViT-SO400M-16-SigLIP2-512__webli | 4050 | 107.67 | 63.07 | ❌ |
ViT-SO400M-16-SigLIP2-384__webli | 3854 | 56.57 | 62.98 | ✅ |
ViT-SO400M-16-SigLIP2-256__webli | 3611 | 27.84 | 62.82 | ✅ |
ViT-SO400M-14-SigLIP2__webli | 3622 | 27.63 | 62.73 | ✅ |
ViT-L-16-SigLIP2-512__webli | 3358 | 92.59 | 62.29 | ✅ |
ViT-L-16-SigLIP2-384__webli | 3057 | 51.7 | 62.12 | ✅ |
ViT-L-16-SigLIP2-256__webli | 2830 | 23.77 | 61.74 | ✅ |
ViT-B-16-SigLIP-i18n-256__webli | 3029 | 6.87 | 61.52 | ✅ |
XLM-Roberta-Base-ViT-B-32__laion5b_s13b_b90k | 3030 | 3.2 | 61.01 | ✅ |
ViT-B-16-SigLIP2__webli | 3038 | 5.81 | 54.81 | ❌ |
ViT-B-32-SigLIP2-256__webli | 3061 | 3.31 | 54.31 | ❌ |
ViT-L-16-SigLIP-256__webli | 3160 | 23.84 | 33.58 | ❌ |
ViT-L-16-SigLIP-384__webli | 3396 | 47.6 | 33.48 | ❌ |
ViT-H-14-378-quickgelu__dfn5b | 5049 | 108.4 | 32.38 | ❌ |
ViT-H-14-quickgelu__dfn5b | 4701 | 38.74 | 32.32 | ❌ |
ViT-B-16-SigLIP__webli | 1081 | 5.77 | 22.89 | ✅ |
ViT-B-16-SigLIP-512__webli | 1828 | 26.17 | 22.66 | ❌ |
ViT-B-16-SigLIP-256__webli | 1102 | 7.11 | 22.6 | ❌ |
ViT-B-16-SigLIP-384__webli | 1128 | 13.53 | 22.25 | ❌ |
Danish
Model | Memory (MiB) | Execution Time (ms) | Recall (%) | Pareto Optimal |
---|---|---|---|---|
nllb-clip-large-siglip__v1 | 4226 | 75.05 | 87.16 | ✅ |
nllb-clip-large-siglip__mrl | 4248 | 75.44 | 86.88 | ❌ |
nllb-clip-base-siglip__mrl | 4696 | 16.95 | 84.18 | ✅ |
nllb-clip-base-siglip__v1 | 4675 | 15.17 | 84.03 | ✅ |
ViT-gopt-16-SigLIP2-384__webli | 6585 | 146.84 | 83.75 | ❌ |
XLM-Roberta-Large-ViT-H-14__frozen_laion5b_s13b_b90k | 4014 | 39.14 | 83.32 | ✅ |
ViT-gopt-16-SigLIP2-256__webli | 6475 | 64.51 | 83.25 | ❌ |
ViT-SO400M-16-SigLIP2-384__webli | 3854 | 56.57 | 82.3 | ✅ |
ViT-SO400M-14-SigLIP2-378__webli | 3940 | 72.25 | 82.19 | ❌ |
ViT-SO400M-16-SigLIP2-512__webli | 4050 | 107.67 | 81.87 | ❌ |
ViT-SO400M-14-SigLIP2__webli | 3622 | 27.63 | 81.44 | ✅ |
ViT-SO400M-16-SigLIP2-256__webli | 3611 | 27.84 | 81.42 | ✅ |
ViT-L-16-SigLIP2-512__webli | 3358 | 92.59 | 80.0 | ✅ |
ViT-L-16-SigLIP2-384__webli | 3057 | 51.7 | 79.82 | ✅ |
ViT-L-16-SigLIP2-256__webli | 2830 | 23.77 | 79.08 | ✅ |
XLM-Roberta-Base-ViT-B-32__laion5b_s13b_b90k | 3030 | 3.2 | 75.07 | ✅ |
ViT-B-16-SigLIP-i18n-256__webli | 3029 | 6.87 | 74.84 | ✅ |
ViT-B-16-SigLIP2__webli | 3038 | 5.81 | 67.68 | ❌ |
ViT-B-32-SigLIP2-256__webli | 3061 | 3.31 | 67.2 | ❌ |
ViT-H-14-quickgelu__dfn5b | 4701 | 38.74 | 65.59 | ❌ |
ViT-H-14-378-quickgelu__dfn5b | 5049 | 108.4 | 65.36 | ❌ |
ViT-L-14-quickgelu__dfn2b | 2212 | 20.49 | 42.31 | ✅ |
ViT-L-16-SigLIP-256__webli | 3160 | 23.84 | 41.46 | ❌ |
ViT-L-16-SigLIP-384__webli | 3396 | 47.6 | 40.52 | ❌ |
ViT-B-16-SigLIP-512__webli | 1828 | 26.17 | 31.31 | ✅ |
ViT-B-16-SigLIP-256__webli | 1102 | 7.11 | 30.97 | ✅ |
ViT-B-16-SigLIP__webli | 1081 | 5.77 | 30.87 | ✅ |
ViT-B-16-SigLIP-384__webli | 1128 | 13.53 | 30.51 | ❌ |
Dutch
Model | Memory (MiB) | Execution Time (ms) | Recall (%) | Pareto Optimal |
---|---|---|---|---|
ViT-SO400M-16-SigLIP2-512__webli | 4050 | 107.67 | 80.05 | ✅ |
ViT-gopt-16-SigLIP2-384__webli | 6585 | 146.84 | 79.81 | ❌ |
ViT-SO400M-14-SigLIP2-378__webli | 3940 | 72.25 | 79.72 | ❌ |
ViT-SO400M-16-SigLIP2-384__webli | 3854 | 56.57 | 79.72 | ✅ |
ViT-L-16-SigLIP2-512__webli | 3358 | 92.59 | 79.64 | ✅ |
ViT-L-16-SigLIP2-384__webli | 3057 | 51.7 | 79.49 | ✅ |
nllb-clip-large-siglip__mrl | 4248 | 75.44 | 79.41 | ❌ |
nllb-clip-large-siglip__v1 | 4226 | 75.05 | 79.31 | ❌ |
ViT-SO400M-16-SigLIP2-256__webli | 3611 | 27.84 | 78.92 | ✅ |
ViT-SO400M-14-SigLIP2__webli | 3622 | 27.63 | 78.48 | ✅ |
ViT-gopt-16-SigLIP2-256__webli | 6475 | 64.51 | 78.22 | ❌ |
ViT-L-16-SigLIP2-256__webli | 2830 | 23.77 | 78.0 | ✅ |
ViT-H-14-378-quickgelu__dfn5b | 5049 | 108.4 | 77.22 | ❌ |
ViT-H-14-quickgelu__dfn5b | 4701 | 38.74 | 76.69 | ❌ |
nllb-clip-base-siglip__mrl | 4696 | 16.95 | 75.94 | ✅ |
XLM-Roberta-Large-ViT-H-14__frozen_laion5b_s13b_b90k | 4014 | 39.14 | 75.6 | ❌ |
ViT-B-16-SigLIP2__webli | 3038 | 5.81 | 75.33 | ✅ |
nllb-clip-base-siglip__v1 | 4675 | 15.17 | 75.04 | ❌ |
ViT-L-16-SigLIP-384__webli | 3396 | 47.6 | 72.97 | ❌ |
ViT-B-32-SigLIP2-256__webli | 3061 | 3.31 | 72.72 | ✅ |
ViT-B-16-SigLIP-i18n-256__webli | 3029 | 6.87 | 72.06 | ✅ |
ViT-L-16-SigLIP-256__webli | 3160 | 23.84 | 72.06 | ❌ |
XLM-Roberta-Base-ViT-B-32__laion5b_s13b_b90k | 3030 | 3.2 | 70.81 | ✅ |
ViT-L-14-quickgelu__dfn2b | 2212 | 20.49 | 69.82 | ✅ |
ViT-SO400M-14-SigLIP-384__webli | 4417 | 72.19 | 67.54 | ❌ |
ViT-B-16-SigLIP-512__webli | 1828 | 26.17 | 66.77 | ✅ |
ViT-B-16-SigLIP-384__webli | 1128 | 13.53 | 66.6 | ✅ |
ViT-B-16-SigLIP-256__webli | 1102 | 7.11 | 65.67 | ✅ |
ViT-B-16-SigLIP__webli | 1081 | 5.77 | 65.29 | ✅ |
ViT-H-14__laion2b-s32b-b79k | 4676 | 39.06 | 41.1 | ❌ |
ViT-L-14__laion2b-s32b-b82k | 2233 | 20.56 | 34.29 | ❌ |
ViT-L-14__laion400m_e31 | 2183 | 19.87 | 29.65 | ❌ |
ViT-L-14__laion400m_e32 | 2218 | 19.73 | 29.56 | ❌ |
ViT-B-32__laion2b-s34b-b79k | 1001 | 2.29 | 29.54 | ✅ |
ViT-B-32__laion2b_e16 | 1004 | 2.38 | 29.36 | ❌ |
ViT-B-16-plus-240__laion400m_e31 | 1263 | 6.94 | 27.76 | ❌ |
ViT-B-16-plus-240__laion400m_e32 | 1246 | 6.95 | 27.76 | ❌ |
ViT-B-16__laion400m_e32 | 975 | 4.98 | 25.67 | ✅ |
ViT-B-32__laion400m_e32 | 1003 | 2.35 | 25.59 | ❌ |
ViT-B-16__laion400m_e31 | 991 | 5.04 | 25.53 | ❌ |
ViT-B-32__laion400m_e31 | 999 | 2.28 | 25.52 | ✅ |
ViT-L-14__openai | 2212 | 19.91 | 22.31 | ❌ |
RN50x64__openai | 5079 | 48.79 | 22.27 | ❌ |
ViT-L-14-336__openai | 2616 | 43.45 | 21.8 | ❌ |
RN50x16__openai | 2221 | 15.87 | 20.69 | ❌ |
Filipino
Model | Memory (MiB) | Execution Time (ms) | Recall (%) | Pareto Optimal |
---|---|---|---|---|
nllb-clip-large-siglip__mrl | 4248 | 75.44 | 67.57 | ✅ |
nllb-clip-large-siglip__v1 | 4226 | 75.05 | 65.64 | ✅ |
nllb-clip-base-siglip__mrl | 4696 | 16.95 | 61.21 | ✅ |
nllb-clip-base-siglip__v1 | 4675 | 15.17 | 59.42 | ✅ |
ViT-B-16-SigLIP-i18n-256__webli | 3029 | 6.87 | 36.81 | ✅ |
ViT-gopt-16-SigLIP2-384__webli | 6585 | 146.84 | 35.72 | ❌ |
ViT-gopt-16-SigLIP2-256__webli | 6475 | 64.51 | 34.75 | ❌ |
ViT-SO400M-14-SigLIP2-378__webli | 3940 | 72.25 | 34.63 | ❌ |
ViT-SO400M-16-SigLIP2-512__webli | 4050 | 107.67 | 34.39 | ❌ |
ViT-SO400M-16-SigLIP2-384__webli | 3854 | 56.57 | 34.27 | ❌ |
ViT-SO400M-14-SigLIP2__webli | 3622 | 27.63 | 34.14 | ❌ |
ViT-SO400M-16-SigLIP2-256__webli | 3611 | 27.84 | 33.98 | ❌ |
ViT-L-16-SigLIP2-512__webli | 3358 | 92.59 | 30.57 | ❌ |
ViT-L-16-SigLIP2-384__webli | 3057 | 51.7 | 30.57 | ❌ |
ViT-L-16-SigLIP2-256__webli | 2830 | 23.77 | 30.05 | ✅ |
ViT-L-16-SigLIP-384__webli | 3396 | 47.6 | 24.92 | ❌ |
ViT-L-16-SigLIP-256__webli | 3160 | 23.84 | 24.02 | ❌ |
ViT-B-16-SigLIP2__webli | 3038 | 5.81 | 23.37 | ✅ |
ViT-B-32-SigLIP2-256__webli | 3061 | 3.31 | 22.69 | ✅ |
Finnish
Model | Memory (MiB) | Execution Time (ms) | Recall (%) | Pareto Optimal |
---|---|---|---|---|
nllb-clip-large-siglip__mrl | 4248 | 75.44 | 84.27 | ✅ |
nllb-clip-large-siglip__v1 | 4226 | 75.05 | 83.93 | ✅ |
nllb-clip-base-siglip__mrl | 4696 | 16.95 | 79.41 | ✅ |
nllb-clip-base-siglip__v1 | 4675 | 15.17 | 78.94 | ✅ |
XLM-Roberta-Large-ViT-H-14__frozen_laion5b_s13b_b90k | 4014 | 39.14 | 75.49 | ✅ |
ViT-gopt-16-SigLIP2-384__webli | 6585 | 146.84 | 63.46 | ❌ |
ViT-B-16-SigLIP-i18n-256__webli | 3029 | 6.87 | 63.16 | ✅ |
XLM-Roberta-Base-ViT-B-32__laion5b_s13b_b90k | 3030 | 3.2 | 63.08 | ✅ |
ViT-gopt-16-SigLIP2-256__webli | 6475 | 64.51 | 63.03 | ❌ |
ViT-SO400M-16-SigLIP2-384__webli | 3854 | 56.57 | 62.28 | ❌ |
ViT-SO400M-16-SigLIP2-256__webli | 3611 | 27.84 | 61.92 | ❌ |
ViT-SO400M-14-SigLIP2-378__webli | 3940 | 72.25 | 61.81 | ❌ |
ViT-SO400M-14-SigLIP2__webli | 3622 | 27.63 | 61.76 | ❌ |
ViT-SO400M-16-SigLIP2-512__webli | 4050 | 107.67 | 61.05 | ❌ |
ViT-L-16-SigLIP2-384__webli | 3057 | 51.7 | 57.8 | ❌ |
ViT-L-16-SigLIP2-512__webli | 3358 | 92.59 | 57.69 | ❌ |
ViT-L-16-SigLIP2-256__webli | 2830 | 23.77 | 57.05 | ✅ |
ViT-B-16-SigLIP2__webli | 3038 | 5.81 | 40.26 | ❌ |
ViT-B-32-SigLIP2-256__webli | 3061 | 3.31 | 40.06 | ❌ |
ViT-L-16-SigLIP-256__webli | 3160 | 23.84 | 31.75 | ❌ |
ViT-L-16-SigLIP-384__webli | 3396 | 47.6 | 31.74 | ❌ |
French
Model | Memory (MiB) | Execution Time (ms) | Recall (%) | Pareto Optimal |
---|---|---|---|---|
ViT-SO400M-16-SigLIP2-512__webli | 4050 | 107.67 | 88.01 | ✅ |
ViT-H-14-378-quickgelu__dfn5b | 5049 | 108.4 | 87.74 | ❌ |
ViT-SO400M-14-SigLIP2-378__webli | 3940 | 72.25 | 87.69 | ✅ |
ViT-SO400M-16-SigLIP2-384__webli | 3854 | 56.57 | 87.6 | ✅ |
ViT-H-14-quickgelu__dfn5b | 4701 | 38.74 | 87.58 | ✅ |
ViT-L-16-SigLIP2-512__webli | 3358 | 92.59 | 87.51 | ✅ |
ViT-gopt-16-SigLIP2-384__webli | 6585 | 146.84 | 87.23 | ❌ |
ViT-L-16-SigLIP2-384__webli | 3057 | 51.7 | 86.9 | ✅ |
ViT-SO400M-16-SigLIP2-256__webli | 3611 | 27.84 | 86.9 | ✅ |
ViT-SO400M-14-SigLIP2__webli | 3622 | 27.63 | 86.44 | ✅ |
ViT-gopt-16-SigLIP2-256__webli | 6475 | 64.51 | 86.44 | ❌ |
nllb-clip-large-siglip__mrl | 4248 | 75.44 | 86.28 | ❌ |
nllb-clip-large-siglip__v1 | 4226 | 75.05 | 86.11 | ❌ |
ViT-L-16-SigLIP2-256__webli | 2830 | 23.77 | 86.08 | ✅ |
XLM-Roberta-Large-ViT-H-14__frozen_laion5b_s13b_b90k | 4014 | 39.14 | 84.49 | ❌ |
ViT-B-16-SigLIP2__webli | 3038 | 5.81 | 84.3 | ✅ |
ViT-L-14-quickgelu__dfn2b | 2212 | 20.49 | 83.03 | ✅ |
nllb-clip-base-siglip__mrl | 4696 | 16.95 | 82.93 | ❌ |
ViT-B-32-SigLIP2-256__webli | 3061 | 3.31 | 82.27 | ✅ |
nllb-clip-base-siglip__v1 | 4675 | 15.17 | 82.14 | ❌ |
ViT-L-16-SigLIP-384__webli | 3396 | 47.6 | 80.96 | ❌ |
ViT-L-16-SigLIP-256__webli | 3160 | 23.84 | 80.64 | ❌ |
ViT-B-16-SigLIP-i18n-256__webli | 3029 | 6.87 | 80.28 | ✅ |
XLM-Roberta-Base-ViT-B-32__laion5b_s13b_b90k | 3030 | 3.2 | 79.65 | ✅ |
ViT-B-16-SigLIP-512__webli | 1828 | 26.17 | 77.4 | ✅ |
ViT-B-16-SigLIP-384__webli | 1128 | 13.53 | 76.88 | ✅ |
ViT-B-16-SigLIP__webli | 1081 | 5.77 | 76.3 | ✅ |
ViT-B-16-SigLIP-256__webli | 1102 | 7.11 | 75.68 | ❌ |
ViT-SO400M-14-SigLIP-384__webli | 4417 | 72.19 | 69.59 | ❌ |
ViT-H-14__laion2b-s32b-b79k | 4676 | 39.06 | 68.36 | ❌ |
ViT-L-14__laion2b-s32b-b82k | 2233 | 20.56 | 61.78 | ❌ |
ViT-L-14__laion400m_e32 | 2218 | 19.73 | 58.4 | ❌ |
ViT-L-14__laion400m_e31 | 2183 | 19.87 | 58.35 | ❌ |
ViT-B-16-plus-240__laion400m_e31 | 1263 | 6.94 | 57.17 | ❌ |
ViT-B-16-plus-240__laion400m_e32 | 1246 | 6.95 | 57.17 | ❌ |
ViT-B-32__laion2b_e16 | 1004 | 2.38 | 57.05 | ✅ |
ViT-B-32__laion2b-s34b-b79k | 1001 | 2.29 | 56.08 | ✅ |
ViT-B-16__laion400m_e31 | 991 | 5.04 | 52.96 | ✅ |
ViT-B-16__laion400m_e32 | 975 | 4.98 | 52.83 | ✅ |
ViT-B-32__laion400m_e32 | 1003 | 2.35 | 51.88 | ❌ |
ViT-B-32__laion400m_e31 | 999 | 2.28 | 51.82 | ✅ |
RN50x64__openai | 5079 | 48.79 | 42.86 | ❌ |
ViT-L-14-336__openai | 2616 | 43.45 | 42.81 | ❌ |
ViT-L-14__openai | 2212 | 19.91 | 42.54 | ❌ |
RN50x16__openai | 2221 | 15.87 | 41.72 | ❌ |
RN50x4__openai | 1416 | 5.85 | 38.85 | ❌ |
RN101__openai | 1111 | 3.21 | 36.79 | ❌ |
ViT-B-16__openai | 985 | 5.03 | 36.47 | ❌ |
ViT-B-32__openai | 1004 | 2.26 | 35.17 | ✅ |
RN50__openai | 913 | 2.39 | 34.44 | ✅ |
German
Model | Memory (MiB) | Execution Time (ms) | Recall (%) | Pareto Optimal |
---|---|---|---|---|
ViT-SO400M-16-SigLIP2-512__webli | 4050 | 107.67 | 90.04 | ✅ |
ViT-SO400M-14-SigLIP2-378__webli | 3940 | 72.25 | 89.97 | ✅ |
ViT-gopt-16-SigLIP2-384__webli | 6585 | 146.84 | 89.85 | ❌ |
ViT-SO400M-16-SigLIP2-384__webli | 3854 | 56.57 | 89.81 | ✅ |
ViT-H-14-378-quickgelu__dfn5b | 5049 | 108.4 | 89.77 | ❌ |
ViT-L-16-SigLIP2-512__webli | 3358 | 92.59 | 89.69 | ✅ |
ViT-SO400M-16-SigLIP2-256__webli | 3611 | 27.84 | 89.45 | ✅ |
ViT-H-14-quickgelu__dfn5b | 4701 | 38.74 | 89.44 | ❌ |
ViT-L-16-SigLIP2-384__webli | 3057 | 51.7 | 89.39 | ✅ |
ViT-SO400M-14-SigLIP2__webli | 3622 | 27.63 | 89.35 | ✅ |
ViT-gopt-16-SigLIP2-256__webli | 6475 | 64.51 | 89.03 | ❌ |
ViT-L-16-SigLIP2-256__webli | 2830 | 23.77 | 88.82 | ✅ |
nllb-clip-large-siglip__mrl | 4248 | 75.44 | 88.55 | ❌ |
nllb-clip-large-siglip__v1 | 4226 | 75.05 | 88.42 | ❌ |
XLM-Roberta-Large-ViT-H-14__frozen_laion5b_s13b_b90k | 4014 | 39.14 | 87.19 | ❌ |
ViT-B-16-SigLIP2__webli | 3038 | 5.81 | 86.44 | ✅ |
ViT-L-14-quickgelu__dfn2b | 2212 | 20.49 | 84.81 | ✅ |
nllb-clip-base-siglip__v1 | 4675 | 15.17 | 84.81 | ❌ |
nllb-clip-base-siglip__mrl | 4696 | 16.95 | 84.58 | ❌ |
ViT-B-32-SigLIP2-256__webli | 3061 | 3.31 | 84.44 | ✅ |
ViT-B-16-SigLIP-i18n-256__webli | 3029 | 6.87 | 83.33 | ✅ |
ViT-L-16-SigLIP-384__webli | 3396 | 47.6 | 82.75 | ❌ |
ViT-L-16-SigLIP-256__webli | 3160 | 23.84 | 82.32 | ❌ |
XLM-Roberta-Base-ViT-B-32__laion5b_s13b_b90k | 3030 | 3.2 | 81.63 | ✅ |
ViT-B-16-SigLIP-512__webli | 1828 | 26.17 | 76.76 | ✅ |
ViT-B-16-SigLIP-384__webli | 1128 | 13.53 | 76.33 | ✅ |
ViT-B-16-SigLIP__webli | 1081 | 5.77 | 75.19 | ✅ |
ViT-B-16-SigLIP-256__webli | 1102 | 7.11 | 75.07 | ❌ |
ViT-SO400M-14-SigLIP-384__webli | 4417 | 72.19 | 64.61 | ❌ |
ViT-H-14__laion2b-s32b-b79k | 4676 | 39.06 | 52.81 | ❌ |
ViT-L-14__laion2b-s32b-b82k | 2233 | 20.56 | 42.88 | ❌ |
ViT-L-14__laion400m_e32 | 2218 | 19.73 | 38.65 | ❌ |
ViT-L-14__laion400m_e31 | 2183 | 19.87 | 38.37 | ❌ |
ViT-B-32__laion2b_e16 | 1004 | 2.38 | 37.65 | ✅ |
ViT-B-32__laion2b-s34b-b79k | 1001 | 2.29 | 36.6 | ✅ |
ViT-B-16-plus-240__laion400m_e31 | 1263 | 6.94 | 35.44 | ❌ |
ViT-B-16-plus-240__laion400m_e32 | 1246 | 6.95 | 35.44 | ❌ |
ViT-B-16__laion400m_e31 | 991 | 5.04 | 32.46 | ✅ |
ViT-B-16__laion400m_e32 | 975 | 4.98 | 32.31 | ✅ |
ViT-B-32__laion400m_e31 | 999 | 2.28 | 31.85 | ✅ |
ViT-B-32__laion400m_e32 | 1003 | 2.35 | 31.81 | ❌ |
RN50x64__openai | 5079 | 48.79 | 28.41 | ❌ |
ViT-L-14__openai | 2212 | 19.91 | 27.63 | ❌ |
ViT-L-14-336__openai | 2616 | 43.45 | 27.09 | ❌ |
RN50x16__openai | 2221 | 15.87 | 24.48 | ❌ |
RN50x4__openai | 1416 | 5.85 | 23.49 | ❌ |
RN50__openai | 913 | 2.39 | 20.91 | ✅ |
ViT-B-16__openai | 985 | 5.03 | 20.83 | ❌ |
RN101__openai | 1111 | 3.21 | 20.39 | ❌ |
Greek
Model | Memory (MiB) | Execution Time (ms) | Recall (%) | Pareto Optimal |
---|---|---|---|---|
nllb-clip-large-siglip__mrl | 4248 | 75.44 | 74.58 | ✅ |
nllb-clip-large-siglip__v1 | 4226 | 75.05 | 73.28 | ✅ |
XLM-Roberta-Large-ViT-H-14__frozen_laion5b_s13b_b90k | 4014 | 39.14 | 71.28 | ✅ |
nllb-clip-base-siglip__mrl | 4696 | 16.95 | 69.16 | ✅ |
nllb-clip-base-siglip__v1 | 4675 | 15.17 | 68.21 | ✅ |
XLM-Roberta-Base-ViT-B-32__laion5b_s13b_b90k | 3030 | 3.2 | 64.69 | ✅ |
ViT-gopt-16-SigLIP2-384__webli | 6585 | 146.84 | 61.64 | ❌ |
ViT-gopt-16-SigLIP2-256__webli | 6475 | 64.51 | 61.03 | ❌ |
ViT-SO400M-16-SigLIP2-384__webli | 3854 | 56.57 | 60.63 | ❌ |
ViT-SO400M-14-SigLIP2-378__webli | 3940 | 72.25 | 60.41 | ❌ |
ViT-SO400M-16-SigLIP2-512__webli | 4050 | 107.67 | 60.1 | ❌ |
ViT-SO400M-14-SigLIP2__webli | 3622 | 27.63 | 60.06 | ❌ |
ViT-SO400M-16-SigLIP2-256__webli | 3611 | 27.84 | 60.06 | ❌ |
ViT-L-16-SigLIP2-512__webli | 3358 | 92.59 | 59.44 | ❌ |
ViT-L-16-SigLIP2-384__webli | 3057 | 51.7 | 59.44 |