본문 바로가기
분석

STS embedding models benchmark

by Joseph.Lee 2026. 5. 26.

몇개 검색용으로 쓸 embedding model 을 찾아보고 있는 중에 비교했던 내용을 공유한다.

CPU 에서 동작한 결과이다.

Embedding STS Benchmark

Model Dataset Samples Spearman Elapsed(s) Total tokens Tokens/s Avg sec/pair
jhgan/ko-sbert-sts English(STS-B val) 1500 0.7762 41.30 84667 2050.22 0.02753
jhgan/ko-sbert-sts Korean(KLUE-STS val) 519 0.7863 10.89 20184 1852.63 0.02099
snunlp/KR-SBERT-V40K-klueNLI-augSTS English(STS-B val) 1500 0.6292 48.66 103223 2121.15 0.03244
snunlp/KR-SBERT-V40K-klueNLI-augSTS Korean(KLUE-STS val) 519 0.7341 9.37 16447 1755.90 0.01805
paraphrase-multilingual-MiniLM-L12-v2 English(STS-B val) 1500 0.8747 8.35 53774 6437.41 0.00557
paraphrase-multilingual-MiniLM-L12-v2 Korean(KLUE-STS val) 519 0.6590 3.55 20702 5832.06 0.00684
Xenova/paraphrase-multilingual-MiniLM-L12-v2 (fp16) English(STS-B val) 1500 0.8747 15.47 53774 3475.86 0.01031
Xenova/paraphrase-multilingual-MiniLM-L12-v2 (fp16) Korean(KLUE-STS val) 519 0.6589 9.52 20702 2175.41 0.01834
Xenova/paraphrase-multilingual-MiniLM-L12-v2 (int8) English(STS-B val) 1500 0.8731 15.38 53774 3496.04 0.01025
Xenova/paraphrase-multilingual-MiniLM-L12-v2 (int8) Korean(KLUE-STS val) 519 0.6505 9.30 20702 2226.92 0.01791
Xenova/all-MiniLM-L12-v2 (original) English(STS-B val) 1500 0.8750 12.80 48709 3805.89 0.00853
Xenova/all-MiniLM-L12-v2 (original) Korean(KLUE-STS val) 519 0.3142 33.11 49995 1510.08 0.06379
Xenova/all-MiniLM-L12-v2 (fp16) English(STS-B val) 1500 0.8750 14.63 48709 3328.82 0.00975
Xenova/all-MiniLM-L12-v2 (fp16) Korean(KLUE-STS val) 519 0.3141 39.41 49995 1268.62 0.07593
Xenova/all-MiniLM-L12-v2 (int8) English(STS-B val) 1500 0.8706 13.55 48709 3595.29 0.00903
Xenova/all-MiniLM-L12-v2 (int8) Korean(KLUE-STS val) 519 0.2973 33.15 49995 1507.96 0.06388

 

테스트에 사용 된 코드와 결과는 다음에 있습니다:

https://github.com/jclab-joseph/blog-artifacts/blob/main/2026-05/26-embedding-models/benchmark_results.md

 

blog-artifacts/2026-05/26-embedding-models/benchmark_results.md at main · jclab-joseph/blog-artifacts

Contribute to jclab-joseph/blog-artifacts development by creating an account on GitHub.

github.com

 

 

반응형

'분석' 카테고리의 다른 글

인터넷 등기소 프로그램 (IPRT) 분석  (2) 2024.10.03
Intel vPro CIRA Packet  (0) 2023.01.05

댓글