Linq's AI Retrieval Model Achieves the Top Spot on the HuggingFace MTEB Leaderboard

BOSTON, June 5, 2024 /PRNewswire/ -- Linq, a generative AI startup, announced that its large embedding model "Linq-Embed-Mistral" ranked first in the text retrieval evaluation on HuggingFace's "Massive Text Embedding Benchmark (MTEB)" leaderboard, outpacing competitors like NVIDIA, Salesforce, Google, OpenAI, and Cohere. This evaluation is run by HuggingFace, the world's largest machine learning platform.

Linq's embedding model achieved a score of 60.2 points in the text retrieval category, securing the top position. This placed Linq ahead of NVIDIA, which scored 59.4 points, and Voyage AI, which scored 58.3 points. Google's model followed with a score of 55.7, while OpenAI and Cohere scored 55.4 and 55.0 points, respectively.

The MTEB leaderboard by HuggingFace ranks the performance of embedding models across seven categories: classification, clustering, pair classification, reranking, retrieval, semantic textual similarity (STS), and summarization. Linq's embedding model performed strongly not only in the text retrieval category but also in the other categories, ranking third overall.

The MTEB leaderboard lists more than 300 embedding models, which reflects how competitive the field of embedding model technology has become. Linq's first-place ranking in the retrieval category underscores the strength of its embedding model technology.

Embedding models are critical in generative AI, particularly for addressing the hallucination problem of large language models (LLMs) through retrieval-augmented generation (RAG). RAG allows a model to produce more reliable outputs by retrieving the latest data or internal documents that are not contained in the LLM itself and supplying them as context.
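As an illustration of the retrieval step in RAG, the following is a minimal sketch only. It assumes a toy bag-of-words function in place of a real embedding model; it does not use Linq-Embed-Mistral, and the names `embed`, `cosine`, `retrieve`, and `build_prompt` as well as the sample documents are hypothetical.

```python
import math
from collections import Counter


def embed(text):
    """Toy stand-in for an embedding model: a bag-of-words count vector."""
    return Counter(text.lower().split())


def cosine(a, b):
    """Cosine similarity between two sparse count vectors."""
    dot = sum(a[t] * b[t] for t in a if t in b)
    norm_a = math.sqrt(sum(v * v for v in a.values()))
    norm_b = math.sqrt(sum(v * v for v in b.values()))
    return dot / (norm_a * norm_b) if norm_a and norm_b else 0.0


def retrieve(query, documents, k=1):
    """Return the k documents most similar to the query."""
    q = embed(query)
    ranked = sorted(documents, key=lambda d: cosine(q, embed(d)), reverse=True)
    return ranked[:k]


def build_prompt(query, documents, k=1):
    """Place the retrieved text in the prompt that would be sent to an LLM."""
    context = "\n".join(retrieve(query, documents, k))
    return f"Answer using only this context:\n{context}\n\nQuestion: {query}"


# Hypothetical internal documents that the LLM has never seen.
docs = [
    "the refund policy allows returns within 30 days",
    "the office is closed on public holidays",
    "quarterly revenue grew in the finance division",
]

print(build_prompt("what is the refund policy", docs))
```

In a real system the `embed` function would call a trained embedding model, and the quality of that model determines whether the right document is retrieved, which is what the MTEB retrieval category measures.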

Leading this project, Dr. Junseong Kim stated, "Our research demonstrates that due to the broad topic diversity and challenging difficulty of retrieval data, GPT-generated data is not perfect and requires thorough verification and refinement. Through these processes, we can achieve quality comparable to human-labeled data, ultimately attaining the best retrieval performance based on the MTEB benchmark dataset. This study shows that through elaborate data crafting and filtering using GPT, we can create models optimized for retrieval-augmented generation (RAG) and maximize performance in specific fields." Additionally, he emphasized, "Not only is refined data crucial, but optimized training methodologies and rapid experimental cycles are also key to maximizing retrieval performance."

Linq's Co-founder & CEO, Jacob Choi, emphasized, "Accurate search is crucial for generative AI enterprises' adoption. We're proud to have developed the core embedding model to achieve this, and we'll keep expanding and refining it to ensure precise text searches in specialized fields like finance and legal." Choi noted that while 2023 saw the rise of B2C (business-to-consumer) use cases for generative AI following the advent of ChatGPT, 2024 will see the growth of B2B (business-to-business) applications, enabled by improved accuracy and security technologies.

Massive Text Embedding Benchmark (MTEB) BEIR retrieval scores on HuggingFace, as of May 30, 2024.

[Company Description]

Linq (Wecover Platforms Inc) was founded in 2022 by MIT Electrical and Computer Engineering graduate Jacob Choi and MIT Computational Science and Engineering Ph.D. Subeen Pang. In 2021, Choi was named to Forbes' "30 Under 30" list in the science category for his AI neuromorphic computing research. Linq received early investments from KakaoVentures, Smilegate Investment, and Yellowdog in 2022. In 2023, Linq won the Samsung Open Collaboration hosted by Samsung Financial Networks and was selected for the Fintech cohort of MassChallenge, the largest non-equity accelerator in the U.S., continuing its collaboration with KPMG US.

Contact: Jacob Choi (jacob.choi@getlinq.com)

Source: Linq (Wecover Platforms Inc)
