跳到主要内容
版本:1.10.0

模型支持说明

概述

我们基于推理卡的软硬件打通了众多算法模型,覆盖了大语言模型(LLM)、计算机视觉(CV)、自然语言处理(NLP)、光学字符识别(OCR)、搜索推荐、语音、多模态等主流领域,并且有完整、成熟的软件栈帮助您进行部署和运维。本文以表格形式列举出完成推理任务的模型,以及相关的数据指标。

表格中缩写含义如下:

  • NN:Neural Network

  • fps:frame per second

  • sps:sentence per second

  • pps:product per second

  • fp16:floating point 16-bit

计算机视觉

NN吞吐时延单位时延NN说明板卡数
arcface5154.4fps0.05fp16,参数量41.57M,arcface,输入112*1121
arcface_ir50896.54fps0.0011fp16,参数量41.57M,arcface_ir50,输入112*1121
atmosphere_vulgar7128.6fps0.09fp16,参数量22.5M,atmosphere_vulgar,输入224*2241
BiT1640.2fps0.0006fp16,参数量24.37M,BiT,输入224*2241
centernet_x530.96fps0.0019fp16,参数量13.56M,centernet_x,输入512*5121
Conformer5448.2fps0.09fp16,参数量17.85M,Conformer,输入32*5121
conformer_ctc_zh_trail
_3098504_iter_18000
10863.9fps0.09fp16,参数量1.2M,conformer_ctc_zh_trail_3098504_iter_18000,输入32*5121
content_classify2348.99fps0.0004fp16,参数量8.63M,content_classify,输入260*2601
CSPResNet501595.49fps0.0006fp16,参数量20.6M,Resnet50,输入256*2561
cv_model_016014.6fps0.09fp16,参数量22.75M,cv_model_01,输入224*2241
cv_model_025011.8fps0.12fp16,参数量22.47M,cv_model_02,输入224*2241
cv_model_036273fps0.09fp16,参数量22.47M,cv_model_03,输入224*2241
db_res18_epoach123_up20199.5fps0.64fp16,参数量11.64M,db_res18_epoach123_up20,输入960*4801
deeplabv325.45fps0.0393fp16,参数量55.38M,v3,输入519*5191
DenseNet1216739.4fps0.08fp16,参数量7.67M,densenet121,输入224*2241
EfficientNet-B09584.79fps0.0001fp16,参数量3.86M,B0,输入112*961
EfficientNet-B05398.4fps0.11fp16,参数量5.02M,B0,输入224*2241
EfficientNet-B13277.67fps0.0003fp16,参数量7.4M,B1,输入240*2401
EfficientNet-B51343.33fps0.0007fp16,参数量28.9M,B5,输入224*2241
EfficientNetV21974.27fps0.0005fp16,参数量12.96M,V2,输入288*2881
EfficientNetV2_s5950.4fps0.09fp16,参数量19.29M,V2,输入224*2241
face_bbox_landmark_dets862.95fps0.0012fp16,参数量4.03M,face_bbox_landmark_dets,输入640*6401
FaceNet17191.2fps0.3fp16,参数量22.38M,FaceNet,输入160*1601
fairface12772.6fps0.25fp16,参数量20.3M,fairface,输入224*2241
fairface_resnet341607.14fps0.0006fp16,参数量20.3M,fairface_resnet34,输入224*2241
fairmot207.4fps0.16fp16,参数量4.77M,fairmot,输入608*10881
fast_reid1953.9fps0.0095fp16,参数量22.41M,fast_reid,输入256*1281
fer231.15fps0.0043fp16,参数量19.1M,fer,输入44*441
FCOS3111.6fps0.16fp16,参数量30.85M,FCOS,输入800*12161
GhostNet2354.74fps0.0004fp16,参数量4.93M,GhostNet,输入224*2241
GLEAN16.2fps1.54fp16,参数量151.61M,GLEAN,输入32x321
goods_tag_fashion_gender6448.1fps0.08fp16,参数量22.47M,goods_tag_fashion_gender,输入224*2241
hastag547.9fps0.94fp16,参数量23.97M,hastag,输入256*2561
HarDNet 39DS13008.37fps0.0001fp16,参数量3.34M,HarDNet 39DS,输入224*2241
HarDNet 684702.6fps0.0002fp16,参数量16.76M,HarDNet 68,输入224*2241
HarDNet 852547.6fps0.0004fp16,参数量34.99M,HarDNet 85,输入224*2241
hotsoon_live_v6_turbo7155.2fps0.07fp16,参数量22.47M,v6,输入224*2241
hotsoon_live_v8726.6fps0.78fp16,参数量22.55M,v8,输入256*2561
HRNet-W183383.43fps0.0003fp16,参数量20.36M,HRNet-W18,输入224*2241
HRNetV2-W44422.39fps0.0024fp16,参数量63.91M,HRNetV2-W44,输入224*2241
HRNet-ResNet50854.85fps0.0012fp16,参数量32.4M,HRNet_pose_resnet50,输入384*2881
Inception-v33118.11fps0.0003fp16,参数量22.72M,v3,输入299*2991
Lightweight OpenPose1061.62fps0.0009fp16,参数量3.89M,Lightweight OpenPose,输入368*5121
MobileNetV29132.14fps0.0001fp16,参数量5.8M,v2,输入224*2241
MobileNetv311828.36fps0.0001fp16,参数量2.41M,v3,输入224*2241
MobileNetV3 large13191.29fps0.0001fp16,参数量5.22M,V3,输入224*2241
MobileNetV3 small13963.4fps0.0001fp16,参数量2.42M,V3,输入224*2241
model_goods_search159.5fps3.2fp16,参数量84.08M,model_goods_search,输入224*2241
model_goods_universal
_emb_v6_serving
6370.4fps0.09fp16,参数量22.72M,v6,输入224*2241
mp_cls3_fpn534.5fps0.96fp16,参数量25.04M,mp_cls3_fpn,输入224*2241
multi_task_resnet3262fps0.18fp16,参数量22.41M,multi_task_resnet,输入320*3201
pose_hrnet_w321975.7fps0.02fp16,参数量27.19M,pose_hrnet_w32,输入512*5121
pose_hrnet_w48389.61fps0.0026fp16,参数量60.61M,pose_hrnet_w48,输入384*2881
PP-LCNet-0.25x14307.58fps0.0001fp16,参数量1.45M,0.25x,输入224*2241
PP-LCNet-0.35x14462.53fps0.0001fp16,参数量1.57M,0.35x,输入224*2241
PP-LCNet-0.5x14603.74fps0.0001fp16,参数量1.8M,0.5x,输入224*2241
PP-LCNet-0.75x14302.47fps0.0001fp16,参数量2.26M,0.75x,输入224*2241
PP-LCNet-1.0x14011.8fps0.0001fp16,参数量2.83M,1.0x,输入224*2241
PP-LCNet-1.5x12983.27fps0.0001fp16,参数量4.31M,1.5x,输入224*2241
PP-LCNet-2.0x12537.93fps0.0001fp16,参数量6.24M,2.0x,输入224*2241
PP-LCNet-2.5x9184.52fps0.0001fp16,参数量8.63M,2.5x,输入224*2241
PSEnet325.3fps1.58fp16,参数量27.39M,PSEnet,输入736*13121
Pseudo-3D632.15fps0.0016fp16,参数量62.75M,Pseudo-3D,输入160*1601
rec_0530_add3048.6fps0.16fp16,参数量1.98M,rec_0530_add,输入32*5121
regnet_quan_hist_mask6477.5fps0.08fp16,参数量8.05M,regnet_quan_hist_mask,输入224*2241
regnet_quan_hist_mask_live5883.5fps0.09fp16,参数量7.98M,regnet_quan_hist_mask_live,输入224*2241
RegNetX-800MF3213.05fps0.0003fp16,参数量6.91M,RegNetX-800MF,输入224*2241
RepVGG245.67fps0.0041fp16,参数量139.29M,RepVGG,输入256*2561
ResNet1015020.8fps0.11fp16,参数量42.52M,resnet101,输入224*2241
ResNet182727.2fps0.0004fp16,参数量11.14M,Resnet18,输入224*2241
ResNet347779.2fps0.07fp16,参数量20.79M,resnet34,输入224*2241
ResNet508443.2fps0.07fp16,参数量24.35M,V2,输入224*2241
resnet50_fcnn40.3fps6.31fp16,参数量15.52M,resnet50_fcnn,输入832*8321
resnet50_hotsoon280.9fps1.82fp16,参数量33.84M,resnet50_hotsoon,输入224*2241
ResNet50_v1p56638.99fps0.0002fp16,参数量24.35M,v1.5,输入224*2241
ResNet50_v26423.27fps0.0002fp16,参数量24.41M,v2,输入224*2241
resnet50-torchvision-v0_10_07287.74fps0.0001fp16,参数量24.35M,v0,输入224*2241
ResNeXt501238.72fps0.0008fp16,参数量23.84M,ResNeXt50,输入224*2241
RetinaFace_ResNet5051.8fps0.35fp16,参数量26.0M,RetinaFace_ResNet50,输入1024*10241
RetinNnet_ResNet50_FPN10931.1fps0.05fp16,参数量36.18M,RetinNnet_ResNet50_FPN,输入640*6401
SENet413.57fps0.0024fp16,参数量116.38M,SENet,输入224*2241
SEResNeXt988.8fps0.51fp16,参数量24.31M,SEResNeXt,输入320*3201
SE_ResNeXt1011873.24fps0.0005fp16,参数量46.82M,ResNeXt101,输入224*2241
SE_ResNeXt502468.61fps0.0004fp16,参数量26.35M,ResNeXt50,输入224*2241
SE_ResNet1810986.57fps0.0001fp16,参数量11.26M,ResNet18,输入224*2241
SE_ResNet347313.26fps0.0001fp16,参数量20.98M,ResNet34,输入224*2241
SE_ResNet503583.2fps0.0003fp16,参数量26.86M,ResNet50,输入224*2241
ShuffleNet V212542.61fps0.0001fp16,参数量2.17M,v2,输入224*2241
SlowFast1978.2fps0.03fp16,参数量32.85M,SlowFast,输入224*2241
SqueezeNet1_112884.47fps0.0001fp16,参数量1.18M,SqueezeNet1_1,输入224*2241
SSD3003306.52fps0.0003fp16,参数量6.5M,SSD300,输入300*3001
smoke_hotsoon_live_v25210.1fps0.12fp16,参数量22.47M,v2,输入256*2561
Swin Transformer2030.86fps0.0005fp16,参数量27.48M,Swin Transformer,输入224*2241
tongcheng_081911972.2fps0.04fp16,参数量10.66M,tongcheng,输入224*2241
TSM143.4fps0.08fp16,参数量22.73M,v2,输入256*2561
U-Net 2D1225.71fps0.0008fp16, 参数量7.4M, U-Net 2D,输入256*2561
3D U-Net444.3fps1.15fp16,参数量29.75M,3D U-Net,输入128*1281
VAN5195.72fps0.0002fp16,参数量3.92M,VAN,输入224*2241
VGG111969.73fps0.0006fp16,参数量126.71M,vgg11,输入224*2241
VGG131648.32fps0.0006fp16,参数量126.88M,vgg13,输入224*2241
VGG161447.69fps0.0007fp16,参数量131.95M,vgg16,输入224*2241
VGG191242.05fps0.0008fp16,参数量137.01M,vgg19,输入224*2241
video_jitter227.11fps0.0044fp16,参数量22.47M,video_jitter,输入224*2241
Vision Transformer6464.02fps0.0002fp16,参数量83.78M,Vision Transformer,输入224*2241
Xception1463.08fps0.0007fp16,参数量21.77M,Xception,输入299*2991
YOLOv3500.51fps0.002fp16,参数量7.11M,YOLOv3,输入416*4161
YOLOv5511.87fps0.0002fp16,参数量6.9M,YOLOv5,输入640*4801
YOLOv5242.86fps0.0041fp16,参数量6.91M,YOLOv5,输入768*7681
YOLOv5m284.65fps0.0035fp16,参数量20.19M,YOLOv5m,输入640*6401
YOLOv5s366.32fps0.0027fp16,参数量6.89M,YOLOv5s,输入640*6401
YOLO11s557.41fps0.0018fp16,参数量9.05M,YOLO11s,输入640*6401
YOLO11s-cls3155.16fps0.0003fp16,参数量6.4M,YOLO11s-cls,输入224*2241
YOLO11s-obb170.96fps0.0058fp16,参数量9.32M,YOLO11s-obb,输入1024*10241
YOLO11s-pose686.11fps0.0015fp16,参数量9.5M,YOLO11s-pose,输入640*6401
YOLO11s-seg279.54fps0.0036fp16,参数量9.67M,YOLO11s-seg,输入640*6401
yolox_m378.76fps0.0026fp16,参数量24.13M,yolox_m,输入640*6401
yolov8_n905.46fps0.0011fp16,参数量3.05M,yolov8_n,输入640*6401

自然语言处理

NN吞吐时延单位时延NN说明板卡数
ALBERT2903sps0.17fp16,参数量7.44M,ALBERT,输入长度1281
ALBERT679.72sps0.0015fp16,参数量84.83M,ALBERT,输入长度3841
ALBERT-zh-base3523.07sps0.0003fp16,参数量10.06M,ALBERT-zh-base,输入长度1281
ALBERT-zh-large616.49sps0.0016fp16,参数量15.78M,ALBERT-zh-large,输入长度1281
ALBERT-zh-small13950.31sps0.0001fp16,参数量4.52M,ALBERT-zh-small,输入长度1281
ALBERT-zh-tiny17372.22sps0.0001fp16,参数量3.89M,ALBERT-zh-tiny,输入长度1281
ALBERT-zh-xlarge78.41sps0.0128fp16,参数量1158.93M,xlarge,输入长度1281
BERT3520.76sps0.0003fp16,参数量81.78M,BERT,输入长度1281
BERT1302.85sps0.0008fp16,参数量81.87M,BERT,输入长度2561
BERT537.19sps0.0019fp16,参数量103.75M,BERT,输入长度3841
BERT614.06sps0.0016fp16,参数量82.06M,BERT,输入长度5121
bert_128_mixin_asr
_end2end
1715.3sps0.39fp16,参数量100.98M,bert_128_mixin_asr_end2end,输入长度1281
bert_256_fever_
review_nlp_e2e_eco_rate
666.8sps0.92fp16,参数量141.54M,bert_256_fever_review_nlp_e2e_eco_rate,输入长度2561
bert-base-chinese104.89sps0.0095fp16,参数量97.53M,bert-base-chinese,输入长度3841
BERT-Base602.78sps0.0017fp16,参数量103.75M,BERT-Base,输入长度3841
BERT-Base3581.6sps0.04fp16,参数量98.66M,BERT-base,输入长度1281
bert_base_chinese1137.2sps0.07fp16,参数量96.97M,bert_base_chinese,输入长度2561
BERT-BiLSTM233.6sps0.24fp16,参数量506.74M,BERT-BiLSTM,输入长度6001
bert_classify_45_huggingface4090.5sps0.13fp16,参数量104.09M,bert_classify_45_huggingface,输入长度451
BERT-Large139.22sps0.0072fp16,参数量318.62M,BERT-Large,输入长度3841
bge-base-zh-v1.5113.89sps0.0088fp16,参数量97.53M,bge-base-zh-v1.5,输入长度3841
bge_m339.93sps0.025fp16,参数量541.45M,bge-m3,输入长度1281
bge-reranker-base108.83sps0.0092fp16,参数量265.16M,bge-reranker-base,输入长度5121
bge_small_zh_v1p5363.28sps0.0028fp16,参数量22.84M,bge_small_zh_v1p5,输入长度3841
bvrbert20307sps0.03fp16,参数量76.96M,bvrbert,输入长度321
ChineseBERT-wwm-ext757.2sps0.06fp16,参数量96.97M,ChineseBERT-wwm-ext,输入长度5121
chinese-roberta-wwm-ext97.26sps0.0103fp16,参数量97.53M,chinese-roberta-wwm-ext,输入长度5121
chinese_roberta_wwm
_ext_large
30.97sps0.0323fp16,参数量310.44M,chinese_roberta_wwm_ext_large,输入长度5121
Chinese-XLNet-base118sps0.13fp16,参数量112.69M,Chinese-XLNet-base,输入长度5121
DeBERTa_lay61442.3sps0.36fp16,参数量138.88M,DeBERTa_lay6,输入长度641
DistilBERT1185.61sps0.008fp16,参数量63.2M,DistilBERT,输入长度3841
Erlangshen-SimCSE-110M-Chinese479.32sps0.0021fp16,参数量97.53M,Erlangshen-SimCSE-110M-Chinese,输入长度2561
model_politics_scorer_643370.4sps0.19fp16,参数量100.98M,model_politics_scorer_64,输入长度1281
model_video_porn_scorer_326865sps0.07fp16,参数量100.98M,model_video_porn_scorer_32,输入长度1281
RoBERTa492.8sps0.0002fp16,参数量118.31M,RoBERTa,输入长度3841
RoBERTa-zh118.1sps0.27fp16,参数量314.95M,RoBERTa-zh,输入长度2561
RoFormer213.17sps0.0047fp16,参数量28.47M,RoFormer,输入长度10241
rtcbert1418.8sps0.41fp16,参数量141.54M,rtcbert,输入长度1281
t5_small_decoder18.57sps0.0538fp16,参数量59.38M,t5_small_decoder,输入长度5121
t5_small_encoder228.04sps0.0044fp16,参数量33.69M,t5_small_decoder,输入长度5121
video_medical_mm588.2sps1.11fp16,参数量58.55M,video_medical_mm,输入长度1281
XLNet1242.4sps0.0008fp16,参数量88.43M,XLNet,输入长度1281
bce_embedding_base_v196.32sps0.0104fp16,参数量265.16M,v1,输入长度5121
bce_reranker_base_v1103.89sps0.0096fp16,参数量265.16M,v1,输入长度5121
bge_reranker_v2_m339.72sps0.0252fp16,参数量541.45M,v2,输入长度1281
gte-multilingual-reranker-base71.2sps0.014fp16,参数量291.79M,base,输入长度5121

光学字符识别

NN吞吐时延单位时延NN说明板卡数
AttentionOCR13804.8fps0.04fp16,参数量7.57M,AttentionOCR,输入32*1721
ch_PP-OCRv4_server_rec469.95fps0.0021fp16,参数量21.56M,ch_PP-OCRv4_server_rec,输入48*3201
CRNN13824fps0.04fp16,参数量7.94M,CRNN,输入32*1001
crnn_r34_ppocr11503.2fps0.35fp16,参数量23.37M,crnn_r34_ppocr,输入32*1001
DBNet-MobileNetV3212.5fps0.17fp16,参数量1.61M,DBNet-MobileNetV3,输入736*12801
DBNet-ResNet50_vd27.2fps2.3fp16,参数量24.15M,DBNet-ResNet50_vd,输入736*12801
ocr_decoder2585.63fps0.0004fp16,参数量17.31M,ocr_decoder,输入128*5121
ocr_encoder1283.5fps0.4fp16,参数量20.63M,ocr_encoder,输入32*5121
PaddleOCRCnRec121121.50fps0.0042fp16,参数量2.53M,PaddleOCRCnRec,输入48*3201
ch_PP-OCRv4_server
_det_modified
33.27fps0.0301fp16,参数量27.01M,ch_PP-OCRv4_server_det_modified,输入384*5121

搜索推荐

NN吞吐时延单位时延NN说明板卡数
ctr_base15016461440pps0.34fp16,参数量9.04M,ctr_base15016461
cvr_pack_dcn_mmcn147838.8pps0.23fp16,参数量19.23M,cvr_pack_dcn_mmcn1
cypher_cvr_b1582402513.2pps1.12fp16,参数量3.35M,cypher_cvr_b15824021
cypher_norbert_send
_seq_iw_afs_r1765296_0
269.5pps0.95fp16,参数量281.87M,cypher_norbert_send_seq_iw_afs_r1765296_01
cypher_realtime421.5pps1.28fp16,参数量3.24M,cypher_realtime1
deep_interest96366.68pps0.0001fp16,参数量4.05M,deep_interest1
DeepFM951944.90pps0.0344fp16,参数量0.01M,DeepFM1
DFN2336814.30pps0.0018fp16,参数量0.27M,DFN1
DF_debias6080.1pps5.84fp16,参数量4.41M,DF_debias1
DLRM127551.24pps0.0001fp16,参数量1.26M,DLRM1
experience_model_split5383.5pps0.0951fp16,参数量6.22M,experience_model_split1
gip_cypher_ltr365.7pps1.4fp16,参数量6.78M,gip_cypher_ltr1
ipnn205356.29pps0.0001fp16,参数量25.39M,ipnn1
kpnn232349.52pps0.0001fp16,参数量25.4M,kpnn1
mmoe_large210969.63pps0.0001fp16,参数量0.08M,mmoe_large1
mmoe_XL208351.22pps0.0001fp16,参数量0.08M,mmoe_XL1
NCF136651.16pps0.0001fp16,参数量0.37M,NeuralCF1
opnn150868.48pps0.0001fp16,参数量25.39M,opnn1
preclk_sail1598.4pps21.16fp16,参数量8.52M,preclk_sail1
recall_base8744.20pps0.0585fp16,参数量1.9M,recall_base1
recall_ctr_base6215.4pps5.2720fp16,参数量1.91M,recall_ctr_base1
rough339134.40pps0.2144fp16,参数量0.09M,rough1
sail_cypher_ctr_aid
_realtime_b1585413
484.6pps1.09fp16,参数量5.03M,sail_cypher_ctr_aid_realtime_b15854131
sail_model28417pps1.28fp16,参数量358.6M,sail_model1
search_ctr3282202.60pps0.0079fp16,参数量7.39M,search_ctr1
st_interactive4606.6pps0.1465fp16,参数量7.25M,st_interactive1
staytime2006867.10pps0.0163fp16,参数量1.43M,staytime1
WDL122245.41pps0.0001fp16,参数量2.27M,WDL,输入532482,2048131

语音

NN吞吐时延单位时延NN说明板卡数
conformer_speech_large204.4fps2.5fp16,参数量193.01M,large,输入1078*801
conformer_speech_medium640.22fps0.0016fp16,参数量65.3M,medium,输入1078*801
conformer_speech_small943.34fps0.0011fp16,参数量30.39M,small,输入1078*801
ECAPA-TDNN315.80fps0.2fp16,参数量19.84M,ECAPA-TDNN,输入640*801
ECAPA-TDNN_s400691.90fps0.09fp16,参数量19.84M,ECAPA-TDNN_s400,输入400*801
whisper_small_decoder10.34fps0.0967fp16,参数量184.45M,whisper_small_decoder,输入1500*7681
whisper_small_encoder16.92fps0.0591fp16,参数量84.07M,whisper_small_encoder,输入80*30001

多模态

NN吞吐时延单位时延NN说明板卡数
METER126.13fps0.0079fp16,参数量56.89M,METER,输入图片240*768,序列长度2401
videobert_t1125.5fps0.45fp16,参数量169.61M,videobert_t,输入256*2561
videobert_v598.8fps0.23fp16,参数量22.64M,videobert_v,输入256*2561

强化学习

NN吞吐时延单位时延NN说明板卡数
A3C5552.42fps0.0002fp16,参数量2.9M,A3C,输入42*421
DQN6281.69fps0.0002fp16,参数量1.61M,videobert_v,输入84*841

LLM

NN板卡数Batch Size输入长度输出长度首字延迟(s)吞吐(tokens/s)总延迟(s)
Baichuan2-7B-Base212562560.11510.5823.741
Baichuan2-7B-Chat212562560.11410.5324.318
chatglm2-6b212562560.09712.8119.979
chatglm3-6b212562560.09613.0819.567
chatglm3-6b-32k212561500.09812.7911.73
chatglm3-6b-base212562000.09712.8315.591
DeepSeek-R1-Distill-Llama-8B212562560.11410.7623.803
DeepSeek-R1-Distill-Llama-70B1612562561.0054.4257.956
DeepSeek-R1-Distill-Qwen-1.5B112562560.0520.912.25
DeepSeek-R1-Distill-Qwen-14B412562560.2999.3327.444
DeepSeek-R1-Distill-Qwen-32B812562560.4637.1335.899
DeepSeek-R1-Distill-Qwen-7B212562560.11310.424.622
ERNIE-4.5-21B-A3B-PT412562560.3068.5730.608
glm-4-9b-chat212562560.1379.1428.008
internlm2-chat-20b412562560.3657.9132.361
internlm2-chat-7b212562560.11211.0123.256
Llama2-Chinese-7b-Chat212562000.11111.3317.657
Llama-2-13b412562560.26510.624.149
Llama3-Chinese-8B-Instruct212561500.11310.7713.933
Marco-o1212562560.11410.3724.679
Meta-Llama-3.1-8B-Instruct212562560.11410.7723.777
Meta-Llama-3-8B212562560.11510.7423.835
MiniCPM-1B-sft-bf16112562560.04724.1210.612
Mixtral-8x7B-Instruct-v0.1812562560.24814.8517.24
moss-moon-003-sft412562560.38.8728.86
Phi-2112562560.07114.218.023
Qwen1.5-1.8B-Chat112562560.04522.6111.321
Qwen1.5-14B-Chat412562560.27210.0625.442
Qwen1.5-14B-Chat-w8a16412562560.25610.9223.438
Qwen1.5-14B-Chat-w8a8412562560.21813.6118.812
Qwen1.5-32B-Chat812562560.4666.9936.6
Qwen1.5-7B-Chat212562560.11710.2824.902
Qwen2-0.5B-Instruct112562560.024435.954
Qwen2-1.5B-Instruct112562560.05120.0812.747
Qwen2-57B-A14B-Instruct-w8a16812562560.6559.0928.176
Qwen2-7B-Instruct212562560.11510.3924.644
Qwen2-7B-Instruct-w8a8212562560.0891418.289
Qwen2-72B-Instruct1612562560.9484.9851.427
Qwen-7B-Chat212562560.11620.8224.588
Qwen2.5-0.5B-Instruct112562560.02344.975.693
Qwen2.5-7B-Instruct212562560.11510.225.099
Qwen2.5-14B-Instruct412562560.3019.5926.686
Qwen2.5-Coder-7B-Instruct212562560.11210.3124.83
Qwen2.5-Math-7B-Instruct212562560.11310.3724.676
QwQ-32B812562560.4647.1835.665
TeleChat-12B-v2212562560.3166.9137.027
XVERSE-13B-Chat412562560.26810.2325.024
Yi-1.5-34B-Chat812562560.5217.1935.585
Yi-34B-Chat812562560.5197.3434.859
Qwen3-8B212562560.1249.9725.68
Qwen3-14B412562560.328.131.591
Qwen3-32B812562560.4617.1435.833
Jiuzhou-7B212562560.11410.324.844