사업 운영EP11
📜법무
Legal (contracts·family)
내용 — 측정하는 AI 능력
- · 한국 법령 인용 정확도
- · 사실관계 추론·위험 식별
- · 계약 조항·민법 구조화
채점자 editor · max_tokens 32768 · temp 0.7 · attempts 3 · reasoning_effort medium
| 모델 | | | | | | | |
|---|
| 5/5 | 91 | 85 | 86 | 91 | 91 | 89.4 |
| 5/5 | 88 | 87 | 87 | 92 | 89 | 89.2 |
| 5/5 | 88 | 84 | 86 | 88 | 90 | 87.8 |
| 5/5 | 84 | 91 | 87 | 82 | 94 | 86.0 |
| 5/5 | 87 | 83 | 83 | 87 | 85 | 85.6 |
6Gemini 3.1 Pro | 5/5 | 86 | 83 | 83 | 87 | 84 | 85.4 |
7Gemini 3.5 Flash | 5/5 | 86 | 82 | 82 | 86 | 85 | 85.0 |
| 5/5 | 85 | 82 | 82 | 85 | 87 | 85.0 |
| 5/5 | 85 | 83 | 83 | 86 | 85 | 85.0 |
| 5/5 | 83 | 83 | 82 | 83 | 85 | 83.4 |
| 5/5 | 84 | 81 | 82 | 84 | 82 | 83.0 |
| 5/5 | 80 | 80 | 80 | 80 | 88 | 81.2 |
13Gemma 4 12B | 5/5 | 78 | 85 | 82 | 78 | 85 | 80.4 |
14DeepSeek V4 Flash | 5/5 | 81 | 79 | 77 | 81 | 80 | 80.2 |
| 5/5 | 81 | 78 | 79 | 81 | 80 | 80.0 |
| 5/5 | 78 | 80 | 80 | 79 | 82 | 79.6 |
| 5/5 | 77 | 88 | 82 | 74 | 88 | 79.6 |
18Gemini 3.1 Flash Lite | 5/5 | 80 | 78 | 78 | 80 | 76 | 79.0 |
19Gemma 4 26B A4B | 5/5 | 80 | 78 | 78 | 80 | 79 | 79.0 |
20Gemma 4 31B | 5/5 | 80 | 77 | 77 | 80 | 79 | 78.8 |
| 5/5 | 71 | 77 | 74 | 72 | 80 | 73.4 |
| 5/5 | 71 | 76 | 74 | 71 | 77 | 73.0 |
| 5/5 | 60 | 80 | 60 | 74 | 100 | 73.0 |
24Nemotron 3 Ultra 550B | 5/5 | 58 | 83 | 65 | 66 | 91 | 69.6 |
25HyperCLOVAX SEED Think 32B | 5/5 | 60 | 80 | 60 | 64 | 80 | 66.4 |
| 5/5 | 46 | 74 | 60 | 60 | 92 | 62.8 |
| 5/5 | 52 | 82 | 64 | 56 | 79 | 61.8 |
28Mistral Small 4 | 5/5 | 53 | 64 | 57 | 52 | 64 | 56.0 |
29Gemma 4 E2B | 5/5 | 45 | 53 | 42 | 41 | 54 | 45.2 |
30Kanana 2 30B-A3B Thinking | 5/5 | 32 | 50 | 50 | 38 | 60 | 42.6 |
31HyperCLOVAX SEED 1.5B | 5/5 | 34 | 46 | 33 | 30 | 43 | 35.2 |
| 5/5 | 29 | 45 | 27 | 25 | 45 | 31.2 |
Claude Sonnet 4.6Anthropic
958085959592
Claude Opus 4.8Anthropic
908585959090
GPT-5.5OpenAI
908285909289
MiniMax M3Minimax
909288909591
DeepSeek V4 ProDeepSeek
888080888586
Gemini 3.1 ProGoogle
888082888285
Gemini 3.5 FlashGoogle
887880888585
Mimo V2.5 ProXiaomi
857880858584
Qwen 3.7 MaxAlibaba
858080868284
Kimi K2.6Moonshot
868082868585
GPT-5.4 MiniOpenAI
857880858283
Qwen 3.7 PlusAlibaba
8080808010083
Gemma 4 12BGoogle
808482808682
DeepSeek V4 FlashDeepSeek
857575858082
Step 3.7 FlashStepFun
859085829085
Gemini 3.1 Flash LiteGoogle
827575827579
Gemma 4 26B A4BGoogle
787272787576
Gemma 4 31BGoogle
787272767575
Qwen 3.6 35B A3BAlibaba
827880828281
Qwen 3.6 27BAlibaba
727575747874
EXAONE 4.5 33BLG AI
6080608010075
Nemotron 3 Ultra 550BNVIDIA
608270809576
HyperCLOVAX SEED Think 32BNaver
608060608065
Solar Pro 3Upstage
407060608059
Qwen 3.5 9BAlibaba
558270587063
Mistral Small 4Mistral
707072706870
Gemma 4 E2BGoogle
465343415646
Kanana 2 30B-A3B ThinkingKakao
406050406046
HyperCLOVAX SEED 1.5BNaver
324432274133
LFM2.5 8B-A1BLiquid AI
384936365240
Claude Sonnet 4.6Anthropic
908282909088
Claude Opus 4.8Anthropic
958888959092
GPT-5.5OpenAI
888282889087
MiniMax M3Minimax
729085709278
DeepSeek V4 ProDeepSeek
858080858283
Gemini 3.1 ProGoogle
908282908287
Gemini 3.5 FlashGoogle
908282908587
Mimo V2.5 ProXiaomi
908583909088
Qwen 3.7 MaxAlibaba
838080848282
Kimi K2.6Moonshot
908585928889
GPT-5.4 MiniOpenAI
858080858283
Qwen 3.7 PlusAlibaba
808080808080
Gemma 4 12BGoogle
768480768479
DeepSeek V4 FlashDeepSeek
827878828081
Step 3.7 FlashStepFun
788882758880
Gemini 3.1 Flash LiteGoogle
787575787577
Gemma 4 26B A4BGoogle
827878827880
Gemma 4 31BGoogle
827878827880
Qwen 3.6 35B A3BAlibaba
828080848282
Qwen 3.6 27BAlibaba
667470667469
EXAONE 4.5 33BLG AI
6080608010075
Nemotron 3 Ultra 550BNVIDIA
608470729073
HyperCLOVAX SEED Think 32BNaver
608060608065
Solar Pro 3Upstage
5080606010066
Qwen 3.5 9BAlibaba
628572708071
Mistral Small 4Mistral
506555506054
Gemma 4 E2BGoogle
526048486252
Kanana 2 30B-A3B ThinkingKakao
407060506052
HyperCLOVAX SEED 1.5BNaver
415139394942
LFM2.5 8B-A1BLiquid AI
425440405444
Claude Sonnet 4.6Anthropic
908585929089
Claude Opus 4.8Anthropic
888888908889
GPT-5.5OpenAI
838282848283
MiniMax M3Minimax
889088859288
DeepSeek V4 ProDeepSeek
858282858284
Gemini 3.1 ProGoogle
858282868284
Gemini 3.5 FlashGoogle
828080818081
Mimo V2.5 ProXiaomi
858283858685
Qwen 3.7 MaxAlibaba
888585888687
Kimi K2.6Moonshot
767876748076
GPT-5.4 MiniOpenAI
807880807880
Qwen 3.7 PlusAlibaba
808080808080
Gemma 4 12BGoogle
768480768479
DeepSeek V4 FlashDeepSeek
808078827880
Step 3.7 FlashStepFun
688578658573
Gemini 3.1 Flash LiteGoogle
807878807679
Gemma 4 26B A4BGoogle
807878807879
Gemma 4 31BGoogle
807878807879
Qwen 3.6 35B A3BAlibaba
506860527458
Qwen 3.6 27BAlibaba
627268627466
EXAONE 4.5 33BLG AI
6080607010072
Nemotron 3 Ultra 550BNVIDIA
458045428854
HyperCLOVAX SEED Think 32BNaver
608060808072
Solar Pro 3Upstage
5080807010072
Qwen 3.5 9BAlibaba
507862487858
Mistral Small 4Mistral
355545355842
Gemma 4 E2BGoogle
374936334738
Kanana 2 30B-A3B ThinkingKakao
406060606055
HyperCLOVAX SEED 1.5BNaver
324432304234
LFM2.5 8B-A1BLiquid AI
203818143822
Claude Sonnet 4.6Anthropic
908888909090
Claude Opus 4.8Anthropic
808888888886
GPT-5.5OpenAI
908890909290
MiniMax M3Minimax
829288809585
DeepSeek V4 ProDeepSeek
888585888687
Gemini 3.1 ProGoogle
838582848484
Gemini 3.5 FlashGoogle
868585878886
Mimo V2.5 ProXiaomi
828282828683
Qwen 3.7 MaxAlibaba
868686868886
Kimi K2.6Moonshot
788582788680
GPT-5.4 MiniOpenAI
858585858685
Qwen 3.7 PlusAlibaba
808080808080
Gemma 4 12BGoogle
788684788681
DeepSeek V4 FlashDeepSeek
757875767876
Step 3.7 FlashStepFun
708580688575
Gemini 3.1 Flash LiteGoogle
808080807880
Gemma 4 26B A4BGoogle
787878788078
Gemma 4 31BGoogle
807878808080
Qwen 3.6 35B A3BAlibaba
657872668070
Qwen 3.6 27BAlibaba
788080788079
EXAONE 4.5 33BLG AI
6080606010068
Nemotron 3 Ultra 550BNVIDIA
508460589064
HyperCLOVAX SEED Think 32BNaver
608060608065
Solar Pro 3Upstage
406040408048
Qwen 3.5 9BAlibaba
358042388048
Mistral Small 4Mistral
305035305536
Gemma 4 E2BGoogle
394937364940
Kanana 2 30B-A3B ThinkingKakao
204040206031
HyperCLOVAX SEED 1.5BNaver
254125203827
LFM2.5 8B-A1BLiquid AI
11351373516
Claude Sonnet 4.6Anthropic
888888889088
Claude Opus 4.8Anthropic
888885909089
GPT-5.5OpenAI
908890909290
MiniMax M3Minimax
889288849588
DeepSeek V4 ProDeepSeek
888686888888
Gemini 3.1 ProGoogle
868686878887
Gemini 3.5 FlashGoogle
858585868686
Mimo V2.5 ProXiaomi
858484858685
Qwen 3.7 MaxAlibaba
858585868686
Kimi K2.6Moonshot
868686878887
GPT-5.4 MiniOpenAI
848484848484
Qwen 3.7 PlusAlibaba
8080808010083
Gemma 4 12BGoogle
788682788681
DeepSeek V4 FlashDeepSeek
828280828282
Step 3.7 FlashStepFun
859085829085
Gemini 3.1 Flash LiteGoogle
808080807880
Gemma 4 26B A4BGoogle
828282828482
Gemma 4 31BGoogle
808080808280
Qwen 3.6 35B A3BAlibaba
748078748076
Qwen 3.6 27BAlibaba
768078748077
EXAONE 4.5 33BLG AI
6080608010075
Nemotron 3 Ultra 550BNVIDIA
758582809081
HyperCLOVAX SEED Think 32BNaver
608060608065
Solar Pro 3Upstage
5080607010069
Qwen 3.5 9BAlibaba
588572658569
Mistral Small 4Mistral
788080778078
Gemma 4 E2BGoogle
505546465850
Kanana 2 30B-A3B ThinkingKakao
202040206029
HyperCLOVAX SEED 1.5BNaver
404838364640
LFM2.5 8B-A1BLiquid AI
324830284834