Türkçe Dil Modellerinin Performans
  Karşılaştırması Performance Comparison of Turkish Language
  Models

Eren Dogan; M. Egemen Uzun; Atahan Uz; H. Emre Seyrek; Ahmed Zeer; Ezgi Sevi; H. Toprak Kesgin; M. Kaan Yuce; M. Fatih Amasyali

Türkçe Dil Modellerinin Performans Karşılaştırması Performance Comparison of Turkish Language Models

Eren Dogan; M. Egemen Uzun; Atahan Uz; H. Emre Seyrek; Ahmed Zeer; Ezgi Sevi; H. Toprak Kesgin; M. Kaan Yuce; M. Fatih Amasyali

arXiv 2024

21

amasyali2024trke

Abstract

The developments that language models have provided in fulfilling almost all kinds of tasks have attracted the attention of not only researchers but also the society and have enabled them to become products. There are commercially successful language models available. However, users may prefer open-source language models due to cost, data privacy, or regulations. Yet, despite the increasing number of these models, there is no comprehensive comparison of their performance for Turkish. This study aims to fill this gap in the literature. A comparison is made among seven selected language models based on their contextual learning and question-answering abilities. Turkish datasets for contextual learning and question-answering were prepared, and both automatic and human evaluations were conducted. The results show that for question-answering, continuing pretraining before fine-tuning with instructional datasets is more successful in adapting multilingual models to Turkish and that in-context learning performances do not much related to question-answering performances.

Keywords

cs.ai cs.cl

Access

URL:

http://arxiv.org/abs/2404.17010v1

Citation

ID: 283438

Ref Key: amasyali2024trke

Use this key to autocite in SciMatic or Thesis Manager

References

No Bibliography

Blockchain Verification

Account:

NFT Contract Address:

0x95644003c57E6F55A65596E3D9Eac6813e3566dA

Article ID:

283438

Unique Identifier:

Network:

Scimatic Chain (ID: 481)

Blockchain Readiness Checklist

Authors

Abstract

Journal Name

Year

Title

5/5

Creates 1,000,000 NFT tokens for this article

Token Features:

ERC-1155 Standard NFT
1 Million Supply per Article
Transferable via MetaMask
Permanent Blockchain Record

Scan with Saymatik Web3.0 Wallet

Gas fees required in SCI Coins

Buy SCI

Saymatik Web3.0 Wallet

Google Play

App Store

Coming soon

Reference Key: lastname+year+titlefirstword+journalfirstword

Article Type (Article, Book, Proceedings etc.)

Add a reference in a raw form. Our automatic system will correct it later.