[R] Do Llamas Work in English? On the Latent Language of Multilingual Transformers
May 18, 2024, 3:17 p.m. | /u/EternalBlueFriday
Machine Learning www.reddit.com
**Code**: [https://github.com/epfl-dlab/llm-latent-language](https://github.com/epfl-dlab/llm-latent-language)
**Dataset**: [https://huggingface.co/datasets/wendlerc/llm-latent-language](https://huggingface.co/datasets/wendlerc/llm-latent-language)
**Colab links**:
**(1)** [https://colab.research.google.com/drive/1l6qN-hmCV4TbTcRZB5o6rUk\_QPHBZb7K?usp=sharing](https://colab.research.google.com/drive/1l6qN-hmCV4TbTcRZB5o6rUk_QPHBZb7K?usp=sharing)
**(2)** [https://colab.research.google.com/drive/1EhCk3\_CZ\_nSfxxpaDrjTvM-0oHfN9m2n?usp=sharing](https://colab.research.google.com/drive/1EhCk3_CZ_nSfxxpaDrjTvM-0oHfN9m2n?usp=sharing)
**Abstract**:
>We ask whether multilingual language models trained on unbalanced, English-dominated corpora use English as an internal pivot language -- a question of key importance for understanding how language models function and the origins of linguistic bias. Focusing on the Llama-2 family of transformer models, our study uses carefully constructed non-English prompts with a unique correct single-token continuation. From layer to layer, transformers gradually map an input embedding …
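The layer-by-layer mapping the abstract describes is typically probed with a logit-lens-style analysis: decoding an intermediate hidden state through the model's final norm and unembedding matrix to see which tokens (and hence which language) it already encodes. Below is a minimal NumPy sketch of that idea with toy dimensions and hypothetical weights; it illustrates the technique only and is not the paper's actual code (see the linked repo and Colabs for that):

```python
import numpy as np

def logit_lens(hidden_state, rms_gamma, W_U):
    """Decode an intermediate hidden state into a token distribution.

    Applies a Llama-style RMSNorm (with learned scale `rms_gamma`),
    then the unembedding matrix `W_U`, then a softmax.
    All weights here are toy stand-ins, not real model parameters.
    """
    rms = np.sqrt(np.mean(hidden_state ** 2) + 1e-6)
    normed = hidden_state / rms * rms_gamma
    logits = normed @ W_U                      # (vocab,)
    exp = np.exp(logits - logits.max())        # stable softmax
    return exp / exp.sum()

# Toy example: d_model=4, vocab=6 (hypothetical sizes).
rng = np.random.default_rng(0)
h = rng.normal(size=4)          # stand-in for a layer-l residual state
gamma = np.ones(4)              # stand-in for the final RMSNorm scale
W_U = rng.normal(size=(4, 6))   # stand-in for the unembedding matrix
probs = logit_lens(h, gamma, W_U)
```

Running this probe at every layer for a prompt whose correct continuation is a single token lets one track whether the intermediate distributions favor the English translation of that token before converging on the target-language answer.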