Sber / GigaChat
Аудиозапись технического собеседованияТехническое собеседование2026-03-25
Sber / GigaChat: LLM подробный разбор, inference и distributed training
Плотное теоретическое интервью по LLM: BERT vs GPT, sentence embeddings, tokenization, positional embeddings, attention, GQA/SWA, KV cache, long context и DDP all-reduce.
Аудио и материалы
Аудио собеседования
0:00 / 1:06:54
Выводы и как готовиться
- LLM theory interviews often move quickly across architecture, training and inference systems.
- Good answers connect formulas to cost: sequence length, KV cache memory, attention complexity and distributed communication.
- поиск embeddings should be evaluated with поиск metrics, not generic classification accuracy.
