acronym community-consensus
MQA is most commonly pronounced "em kyoo ay" (/ˌɛm ˌkjuː ˈeɪ/). This is the widely-used reading among engineers, though edge cases exist.
Multi-Query Attention — the single-KV-head attention variant that GQA generalizes; referenced in serving docs like TensorRT-LLM. Initialism read letter-by-letter, not as a word: STRESS final letter, em-kyoo-AY, exactly parallel to GQA. The expansion 'multi-query attention' is never pronounced.
Source: arXiv 2305.13245 (GQA paper, defines/compares MQA)
Pronouncing project and product names correctly avoids the small but persistent friction of being gently corrected during standups, conference Q&As, and team calls. Hearing the word a few times locks in the right reading better than reading IPA ever will. Pronounce is a community-maintained dictionary — every entry tagged with a confidence level and (where possible) a citable source.
Install the CLI: git clone https://github.com/anzy-renlab-ai/pronounce.git && cd pronounce && ./install.sh
This whole page exists because of a community-maintained TSV. If it saved you a cringey moment, drop a star.
★ Star on GitHub