Elsevier interview question

How do transformers work? What is multi-head attention? What is positional encoding?
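As a starting point for the last two parts of the question, here is a minimal sketch in plain Python. It assumes the fixed sinusoidal positional encoding and scaled dot-product attention from the original Transformer paper ("Attention Is All You Need"). The function names are hypothetical, and the learned projection matrices (W_Q, W_K, W_V, W_O) that a real multi-head layer applies are left out, so each head simply attends over its own slice of the input vector.

```python
import math

def positional_encoding(seq_len, d_model):
    """Sinusoidal encoding: even dims use sin, odd dims use cos of the same angle."""
    pe = [[0.0] * d_model for _ in range(seq_len)]
    for pos in range(seq_len):
        for i in range(0, d_model, 2):  # i is the even dimension index
            angle = pos / (10000 ** (i / d_model))
            pe[pos][i] = math.sin(angle)
            if i + 1 < d_model:
                pe[pos][i + 1] = math.cos(angle)
    return pe

def softmax(xs):
    """Numerically stable softmax over a list of floats."""
    m = max(xs)
    exps = [math.exp(x - m) for x in xs]
    total = sum(exps)
    return [e / total for e in exps]

def attention(q, k, v):
    """Scaled dot-product attention: softmax(q . k / sqrt(d_k)) applied to v."""
    d_k = len(k[0])
    out = []
    for qi in q:
        scores = [sum(a * b for a, b in zip(qi, kj)) / math.sqrt(d_k) for kj in k]
        weights = softmax(scores)
        out.append([sum(w * vj[c] for w, vj in zip(weights, v))
                    for c in range(len(v[0]))])
    return out

def multi_head_attention(x, num_heads):
    """Self-attention per head on a slice of each vector, then concatenation.

    Simplification (assumption): no learned projections, so q = k = v = slice.
    """
    d_model = len(x[0])
    assert d_model % num_heads == 0, "d_model must be divisible by num_heads"
    d_head = d_model // num_heads
    heads = []
    for h in range(num_heads):
        part = [row[h * d_head:(h + 1) * d_head] for row in x]
        heads.append(attention(part, part, part))
    # Concatenate the per-head outputs for each position.
    return [sum((heads[h][t] for h in range(num_heads)), [])
            for t in range(len(x))]

# Toy usage: treat the positional encodings themselves as the input sequence.
pe = positional_encoding(seq_len=4, d_model=8)
out = multi_head_attention(pe, num_heads=2)
```

In a full transformer the positional encoding is added to the token embeddings before the first layer, and each attention block is followed by a residual connection, layer normalization, and a position-wise feed-forward network. This sketch covers only the two pieces named in the question.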