Vosyn interview question

How does a transformer model work(roughly)?