China’s economic growth and improved international status have created an urgent need for high-level international communication capabilities to support national development in the era of AI. The objective of enhancing China’s capabilities in this regard is to cultivate high-level talents who possess a “5+5” set of skills: the five mindsets of “global vision, national standpoint, political acumen, economic thinking, and cultural inclusiveness”, and the five communication abilities of live broadcasting, resource integration, current affairs collection and editing, and translation and interpreting from a foreign language into Chinese and vice versa. To help cultivate international communication talents in the era of AI and put China’s interpreting talents at the center of the international stage, we have constructed the discipline of multimodal interpreting (MI). This research is grounded in the theory of Marxist materialism, General Secretary Xi Jinping’s important thought on international communication, and the theories of interpreting communication studies (ICS) and communication interpreting studies (CIS). Beginning with the practice of interpreting, it reviews the evolution of interpreting studies and proposes a new perspective on the construction of the MI discipline, which integrates “practice-teaching-research” (PTR) with the aim of achieving improved interpreting practice, teaching, and research. This paper not only emphasizes the importance of interpreting practice and teaching but also enriches the theoretical connotations of interpreting studies, ICS, CIS, and embodied-cognitive interpreting studies, expands interpreting research ideas, and enhances international communication capabilities in the era of AI.