Making LLMs Better at Long Code Translation

Improving accuracy through instrumentation and state alignment

This research addresses a key limitation of Large Language Models: translation between programming languages becomes less reliable as the source code grows longer and more complex.

Key Innovations:

  • Uses program instrumentation to capture execution behavior
  • Aligns program states between source and target code
  • Significantly improves translation accuracy for longer, more complex code
  • Enables better preservation of functionality across languages
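
To make the first two points concrete, here is a minimal sketch of instrumentation and state alignment. It is an illustration under stated assumptions, not the paper's implementation: the helper names (`make_probe`, `run_with_trace`, `first_divergence`) and the toy functions are hypothetical, and both versions are written in Python so the example stays self-contained. In a real translation setting the source and target would be in different languages and the recorded states would be serialized to a common format before comparison.

```python
import copy
from typing import Any, Callable, Dict, List, Optional, Tuple

# A trace is an ordered list of (checkpoint label, variable snapshot) pairs.
Trace = List[Tuple[str, Dict[str, Any]]]


def make_probe(trace: Trace) -> Callable[..., None]:
    """Return a hypothetical probe function that records program state."""
    def probe(label: str, **state: Any) -> None:
        # Deep-copy so later mutation of the variables cannot alter the record.
        trace.append((label, copy.deepcopy(state)))
    return probe


def source_sum_of_squares(xs: List[int], probe: Callable[..., None]) -> int:
    """Stand-in for the instrumented source program."""
    total = 0
    for i, x in enumerate(xs):
        total += x * x
        probe("loop", i=i, total=total)
    probe("end", total=total)
    return total


def translated_sum_of_squares(xs: List[int], probe: Callable[..., None]) -> int:
    """Stand-in for an instrumented translation with a deliberate bug."""
    total = 0
    for i, x in enumerate(xs):
        total += x * 2  # bug: doubles instead of squaring
        probe("loop", i=i, total=total)
    probe("end", total=total)
    return total


def run_with_trace(fn: Callable[..., int], xs: List[int]) -> Trace:
    """Run an instrumented function and return the states it recorded."""
    trace: Trace = []
    fn(xs, make_probe(trace))
    return trace


def first_divergence(a: Trace, b: Trace) -> Optional[int]:
    """Index of the first checkpoint where two traces differ, or None."""
    for idx, (state_a, state_b) in enumerate(zip(a, b)):
        if state_a != state_b:
            return idx
    if len(a) != len(b):
        return min(len(a), len(b))
    return None


if __name__ == "__main__":
    inputs = [0, 2, 3]
    src = run_with_trace(source_sum_of_squares, inputs)
    tgt = run_with_trace(translated_sum_of_squares, inputs)
    idx = first_divergence(src, tgt)
    if idx is not None:
        print("first divergence at checkpoint", idx)
        print("source:", src[idx])
        print("target:", tgt[idx])
```

Comparing states checkpoint by checkpoint, rather than only the final return value, points to where in a long program the translation first goes wrong. That location could then be fed back to the model as a targeted repair hint.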

Engineering Impact: This approach helps software teams migrate codebases between languages more reliably, reducing errors during cross-platform development and modernization projects.

Enhancing LLMs in Long Code Translation through Instrumentation and Program State Alignment
