DeInfer: Efficient Parallel Inferencing for Decomposed Large Language Models Infos: arXiv, DAC, ACM Digital Library (To be published) Code is under prepration and plan to release around end of May 2026.