Skip to content

Conversation

@ma-hang
Copy link
Contributor

@ma-hang ma-hang commented Jan 14, 2026

9G7B 单卡 max_batch_size=32:
e73679451a5e665aace0317ed4e6546f

9G7B 4卡 max_batch_size=32:
d77ca5b3b22ccccf91360ce278067378

@ma-hang ma-hang requested review from a team, PanZezhong1725 and whjthu January 14, 2026 15:58
@ma-hang ma-hang linked an issue Jan 15, 2026 that may be closed by this pull request
def start(self):
app = self._create_app()
logger.info("Starting API Server...")
uvicorn.run(app, host="0.0.0.0", port=8000)
Copy link
Collaborator

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

把port做一个脚本参数,默认8000

@PanZezhong1725 PanZezhong1725 merged commit c73ff20 into main Jan 20, 2026
@PanZezhong1725 PanZezhong1725 deleted the issue/189 branch January 20, 2026 05:57
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment

Labels

None yet

Projects

None yet

Development

Successfully merging this pull request may close these issues.

[DEV] InfiniLM添加推理服务

3 participants