Skip to content

Sunflower grpo inference test#9

Merged
PatrickCmd merged 8 commits intodeployfrom
sunflower_grpo_test
Apr 18, 2026
Merged

Sunflower grpo inference test#9
PatrickCmd merged 8 commits intodeployfrom
sunflower_grpo_test

Conversation

@PatrickCmd
Copy link
Copy Markdown

@PatrickCmd PatrickCmd commented Apr 7, 2026

This pull request introduces a comprehensive development and deployment setup for the sunflower-grpo-inference-vllm project, integrating both a frontend and backend, and providing infrastructure-as-code for Google Cloud Run deployment. The changes include Dockerization, local development tools, a client utility, and a full Terraform configuration for deploying and managing the service and its dependencies in Google Cloud.

The most important changes are:

1. Infrastructure-as-Code for Google Cloud Deployment

  • Added a complete Terraform configuration to provision and manage a Cloud Run service, Artifact Registry repository, required APIs, environment variables, IAM permissions, and outputs for URLs and resources. This setup enables reproducible, configurable, and secure deployment of the app to Google Cloud. [1] [2] [3] [4] [5]

2. Dockerization and Multi-Stage Build

  • Introduced a multi-stage Dockerfile that builds the Next.js frontend, installs Python dependencies using uv, and assembles a minimal runtime image for deployment, supporting efficient builds and separation of concerns.
  • Added a .dockerignore file to ensure only necessary files are included in the Docker build context, improving build performance and security.

3. Local Development and Deployment Tooling

  • Added a Makefile for local development with targets for installing dependencies, building the frontend, serving the app with hot-reload, and cleaning up build artifacts.
  • Provided a separate Makefile in the cloudrun directory for building, pushing, and deploying the Docker image to Cloud Run, as well as for managing Terraform and viewing logs.
  • Added a .python-version file to specify Python 3.12 for development consistency.

4. Client Utility

  • Added a client.py script that allows users to interact with the deployed service via HTTP, streaming, or direct Modal SDK calls, facilitating testing and integration.

5. Cloud Run Proxy Directory Setup

  • Introduced a .gitignore for the cloudrun directory to avoid committing sensitive or transient files related to Terraform and deployment.

These changes establish a robust foundation for local development, containerized deployment, and managed cloud infrastructure for the Sunflower GRPO inference service.This pull request makes a minor update to the modal-deploy/README.md instructions, adjusting the example path for the audio file used in the client script to reflect the correct relative location.

@PatrickCmd PatrickCmd changed the base branch from modal-deploy to deploy April 7, 2026 18:44
@PatrickCmd PatrickCmd force-pushed the sunflower_grpo_test branch from 91d1200 to bce45d4 Compare April 7, 2026 18:56
…ate FastAPI app for production API integration and environment variable management
…date README with new endpoints and usage instructions
- Added package.json for project dependencies and scripts
- Created postcss.config.mjs for PostCSS configuration
- Added tailwind.config.ts for Tailwind CSS setup with custom theme
- Initialized tsconfig.json for TypeScript configuration
@PatrickCmd PatrickCmd merged commit 696d482 into deploy Apr 18, 2026
1 check failed
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment

Labels

None yet

Projects

None yet

Development

Successfully merging this pull request may close these issues.

1 participant