Skip to content

Support to access model gardens using endpoints: rawPredict or streamRawPredict #900

@cmoulliard

Description

@cmoulliard

Description

The Vertex AI documentation describes that we can select one of the different models: open, partner, gemini-family and access them using curl, python and probably this Java SDK (as documented here) but when we use the following code

        Client client = null;
        try {
            GoogleCredentials creds = GoogleCredentials.getApplicationDefault();
            creds.refreshIfExpired();

            client = Client.builder()
                    .credentials(creds)
                    .build();
        } catch (Exception e) {
            throw new RuntimeException(e);
        }

        GenerateContentResponse response =
                client.models.generateContent("anthropic/claude-opus-4-6", "What is your name?", null);

then we got this error

com.google.genai.errors.ClientException: 400 . claude-opus-4-6 is not supported in the generateContent API.
    at com.google.genai.errors.ApiException.throwFromResponse (ApiException.java:94)
    at com.google.genai.HttpApiResponse.getBody (HttpApiResponse.java:37)
    at com.google.genai.Models.processResponseForPrivateGenerateContent (Models.java:5480)
    at com.google.genai.Models.privateGenerateContent (Models.java:5546)
    at com.google.genai.Models.generateContent (Models.java:7076)
    at com.google.genai.Models.generateContent (Models.java:7150)
    at com.google.genai.examples.GenerateContent.main (GenerateContent.java:89)

Suggestion

This Java SDK project is a great library as it allows third-party projects to access the Google Cloud platform and Vertex AI using HTTP(s) with JSON body messages instead of using gRPC.

As the access to one of the model partners like Claude, Mistral, etc relies on the following endpoints

https://LOCATION-aiplatform.googleapis.com/v1/projects/PROJECT_ID/locations/LOCATION/publishers/anthropic/models/MODEL:rawPredict
https://LOCATION-aiplatform.googleapis.com/v1/projects/PROJECT_ID/locations/LOCATION/publishers/anthropic/models/MODEL:streamRawPredict

I suggest that this Java SDK also support them. Here is the API documentation: https://docs.cloud.google.com/vertex-ai/generative-ai/docs/reference/rest/v1/projects.locations.endpoints/rawPredict

FYI: The projects Langchain4j and Quarkus Langchains4j currently use the gRPC code of the following GAV and will be more than happy to use now this SDK with HTTP(S).

<dependency>
            <groupId>com.google.cloud</groupId>
            <artifactId>google-cloud-aiplatform</artifactId>
            <version>3.69.0</version>
        </dependency>

Metadata

Metadata

Assignees

Labels

priority: p3Desirable enhancement or fix. May not be included in next release.type: feature request‘Nice-to-have’ improvement, new feature or different behavior or design.

Type

No type

Projects

No projects

Milestone

No milestone

Relationships

None yet

Development

No branches or pull requests

Issue actions