Mscc.GenerativeAI

The base class for all models, providing common functionality for API communication.

The base URL for the Google AI API. The base URL for the Vertex AI API. The base URL for the Vertex AI API for global locations.

Gets the HttpClient instance used for making API requests. If an IHttpClientFactory is available, it is used to create the client; otherwise, a default client is created. Creates a default HttpClient with appropriate settings for the target framework and returns the new instance.

Gets or sets the API version to use. Gets or sets the request options for API calls. Gets or sets the name of the model to use. Returns the name of the model.

Sets the API key to use for the request. The value can only be set or modified before the first request is made. The API key is added to the request header if it is available, when using an API key with REST to send to the API.

Sets the access token to use for the request. Sets the project ID to use for the request. The value can only be set or modified before the first request is made.

Gets or sets the region to use for the request. This is used in Vertex AI requests. The default value is "us-central1".

Gets or sets the timespan to wait before the request times out.

A hook to verify whether a specific request is supported by the current model configuration. Throws an exception if the functionality is not supported. Takes the request to verify, typed by the request object.

Initializes a new instance of the class, primarily for Google AI. It configures the model using an API key and environment variables. Optional parameters: the IHttpClientFactory to use for creating HttpClient instances, a logger instance used for logging, and options for API requests.

Initializes a new instance of the class, primarily for Vertex AI. It configures the model using a project ID, region, and access token. Parameters: the Google Cloud project ID; the Google Cloud region; optionally, the model name to use; optionally, the access token for authentication; optionally, the IHttpClientFactory to use for creating HttpClient instances; a logger instance used for logging; and options for the request.

Internal constructor for testing purposes; allows injecting a custom HttpMessageHandler. Optional parameters: the HttpMessageHandler to use for HTTP requests, a logger instance used for logging, and options for API requests.

Parses the URL template and replaces placeholders with their current values. This method supports templates for both Google AI and Vertex AI endpoints. Takes the URL template to parse and the specific API method to append to the URL (e.g., ":generateContent"), and returns the fully resolved URL with all placeholders replaced.

Serializes the request payload to a JSON string representing the request. Deserializes the JSON response from an API call into an object of the requested type, returning a task that resolves to an instance of that type.

Configures and returns the default options for JSON deserialization, including a modifier that adds a snake_case alias for each property. This allows deserialization to work with both camelCase and snake_case property names. Configures and returns the default options for JSON serialization.

Reads credentials from a specified JSON file. This is typically used for reading service account credentials from Google Cloud Platform. Takes the path to the credentials file and returns a credentials object if the file exists and is valid; otherwise, null.

Retrieves an access token from Application Default Credentials (ADC) using the gcloud command-line tool. This method is specific to Google Cloud Platform. Returns the access token as a string, or an empty string if it fails.

Executes an external command-line application, given the command or application to run and optional arguments to pass to it. Returns the standard output from the application. An exception is thrown if the process exits with a non-zero code.
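As a rough usage sketch of the two constructor paths described above (type and member names follow the library's public README; treat the exact identifiers, especially the model constant, as assumptions rather than confirmed API):

```csharp
using Mscc.GenerativeAI;

// Google AI: configured with an API key (which may also be picked up
// from environment variables, as described above).
var googleAi = new GoogleAI(apiKey: "your-api-key");
var model = googleAi.GenerativeModel(model: Model.Gemini15Pro);

// Vertex AI: configured with a project ID and region; if no access token
// is passed, it can be resolved via Application Default Credentials (gcloud).
var vertexAi = new VertexAI(projectId: "your-project-id", region: "us-central1");
var vertexModel = vertexAi.GenerativeModel(model: Model.Gemini15Pro);

var response = await model.GenerateContent("Why is the sky blue?");
Console.WriteLine(response.Text);
```

Both paths converge on the same generative model surface; only the authentication and endpoint resolution differ.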
Formats a command and its arguments for logging purposes, returning a formatted string containing the command and arguments.

Sends a POST request to the specified API endpoint and deserializes the response. Takes the request object to send, the URL template for the API endpoint, the specific API method to call, optional request options (like timeout and retry settings), an optional completion option defining when the HTTP operation should complete, and a cancellation token that can be used by other objects or threads to receive notice of cancellation. Returns a task whose result contains the deserialized response. An exception is thrown if the request is null or if the HTTP response times out.

Sends a POST request to the specified API endpoint and streams the response using Server-Sent Events. Takes the request object to send, the URL template for the API endpoint, the specific API method to call, optional request options (like timeout and retry settings), and a cancellation token. Returns an asynchronous stream of response objects. An exception is thrown if the request is null.

Sends an HTTP request and returns the response. It handles adding authentication headers, user-agent, and custom headers, and it implements a retry mechanism for transient failures. Takes the request message to send, optional request options (including timeout and retry settings), a cancellation token, and a completion option defining when the operation should complete. Returns the HTTP response message from the API.

Truncates the base64 data in the log to avoid blowing up the log file. Takes the JSON string to sanitize and returns the sanitized JSON string.

Asynchronously reads the content of an HTTP response message as a string, returning a task whose result contains the response content as a string.

Creates a deep clone of an HTTP request message. This is necessary for retrying requests, as a request message can only be sent once. Returns a task whose result contains the cloned HTTP request message.

Disposes the client and its underlying resources. Releases the unmanaged resources used by the client and optionally releases the managed resources: true to release both managed and unmanaged resources; false to release only unmanaged resources. Asynchronously disposes the client and its underlying resources, returning a task that represents the asynchronous dispose operation.

Initializes a new instance of the class. Initializes a new instance of the class with an optional IHttpClientFactory to use for creating HttpClient instances and an optional logger instance used for logging.

Lists operations that match the specified filter in the request. If the server doesn't support this method, it returns `UNIMPLEMENTED`. Takes the standard list filter. Optional: the maximum number of operations to return; the service may return fewer than this value, and if unspecified, some default (under maximum) number of items will be returned. The maximum value is 1000; values above 1000 will be coerced to 1000. Optional: a page token, received from a previous `ListOperations` call; provide this to retrieve the subsequent page. When paginating, all other parameters provided to `ListOperations` must match the call that provided the page token. When set to `true`, operations that are reachable are returned as normal, and those that are unreachable are returned in the [ListOperationsResponse.unreachable] field. This can only be `true` when reading across collections, e.g. when `parent` is set to `"projects/example/locations/-"`. This field is not supported by default and will result in an `UNIMPLEMENTED` error if set, unless explicitly documented otherwise in service- or product-specific documentation. Also takes options for the request and a cancellation token.

Gets the latest state of a long-running operation. Clients can use this method to poll the operation result at intervals as recommended by the API service. Required: the name of the operation resource, in the format `batches/{id}`. Takes options for the request and a cancellation token. Returns the long-running operation resource. An exception is thrown when the name is null or empty.

Gets the latest state of a long-running operation. Clients can use this method to poll the operation result at intervals as recommended by the API service. Required: the name of the operation resource, in the format `batches/{id}`. Takes options for the request and a cancellation token. Returns the long-running operation resource. An exception is thrown when the name is null or empty.

Starts asynchronous cancellation on a long-running operation. The server makes a best effort to cancel the operation, but success is not guaranteed. If the server doesn't support this method, it returns `google.rpc.Code.UNIMPLEMENTED`. Clients can use Operations.GetOperation or other methods to check whether the cancellation succeeded or whether the operation completed despite cancellation. On successful cancellation, the operation is not deleted; instead, it becomes an operation with an Operation.error value with a google.rpc.Status.code of `1`, corresponding to `Code.CANCELLED`. Required: the name of the operation resource, in the format `batches/{id}`. If successful, the response body is empty. An exception is thrown when the name is null or empty.
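The polling pattern described for the long-running operation methods can be sketched as follows; the method and property names here are hypothetical illustrations inferred from the descriptions above, not confirmed members of the library, and "batches/123" is a placeholder operation name:

```csharp
using Mscc.GenerativeAI;

// Hypothetical polling loop for a long-running batch operation.
var model = new GoogleAI(apiKey: "your-api-key").GenerativeModel();
var operation = await model.GetOperation(name: "batches/123");
while (operation.Done != true)
{
    // Poll at an interval recommended by the API service.
    await Task.Delay(TimeSpan.FromSeconds(10));
    operation = await model.GetOperation(name: "batches/123");
}
```

Cancellation (CancelOperation) only requests a best-effort stop, so a client would keep polling the same way to learn whether the operation completed or ended with `Code.CANCELLED`.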
Deletes a long-running operation. This method indicates that the client is no longer interested in the operation result; it does not cancel the operation. If the server doesn't support this method, it returns `google.rpc.Code.UNIMPLEMENTED`. Required: the name of the operation resource to be deleted, in the format `batches/{id}`. If successful, the response body is empty. An exception is thrown when the name is null or empty.

Updates a batch of EmbedContent requests for batch processing. Takes the batch resource to update. Required: the name of the operation resource to be updated, in the format `batches/{id}`. Optional: the list of fields to update, options for the request, and a cancellation token. An exception is thrown when the name is null or empty.

Updates a batch of GenerateContent requests for batch processing. Takes the batch resource to update. Required: the name of the operation resource to be updated, in the format `batches/{id}`. Optional: the list of fields to update, options for the request, and a cancellation token. An exception is thrown when the name is null or empty.

Content that has been preprocessed and can be used in subsequent requests to GenerativeService. Cached content can only be used with the model it was created for.

Initializes a new instance of the class. Initializes a new instance of the class with an optional IHttpClientFactory to use for creating HttpClient instances and an optional logger instance used for logging.

Creates a CachedContent resource. Takes the cached content resource to create, options for the request, and a cancellation token. Returns the cached content resource created. An exception is thrown when the request is null.

Creates a CachedContent resource. The minimum input token count for context caching is 32,768, and the maximum is the same as the maximum for the given model. Required: the name of the `Model` to use for cached content, in the format `models/{model}`. Optional: the user-generated meaningful display name of the cached content (maximum 128 Unicode characters); input only, a developer-set system instruction (currently, text only); input only, the content to cache; a chat history to initialize the session with; input only, a new TTL for this resource, a duration in seconds with up to nine fractional digits, ending with 's'; and a timestamp in UTC of when this resource is considered expired (always provided on output, regardless of what was sent on input). Also takes options for the request and a cancellation token. Returns the created cached content resource. An exception is thrown when the model name is null or empty.

Lists CachedContents resources. Optional: the maximum number of cached contents to return; the service may return fewer than this value, and if unspecified, some default (under maximum) number of items will be returned. The maximum value is 1000; values above 1000 will be coerced to 1000. Optional: a page token, received from a previous `ListCachedContents` call; provide this to retrieve the subsequent page. When paginating, all other parameters provided to `ListCachedContents` must match the call that provided the page token. Also takes options for the request and a cancellation token.

Reads a CachedContent resource. Required: the resource name referring to the content cache entry, in the format `cachedContents/{id}`. Returns the cached content resource. An exception is thrown when the name is null or empty.

Updates a CachedContent resource (only expiration is updatable). Takes the cached content resource to update; optionally (input only), a new TTL for this resource, a duration in seconds with up to nine fractional digits, ending with 's'; and optionally, the list of fields to update. Returns the updated cached content resource. An exception is thrown when the resource is null or when the name is null or empty.

Deletes a CachedContent resource. Required: the resource name referring to the content cache entry, in the format `cachedContents/{id}`. If successful, the response body is empty. An exception is thrown when the name is null or empty.

The `ChatModel` class provides methods for interacting with a chat-based generative model. Initializes a new instance of the class. Initializes a new instance of the class with an optional IHttpClientFactory to use for creating HttpClient instances and an optional logger instance used for logging. Generates a set of responses from the model given a chat history input. Required: the request to send to the API; also takes options for the request and a cancellation token. An exception is thrown when the request is null.

Helper class to provide API versions. Helper class to provide model names. Ref: https://cloud.google.com/vertex-ai/generative-ai/docs/learn/model-versioning#latest-version

Possible roles.

Initializes a new instance of the class. Initializes a new instance of the class with a specific message that describes the current exception. Initializes a new instance of the class with a specific message that describes the current exception and an inner exception. Initializes a new instance of the class with the block reason message that describes the current exception.

Represents errors that occur during Generative AI API calls. HTTP response from the API. Initializes a new instance of the class.
Initializes a new instance of the class with a specified error message (the message that describes the error). Initializes a new instance of the class with a specified error message and a reference to the inner exception that is the cause of this exception, or a null reference if no inner exception is specified. Initializes a new instance of the class with a specified error message, the HTTP response, and a reference to the inner exception that is the cause of this exception, or a null reference if no inner exception is specified.

Represents errors that occur during Generative AI API calls. HTTP response from the API. Initializes a new instance of the class. Initializes a new instance of the class with a specified error message (the message that describes the error). Initializes a new instance of the class with a specified error message and a reference to the inner exception that is the cause of this exception, or a null reference if no inner exception is specified. Initializes a new instance of the class with a specified error message, the HTTP response, and a reference to the inner exception that is the cause of this exception, or a null reference if no inner exception is specified.

Initializes a new instance of the class. Initializes a new instance of the class with a specific message that describes the current exception. Initializes a new instance of the class with a specific message that describes the current exception and an inner exception.

Initializes a new instance of the class. Initializes a new instance of the class with a specific message that describes the current exception. Initializes a new instance of the class with a specific message that describes the current exception and an inner exception.

Initializes a new instance of the class. Initializes a new instance of the class with a specific message that describes the current exception. Initializes a new instance of the class with a specific message that describes the current exception and an inner exception. Initializes a new instance of the class with the finish message that describes the current exception.

Extensions for logging invocations. This extension uses the LoggerMessage source generator to generate logging code at compile time to achieve optimized code.

Logs parsing the URL to call: the logger instance used for logging, the HTTP method of the request, and the parsed URL. Logs invoking an API request: the logger instance and the calling method. Several further log methods follow the same pattern, taking the optional logger instance plus their message details. Logs when an exception is thrown while running an external application: the logger instance and the message of the exception to log. Logs when an exception is thrown: the logger instance, the nth attempt of the request sent, and optionally a message to log.

Defines what effect activity has. If unspecified, the default behavior is `START_OF_ACTIVITY_INTERRUPTS`.
If true, start of activity will interrupt the model's response (also called "barge in"). The model's current response will be cut off in the moment of the interruption. This is the default behavior. The model's response will not be interrupted.

Adapter size is unspecified. Adapter size 1. Adapter size 2. Adapter size 4. Adapter size 8. Adapter size 16. Adapter size 32.

The aggregation result for the entire dataset and all metrics: one AggregationResult per metric. The dataset used for evaluation and aggregation.

The aggregation result for a single metric. Aggregation metric. Results for the bleu metric. Result for the code execution metric. Results for the exact match metric. Result for the pairwise metric. Result for the pointwise metric. Results for the rouge metric.

Unspecified aggregation metric. Average aggregation metric; not supported for the pairwise metric. Mode aggregation metric. Standard deviation aggregation metric; not supported for the pairwise metric. Variance aggregation metric; not supported for the pairwise metric. Minimum aggregation metric; not supported for the pairwise metric. Maximum aggregation metric; not supported for the pairwise metric. Median aggregation metric; not supported for the pairwise metric. 90th percentile aggregation metric; not supported for the pairwise metric. 95th percentile aggregation metric; not supported for the pairwise metric. 99th percentile aggregation metric; not supported for the pairwise metric.

Unspecified answer style. Succinct but abstract style. Very brief and extractive style. Verbose style including extra details. The response may be formatted as a sentence, paragraph, multiple paragraphs, bullet points, etc.

The generic reusable api auth config. Deprecated; please use AuthConfig (google/cloud/aiplatform/master/auth.proto) instead.

The API secret. Required. The SecretManager secret version resource name storing the API key, e.g. projects/{project}/secrets/{secret}/versions/{version}. The API key string.
Either this or api_key_secret_version must be set.

Config for authentication with API key. Optional. The parameter name of the API key. E.g., if the API request is "https://example.com/act?api_key=", "api_key" would be the parameter name. Optional. The name of the SecretManager secret version resource storing the API key. Format: `projects/{project}/secrets/{secret}/versions/{version}`. If both `api_key_secret` and `api_key_string` are specified, this field takes precedence over `api_key_string`. If specified, the `secretmanager.versions.access` permission should be granted to the Vertex AI Extension Service Agent (https://cloud.google.com/vertex-ai/docs/general/access-control#service-agents) on the specified resource. Optional. The API key to be used in the request directly. Optional. The location of the API key.

Required. The SecretManager secret version resource name storing the API key, e.g. projects/{project}/secrets/{secret}/versions/{version}.

Request for an AsyncBatchEmbedContent operation. Required. The batch to create.

Closes the WebSocket connection gracefully. This method is thread-safe and idempotent. Asynchronously disposes the session by closing the WebSocket connection.

Identifier for the source contributing to this attribution. Identifier for an inline passage. Identifier for a Chunk fetched via Semantic Retriever.

Options for audio generation. Optional. The format of the audio response. Can be one of: "wav" (format the response as a WAV file), "mp3" (MP3 file), "flac" (FLAC file), "opus" (OPUS file), or "pcm16" (PCM16 file). Optional. The voice to use for the audio response.

The audio transcription configuration.

Auth configuration to run the extension. Config for API key auth. Type of auth scheme. Config for Google Service Account auth. Config for HTTP Basic auth. Config for user OAuth. Config for user OIDC auth.

No Auth. API Key Auth. HTTP Basic Auth. Google Service Account Auth. OAuth auth. OpenID Connect (OIDC) Auth.

Config for authentication with API key. Optional. The name of the SecretManager secret version resource storing the API key. Format: projects/{project}/secrets/{secret}/versions/{version}. If both api_key_secret and api_key_string are specified, this field takes precedence over api_key_string. If specified, the secretmanager.versions.access permission should be granted to the Vertex AI Extension Service Agent (https://cloud.google.com/vertex-ai/docs/general/access-control#service-agents) on the specified resource. Optional. The API key to be used in the request directly. Optional. The location of the API key. Optional. The parameter name of the API key. E.g., if the API request is "https://example.com/act?api_key=", "api_key" would be the parameter name.

Config for Google Service Account Authentication. Optional. The service account that the extension execution service runs as. If the service account is specified, the iam.serviceAccounts.getAccessToken permission should be granted to the Vertex AI Extension Service Agent (https://cloud.google.com/vertex-ai/docs/general/access-control#service-agents) on the specified service account. If not specified, the Vertex AI Extension Service Agent will be used to execute the Extension.

Config for HTTP Basic Authentication. Required. The name of the SecretManager secret version resource storing the base64-encoded credentials. Format: projects/{project}/secrets/{secret}/versions/{version}. If specified, the secretmanager.versions.access permission should be granted to the Vertex AI Extension Service Agent (https://cloud.google.com/vertex-ai/docs/general/access-control#service-agents) on the specified resource.

Config for user OAuth. Access token for the extension endpoint. Only used to propagate the token from [[ExecuteExtensionRequest.runtime_auth_config]] at request time. The service account used to generate access tokens for executing the Extension. If the service account is specified, the iam.serviceAccounts.getAccessToken permission should be granted to the Vertex AI Extension Service Agent (https://cloud.google.com/vertex-ai/docs/general/access-control#service-agents) on the provided service account.

Config for user OIDC auth. OpenID Connect formatted ID token for the extension endpoint. Only used to propagate the token from [[ExecuteExtensionRequest.runtime_auth_config]] at request time. The service account used to generate an OpenID Connect (OIDC)-compatible JWT token signed by the Google OIDC Provider (accounts.google.com) for the extension endpoint (https://cloud.google.com/iam/docs/create-short-lived-credentials-direct#sa-credentials-oidc). The audience for the token will be set to the URL in the server URL defined in the OpenAPI spec. If the service account is provided, the service account should grant the iam.serviceAccounts.getOpenIdToken permission to the Vertex AI Extension Service Agent (https://cloud.google.com/vertex-ai/docs/general/access-control#service-agents).

A request to create an ephemeral authentication token.

Optional. Input only. Immutable. Configuration specific to BidiGenerateContent.

Optional. Input only. Immutable. An optional time after which, when using the resulting token, messages in BidiGenerateContent sessions will be rejected. (Gemini may preemptively close the session after this time.) If not set, this defaults to 30 minutes in the future. If set, this value must be less than 20 hours in the future.

Optional. Input only. Immutable. If field_mask is empty and bidi_generate_content_setup is not present, then the effective BidiGenerateContentSetup message is taken from the Live API connection. If field_mask is empty and bidi_generate_content_setup _is_ present, then the effective BidiGenerateContentSetup message is taken entirely from bidi_generate_content_setup in this request; the setup message from the Live API connection is ignored.
If field_mask is not empty, then the corresponding fields from bidi_generate_content_setup will overwrite the fields from the setup message in the Live API connection.

Output only. Identifier. The token itself.

Optional. Input only. Immutable. The time after which new Live API sessions using the token resulting from this request will be rejected. If not set, this defaults to 60 seconds in the future. If set, this value must be less than 20 hours in the future.

Optional. Input only. Immutable. The number of times the token can be used. If this value is zero, then no limit is applied. Resuming a Live API session does not count as a use. If unspecified, the default is 1.

Configures automatic detection of activity. Optional. If enabled (the default), detected voice and text input count as activity. If disabled, the client must send activity signals.

Optional. Determines how likely detected speech is to have ended.

Optional. The required duration of detected speech before start-of-speech is committed. The lower this value, the more sensitive the start-of-speech detection is, and the shorter the speech that can be recognized. However, this also increases the probability of false positives.

Optional. The required duration of detected non-speech (e.g. silence) before end-of-speech is committed. The larger this value, the longer the speech gaps can be without interrupting the user's activity, but this will increase the model's latency.

Optional. Determines how likely speech is to be detected.

The default is END_SENSITIVITY_HIGH, where automatic detection ends speech more often; with low end sensitivity, automatic detection ends speech less often. The default is START_SENSITIVITY_HIGH, where automatic detection will detect the start of speech more often; with low start sensitivity, automatic detection will detect the start of speech less often.

The configuration for automatic function calling.

Whether to disable automatic function calling. If not set or set to False, automatic function calling is enabled. If set to True, automatic function calling is disabled.

If automatic function calling is enabled, whether to ignore call history in the response. If not set, the SDK will set ignore_call_history to false and will append the call history to GenerateContentResponse.automatic_function_calling_history.

If automatic function calling is enabled, the maximum number of remote calls for automatic function calling. This number should be a positive integer. If not set, the SDK will set the maximum number of remote calls to 10.

The configs for the autorater. This is applicable to both EvaluateInstances and EvaluateDataset.

Optional. The fully qualified name of the publisher model or tuned autorater endpoint to use. Publisher model format: projects/{project}/locations/{location}/publishers/*/models/*. Tuned model endpoint format: projects/{project}/locations/{location}/endpoints/{endpoint}.

Optional. Default is true. Whether to flip the candidate and baseline responses. This is only applicable to the pairwise metric. If enabled, also provide PairwiseMetricSpec.candidate_response_field_name and PairwiseMetricSpec.baseline_response_field_name. When rendering PairwiseMetricSpec.metric_prompt_template, the candidate and baseline fields will be flipped for half of the samples to reduce bias.

Optional. Configuration options for model generation and outputs.

Optional. Number of samples for each instance in the dataset. If not specified, the default is 4. Minimum value is 1; maximum value is 32.

The configuration for automated routing. When automated routing is specified, the routing will be determined by the pretrained routing model and the customer-provided model routing preference. This data type is not supported in the Gemini API. The model routing preference.

Abstract base class for many config types. Used to override HTTP request options. Whether to include a reason for filtered-out images in the response.

Abstract base type with a logging instance. Default constructor. Base constructor to set the logger instance. Optional.
Logger instance used for logging.

Request to batch create Chunks. Required. The request messages specifying the Chunks to create. A maximum of 100 Chunks can be created in a batch.

Response containing a list of created Chunks. The Chunks created.

Request to batch delete Chunks. Required. The request messages specifying the Chunks to delete.

Batch request to get embeddings from the model for a list of prompts. Required. Embed requests for the batch. The model in each of these requests must match the model specified in BatchEmbedContentsRequest.model.

The response to a BatchEmbedContentsRequest. Output only. The embeddings for each request, in the same order as provided in the batch request.

Batch request to get a text embedding from the model. Optional. Embed requests for the batch. Only one of texts or requests can be set. Optional. The free-form input texts that the model will turn into an embedding. The current limit is 100 texts, over which an error will be thrown.

The response to an EmbedTextRequest. Output only. The embeddings generated from the input text.

Request for a BatchGenerateContent operation. Required. The batch to create.

Stats about the batch. Output only. The number of requests that failed to be processed. Output only. The number of requests that are still pending processing. Output only. The number of requests in the batch. Output only. The number of requests that were successfully processed.

Request to batch update Chunks. Required. The request messages specifying the Chunks to update. A maximum of 100 Chunks can be updated in a batch.

Response containing a list of updated Chunks. The Chunks updated.

Message to be sent in the first (and only in the first) BidiGenerateContentClientMessage. Contains configuration that will apply for the duration of the streaming RPC. Clients should wait for a BidiGenerateContentSetupComplete message before sending any additional messages.

Optional. Configures a context window compression mechanism. If included, the server will automatically reduce the size of the context when it exceeds the configured length.

Optional. Generation config. The following fields are not supported: response_logprobs, response_mime_type, logprobs, response_schema, response_json_schema, stop_sequence, skip_response_cache, routing_config, audio_timestamp.

Optional. If set, enables transcription of voice input. The transcription aligns with the input audio language, if configured.

Required. The model's resource name. This serves as an ID for the Model to use. Format: models/{model}.

Optional. If set, enables transcription of the model's audio output. The transcription aligns with the language code specified for the output audio, if configured.

Optional. Configures the proactivity of the model. This allows the model to respond proactively to the input and to ignore irrelevant input.

Optional. Configures the handling of realtime input.

Optional. Configures the session resumption mechanism. If included, the server will send SessionResumptionUpdate messages.

Optional. The user-provided system instructions for the model. Note: only text should be used in parts, and content in each part will be in a separate paragraph.

Optional. A list of Tools the model may use to generate the next response. A Tool is a piece of code that enables the system to interact with external systems to perform an action, or set of actions, outside of the knowledge and scope of the model.

The BigQuery location for the input content. Required. BigQuery URI to a table, up to 2000 characters long. Accepted form: a BigQuery path, for example bq://projectId.bqDatasetId.bqTableId.

Bleu metric value for an instance. Output only. Bleu score.

Spec for bleu score metric, which calculates the precision of n-grams in the prediction as compared to the reference and returns a score ranging between 0 and 1. Optional. Whether to use_effective_order to compute the bleu score.

A content blob. A Blob contains data of a specific media type.
It is used to represent images, audio, and video. Raw media bytes. Text should not be sent as raw bytes, use the 'text' field. Raw media bytes. Text should not be sent as raw bytes, use the 'text' field. Optional. The display name of the blob. Used to provide a label or filename to distinguish blobs. This field is only returned in PromptMessage for prompt management. It is used in the Gemini calls only when server-side tools (code_execution, google_search, and url_context) are enabled. Raw bytes for media formats. The IANA standard MIME type of the source data. Examples: - image/png - image/jpeg If an unsupported MIME type is provided, an error will be returned. For a complete list of supported types, see [Supported file formats](https://ai.google.dev/gemini-api/docs/prompting_with_media#supported_file_formats). Information to read/write to blobstore2. The blob id, e.g., /blobstore/prod/playground/scotty The blob read token. Needed to read blobs that have not been replicated. Might not be available until the final call. The blob generation id. Metadata passed from Blobstore -> Scotty for a new GCS upload. This is a signed, serialized blobstore2.BlobMetadataContainer proto which must never be consumed outside of Bigstore, and is not applicable to non-GCS media uploads. Read handle passed from Bigstore -> Scotty for a GCS download. This is a signed, serialized blobstore2.ReadHandle proto which must never be set outside of Bigstore, and is not applicable to non-GCS media downloads. Defaults to unspecified. Blocks Low and above confidence URL that is risky. Blocks Medium and above confidence URL that is risky. Blocks High and above confidence URL that is risky. Blocks Higher and above confidence URL that is risky. Blocks Very high and above confidence URL that is risky. Blocks Extremely high confidence URL that is risky. Default value. This value is unused. The blocked reason is unspecified. Prompt was blocked due to safety reasons. 
Inspect `safety_ratings` to understand which safety category blocked it. Prompt was blocked due to unknown reasons. Prompt was blocked due to terms included in the terminology blocklist. Prompt was blocked due to prohibited content. The prompt was blocked by Model Armor. Candidates blocked due to unsafe image generation content. The prompt was blocked as a jailbreak attempt. Config for blur baseline. When enabled, a linear path from the maximally blurred image to the input image is created. Using a blurred baseline instead of zero (black image) is motivated by the BlurIG approach explained here: https://arxiv.org/abs/2004.03383 The standard deviation of the blur kernel for the blurred baseline. The same blurring parameter is used for both the height and the width dimension. If not set, the method defaults to the zero (i.e. black for images) baseline. A resource used in LLM queries for users to explicitly specify what to cache and how to cache. Content that has been preprocessed and can be used in subsequent requests to GenerativeService. Cached content can only be used with the model it was created for. Input only. Immutable. Customer-managed encryption key spec for a CachedContent. If set, this CachedContent and all its sub-resources will be secured by this key. Required. Immutable. The name of the `Model` to use for cached content. Format: `models/{model}` Optional. Identifier. The resource name referring to the cached content. Format: `cachedContents/{id}` Specifies when this resource will expire. Input only. New TTL for this resource, input only. Optional. Input only. Immutable. The content to cache. Output only. Creation time of the cache entry. Optional. Immutable. The user-generated meaningful display name of the cached content. Maximum 128 Unicode characters. Timestamp in UTC of when this resource is considered expired. This is *always* provided on output, regardless of what was sent on input. Optional. Input only. Immutable. 
Developer set system instruction. Currently text only. Optional. Input only. Immutable. Tool config. This config is shared for all tools. Optional. Input only. Immutable. A list of Tools the model may use to generate the next response Output only. When the cache entry was last updated in UTC time. Output only. Metadata on the usage of the cached content. Metadata on the usage of the cached content. Metadata on the usage of the cached content. Duration of audio in seconds. Number of images. Number of text characters. Duration of video in seconds. Total number of tokens that the cached content consumes. A response candidate generated from the model. A response candidate generated from the model. A response candidate generated from the model. Output only. Generated content returned from the model. Output only. Metadata related to url context retrieval tool. Output only. Average log probability score of the candidate. Output only. Citation information for model-generated candidate. This field may be populated with recitation information for any text included in the content. These are passages that are "recited" from copyrighted material in the foundational LLM's training data. Optional. Output only. Details the reason why the model stopped generating tokens. This is populated only when finish_reason is set. Optional. Output only. The reason why the model stopped generating tokens. If empty, the model has not stopped generating tokens. Output only. Attribution information for sources that contributed to a grounded answer. This field is populated for GenerateAnswer calls. Output only. Grounding metadata for the candidate. This field is populated for GenerateContent calls. Output only. Index of the candidate in the list of response candidates. Output only. Log-likelihood scores for the response tokens and top tokens List of ratings for the safety of a response candidate. There is at most one rating per category. Output only. Token count for this candidate. Output only. 
Metadata related to url context retrieval tool. Request for chat completions. Required. The name of the `Model` to use for generating the completion. The model name will be prefixed by "models/" if no slash appears in it. Required. The chat history to use for generating the completion. Supports single and multi-turn queries. Note: This is a polymorphic field; it is deserialized to an InternalChatMessage. Optional. The maximum number of tokens to include in a response candidate. Must be a positive integer. Optional. The maximum number of tokens to include in a response candidate. Must be a positive integer. This field is deprecated by the SDK. Optional. Amount of candidate completions to generate. Must be a positive integer. Defaults to 1 if not set. Optional. Defines the format of the response. If not set, the response will be formatted as text. Optional. The set of character sequences that will stop output generation. Note: This is a polymorphic field. It is meant to contain a string or repeated strings. Optional. Whether to stream the response or return a single response. If true, the "object" field in the response will be "chat.completion.chunk". Otherwise it will be "chat.completion". Optional. Options for streaming requests. Optional. Controls the randomness of the output. Optional. The maximum cumulative probability of tokens to consider when sampling. Optional. Controls whether the model should use a tool or not, and which tool to use. Can be either: - The string "none", to disable tools. - The string "auto", to let the model decide. - The string "required", to force the model to use a tool. - A function name descriptor object, specifying the tool to use. The last option follows this schema: { "type": "function", "function": {"name" : "the_function_name"} } Optional. The set of tools the model can generate calls for. Each tool declares its signature. Optional. Options for audio generation. Optional. Modalities for the request. 
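The tool_choice descriptor object described above can be written out concretely. A hedged sketch of a request fragment (the function name is a hypothetical placeholder; the string forms "none", "auto", and "required" replace the object wholesale):

```json
{
  "tool_choice": {
    "type": "function",
    "function": { "name": "get_current_weather" }
  }
}
```

For the string forms, the field is simply `"tool_choice": "auto"` and so on.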
Optional. Whether to call tools in parallel. Included here for compatibility with the SDK, but only false is supported. Optional. Penalizes new tokens based on previous appearances. Valid ranges are [-2, 2]. Default is 0. Optional. The user name used for tracking the request. Not used, only for compatibility with the SDK. A function that the model can generate calls for. Required. The name of the function. Optional. A description of the function. Optional. Whether the schema validation is strict. If true, the model will fail if the schema is not valid. NOTE: This parameter is currently ignored. Optional. The parameters of the function. Contains an ongoing conversation with the model. This ChatSession object collects the messages sent and received, in its ChatSession.History attribute. The chat history. Returns the last received ContentResponse Constructor to start a chat session with history. The model to use in the chat. A chat history to initialize the session with. Optional. Configuration options for model generation and outputs. Optional. A list of unique SafetySetting instances for blocking unsafe content. Optional. A list of Tools the model may use to generate the next response. Optional. Sends the conversation history with the added message and returns the model's response. Appends the request and response to the conversation history. The content request. Optional. Overrides for the model's generation config. Optional. Overrides for the model's safety settings. Optional. Overrides for the list of tools the model may use to generate the next response. Optional. Overrides for the configuration of tools. Optional. Overrides for the request options. A cancellation token that can be used by other objects or threads to receive notice of cancellation. The model's response. Thrown when is . Thrown when the model's response is blocked by a reason. Thrown when the model's response is stopped by the model's safety settings. 
Thrown when the candidate count is larger than 1. Sends the conversation history with the added message and returns the model's response. Appends the request and response to the conversation history. The message or content sent. Optional. Overrides for the model's generation config. Optional. Overrides for the model's safety settings. Optional. Overrides for the list of tools the model may use to generate the next response. Optional. Overrides for the configuration of tools. Optional. Overrides for the request options. A cancellation token that can be used by other objects or threads to receive notice of cancellation. The model's response. Thrown when is . Sends the conversation history with the added message and returns the model's response. Appends the request and response to the conversation history. The list of content parts sent. Optional. Overrides for the model's generation config. Optional. Overrides for the model's safety settings. Optional. Overrides for the list of tools the model may use to generate the next response. Optional. Overrides for the configuration of tools. Optional. Overrides for the request options. A cancellation token that can be used by other objects or threads to receive notice of cancellation. The model's response. Thrown when the candidate count is larger than 1. Sends the conversation history with the added message and returns the model's response. Appends the request and response to the conversation history. The content request. Optional. Overrides for the model's generation config. Optional. Overrides for the model's safety settings. Optional. Overrides for the list of tools the model may use to generate the next response. Optional. Overrides for the configuration of tools. Optional. Overrides for the request options. A cancellation token that can be used by other objects or threads to receive notice of cancellation. The model's response. Thrown when is Thrown when the is blocked by a reason. 
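The ChatSession surface described above can be driven roughly as follows. This is a minimal sketch, not a verified sample: the model alias, prompts, and environment-variable name are placeholders, and member names should be checked against the shipped Mscc.GenerativeAI package.

```csharp
using System;
using Mscc.GenerativeAI;

// Sketch only: assumes an API key in the GOOGLE_API_KEY environment variable.
var googleAi = new GoogleAI(apiKey: Environment.GetEnvironmentVariable("GOOGLE_API_KEY"));
var model = googleAi.GenerativeModel(model: Model.Gemini15Pro);

// StartChat returns a ChatSession that accumulates History.
var chat = model.StartChat();
var first = await chat.SendMessage("List three uses of embeddings.");
Console.WriteLine(first.Text);

// The session appends each request/response pair, so follow-ups see prior turns.
var second = await chat.SendMessage("Shorten that to one sentence.");
Console.WriteLine(second.Text);
```

Because each SendMessage call replays the accumulated history, overrides (generation config, safety settings, tools) apply per call without mutating the session.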
Thrown when the candidate count is larger than 1. Sends the conversation history with the added message and returns the model's response. Appends the request and response to the conversation history. The message sent. Optional. Overrides for the model's generation config. Optional. Overrides for the model's safety settings. Optional. Overrides for the list of tools the model may use to generate the next response. Optional. Overrides for the configuration of tools. Optional. Overrides for the request options. A cancellation token that can be used by other objects or threads to receive notice of cancellation. The model's response. Thrown when is . Sends the conversation history with the added message and returns the model's response. Appends the request and response to the conversation history. The list of content parts sent. Optional. Overrides for the model's generation config. Optional. Overrides for the model's safety settings. Optional. Overrides for the list of tools the model may use to generate the next response. Optional. Overrides for the configuration of tools. Optional. Overrides for the request options. A cancellation token that can be used by other objects or threads to receive notice of cancellation. The model's response. Thrown when is . Removes the last request/response pair from the chat history. Tuple with the last request/response pair. A tool that the model can generate calls for. Required. The name of the tool. Required. Must be "function". Describes the machine learning model version checkpoint. The ID of the checkpoint. The epoch of the checkpoint. The step of the checkpoint. Container for bytes-encoded data such as a video frame, an audio sample, or complete binary/text data. A Chunk is a subpart of a Document that is treated as an independent unit for the purposes of vector representation and storage. Optional. Metadata that is associated with the data in the payload. Required. Mime type of the chunk data. 
See https://www.iana.org/assignments/media-types/media-types.xhtml for the full list. Output only. The Timestamp of when the Chunk was created. Optional. User provided custom metadata stored as key-value pairs. The maximum number of CustomMetadata per chunk is 20. Required. The content for the Chunk, such as the text string. The maximum number of tokens per chunk is 2043. Immutable. Identifier. The resource name. The ID (name excluding the "corpora/*/documents/*/chunks/" prefix) can contain up to 40 characters that are lowercase alphanumeric or dashes (-). The ID cannot start or end with a dash. If the name is empty on create, a random 12-character unique ID will be generated. Example: Output only. Current state of the Chunk. Output only. The Timestamp of when the Chunk was last updated. Extracted data that represents the content. The content as a string. The maximum number of tokens per chunk is 2043. Parameters for telling the service how to chunk the file. inspired by google3/cloud/ai/platform/extension/lib/retrieval/config/chunker_config.proto Parameters for telling the service how to chunk the file. inspired by google3/cloud/ai/platform/extension/lib/retrieval/config/chunker_config.proto Number of tokens each chunk should have. Number of tokens of overlap between chunks. White space chunking configuration. A citation for a piece of generated content. Output only. The end index of the citation in the content. Output only. The license of the source of the citation. Output only. The publication date of the source of the citation. Output only. The start index of the citation in the content. Output only. The title of the source of the citation. Output only. The URI of the source of the citation. A collection of citations that apply to a piece of generated content. A collection of source attributions for a piece of content. Output only. A list of citations for the content. Citations to sources for a specific response. A citation to a source for a portion of a specific response. 
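The citation fields listed above surface in responses as a small JSON structure. A hedged sketch based on the public v1beta REST shape (index and URI values are illustrative):

```json
{
  "citationMetadata": {
    "citationSources": [
      {
        "startIndex": 120,
        "endIndex": 250,
        "uri": "https://github.com/example/project",
        "license": "mit"
      }
    ]
  }
}
```

Start/end indices are byte offsets into the generated text, with the end exclusive, as the field descriptions state.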
A citation to a source for a portion of a specific response. Output only. The title of the source of the citation. Output only. The publication date of the source of the citation. Optional. End of the attributed segment, exclusive. Optional. License for the GitHub project that is attributed as a source for the segment. License info is required for code citations. Optional. Start of segment of the response that is attributed to this source. Index indicates the start of the segment, measured in bytes. Optional. URI that is attributed as a source for a portion of the text. Tool that executes code generated by the model, and automatically returns the result to the model. See also ExecutableCode and CodeExecutionResult, which are only generated when using this tool. Tool that executes code generated by the model, and automatically returns the result to the model. See also ExecutableCode and CodeExecutionResult which are only generated when using this tool. Result of executing the [ExecutableCode]. Only generated when using the [CodeExecution] tool, and always follows a part containing the [ExecutableCode]. Result of executing the ExecutableCode. Only generated when using the CodeExecution tool, and always follows a Part containing the ExecutableCode. Result of executing the ExecutableCode. Generated only when the CodeExecution tool is used. Required. Outcome of the code execution. Optional. Contains stdout when code execution is successful, stderr or other description otherwise. A sequence of media data references representing composite data. Introduced to support Bigstore composite objects. For details, visit http://go/bigstore-composites. Media data, set if reference_type is INLINE Path to the data, set if reference_type is PATH Describes what the field reference contains. Scotty-provided MD5 hash for an upload. Scotty-provided SHA1 hash for an upload. Scotty-provided SHA256 hash for an upload. For Scotty Uploads: Scotty-provided hashes for uploads For Scotty Downloads: (WARNING: DO NOT USE WITHOUT PERMISSION FROM THE SCOTTY TEAM.) 
A Hash provided by the agent to be used to verify the data being downloaded. Currently only supported for inline payloads. Further, only crc32c_hash is currently supported. Blobstore v1 reference, set if reference_type is BLOBSTORE_REF This should be the byte representation of a blobstore.BlobRef. Since Blobstore is deprecating v1, use blobstore2_info instead. For now, any v2 blob will also be represented in this field as v1 BlobRef. Size of the data, in bytes Reference to a TI Blob, set if reference_type is BIGSTORE_REF. A binary data reference for a media download. Serves as a technology-agnostic binary reference in some Google infrastructure. This value is a serialized storage_cosmo.BinaryReference proto. Storing it as bytes is a hack to get around the fact that the cosmo proto (as well as others it includes) doesn't support JavaScript. This prevents us from including the actual type of this field. Blobstore v2 info, set if reference_type is BLOBSTORE_REF and it refers to a v2 blob. Specification for a computation based metric. Optional. A map of parameters for the metric, e.g. {"rouge_type": "rougeL"}. Required. The type of the computation based metric. Unspecified computation based metric type. Exact match metric. BLEU metric. ROUGE metric. Computer Use tool type. Computer Use tool type. Required. The environment being operated. Optional. By default, predefined functions are included in the final model call. Some of them can be explicitly excluded from being automatically included. This can serve two purposes: 1. Using a more restricted / different action space. 2. Improving the definitions / instructions of predefined functions. Defaults to browser. Operates in a web browser. Operates in a web browser. Optional parameters for computing tokens. Request message for ComputeTokens RPC call. Request message for ComputeTokens RPC call. Optional. Input content. Optional. The instances that are the input to token computing API call. 
Schema is identical to the prediction schema of the text model, even for the non-text models, like chat models, or Codey models. Optional. The name of the publisher model requested to serve the prediction. Format: projects/{project}/locations/{location}/publishers/*/models/* Optional parameters for the request. Response message for ComputeTokens RPC call. Lists of tokens info from the input. A ComputeTokensRequest could have multiple instances with a prompt in each instance. We also need to return lists of tokens info for the request with multiple instances. Filter condition applicable to a single key. Filter condition applicable to a single key. Required. Operator applied to the given key-value pair to trigger the condition. The numeric value to filter the metadata on. The string value to filter the metadata on. The base structured datatype containing multipart content of a message. Ref: https://ai.google.dev/api/rest/v1beta/Content The base structured datatype containing multi-part content of a message. A Content includes a role field designating the producer of the Content and a parts field containing multi-part data that contains the content of the message turn. Ordered Parts that constitute a single message. Parts may have different MIME types. Optional. The producer of the content. Must be either 'user' or 'model'. If not set, the service will default to 'user'. Ordered Parts that constitute a single message. Parts may have different MIME types. The ETag of the item. Initializes a new instance of the class. Initializes a new instance of the class. String to process. Provide the of the text. Initializes a new instance of the class. The part to add. Provide the of the text. Thrown when is null. Initializes a new instance of the class. The parts to add. Provide the of the text. Thrown when is null. Initializes a new instance of the class. Initializes a new instance of the class. String to process. Role of the content. Must be either 'user' or 'model'. 
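A Content is a role plus an ordered list of Parts, as described above. A hedged sketch of the JSON wire shape (text and image data are placeholders; field names follow the public REST docs):

```json
{
  "role": "user",
  "parts": [
    { "text": "Describe this picture." },
    { "inline_data": { "mime_type": "image/png", "data": "<base64-encoded bytes>" } }
  ]
}
```

Parts may mix MIME types within one message; the role must be 'user' or 'model', defaulting to 'user' when omitted.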
Thrown when or is empty or null. A list of floats representing an embedding. A list of floats representing an embedding. The embedding values. This field stores the soft tokens tensor frame shape (e.g. [1, 1, 256, 2048]). Content filtering metadata associated with processing a single request. ContentFilter contains a reason and an optional supporting string. The reason may be unspecified. Content filtering metadata associated with processing a single request. ContentFilter contains a reason and an optional supporting string. The reason may be unspecified. The reason content was blocked during request processing. A string that describes the filtering behavior in more detail. A single example of a conversation with the model. Required. The content of the conversation with the model that resulted in the expected output. Required. The expected output for the given contents. To represent multi-step reasoning, this is a repeated field that contains the iterative steps of the expected output. A single step of the expected output. Required. A single step's content. Detailed Content-Type information from Scotty. The Content-Type of the media will typically be filled in by the header or Scotty's best_guess, but this extended information provides the backend with more information so that it can make a better decision if needed. This is only used on media upload requests from Scotty. The content type of the file derived by looking at specific bytes (i.e. "magic bytes") of the actual file. The content type of the file derived from the file extension of the URL path. The URL path is assumed to represent a file name (which is typically only true for agents that are providing a REST API). The content type of the file as specified in the request headers, multipart headers, or RUPIO start request. The content type of the file derived from the file extension of the original file name used by the client. Scotty's best guess of what the content type of the file is. 
Enables context window compression, a mechanism for managing the model's context window so that it does not exceed a given length. A sliding-window mechanism. The number of tokens (before running a turn) required to trigger a context window compression. This can be used to balance quality against latency, as shorter context windows may result in faster model responses. However, any compression operation will cause a temporary latency increase, so compressions should not be triggered frequently. If not set, the default is 80% of the model's context window limit. This leaves 20% for the next user request/model response. Configuration for a Control reference image. The type of control reference image to use. When set to True, the control image will be computed by the model based on the control type. When set to False, the control image must be provided by the user. Defaults to False. Default control type. For canny edge. For face mesh (person customization). For scribble. Details of ModelService.CopyModel operation. The common part of the operation metadata. Request message for ModelService.CopyModel. Customer-managed encryption key options. If this is set, then the Model copy will be encrypted with the provided encryption key. Optional. Copy source_model into a new Model with this ID. The ID will become the final component of the model resource name. This value may be up to 63 characters, and valid characters are [a-z0-9_-]. The first character cannot be a number or hyphen. Optional. Specify this field to copy source_model into this existing Model as a new version. Format: projects/{project}/locations/{location}/models/{model} Required. The resource name of the Model to copy. That Model must be in the same Project. Format: projects/{project}/locations/{location}/models/{model} The copied model. A Corpus is a collection of Documents. A project can create up to 10 corpora. A Corpus is a collection of Documents. A project can create up to 10 corpora. Output only. 
The Timestamp of when the Corpus was created. Optional. The human-readable display name for the Corpus. The display name must be no more than 512 characters in length, including spaces. Example: "Docs on Semantic Retriever" Output only. Immutable. Identifier. The Corpus resource name. The ID (name excluding the "corpora/" prefix) can contain up to 40 characters that are lowercase alphanumeric or dashes (-). The ID cannot start or end with a dash. If the name is empty on create, a unique name will be derived from display_name along with a 12-character random suffix. Example: corpora/my-awesome-corpora-123a456b789c Output only. The Timestamp of when the Corpus was last updated. Request for querying a `Corpus`. Required. Query string to perform semantic search. Optional. Filter for `Chunk` and `Document` metadata. Each `MetadataFilter` object should correspond to a unique key. Multiple `MetadataFilter` objects are joined by logical "AND"s. Example query at document level: (year >= 2020 OR year < 2010) AND (genre = drama OR genre = action) `MetadataFilter` object list: metadata_filters = [ {key = "document.custom_metadata.year" conditions = [{int_value = 2020, operation = GREATER_EQUAL}, {int_value = 2010, operation = LESS}]}, {key = "document.custom_metadata.year" conditions = [{int_value = 2020, operation = GREATER_EQUAL}, {int_value = 2010, operation = LESS}]}, {key = "document.custom_metadata.genre" conditions = [{string_value = "drama", operation = EQUAL}, {string_value = "action", operation = EQUAL}]}] Example query at chunk level for a numeric range of values: (year > 2015 AND year <= 2020) `MetadataFilter` object list: metadata_filters = [ {key = "chunk.custom_metadata.year" conditions = [{int_value = 2015, operation = GREATER}]}, {key = "chunk.custom_metadata.year" conditions = [{int_value = 2020, operation = LESS_EQUAL}]}] Note: "AND"s for the same key are only supported for numeric values. 
String values only support "OR"s for the same key. Optional. The maximum number of `Chunk`s to return. The service may return fewer `Chunk`s. If unspecified, at most 10 `Chunk`s will be returned. The maximum specified result count is 100. Response from `QueryCorpus` containing a list of relevant chunks. The relevant chunks. RagCorpus status. Output only. Only when the state field is ERROR. Output only. RagCorpus life state. This state is not supposed to happen. RagCorpus resource entry is initialized, but hasn't done validation. RagCorpus is provisioned successfully and is ready to serve. RagCorpus is in a problematic situation. See `error_message` field for details. Counts the number of tokens in the prompt sent to a model. Models may tokenize text differently, so each model may return a different token_count. Required. The prompt, whose token count is to be returned. A response from CountMessageTokens. It returns the model's token_count for the prompt. The number of tokens that the model tokenizes the prompt into. Always non-negative. Counts the number of tokens in the prompt sent to a model. Models may tokenize text differently, so each model may return a different token_count. Required. The free-form input text given to the model as a prompt. A response from CountTextTokens. It returns the model's token_count for the prompt. The number of tokens that the model tokenizes the prompt into. Always non-negative. Config for the count_tokens method. Configuration that the model uses to generate the response. Not supported by the Gemini Developer API. Instructions for the model to steer it toward better performance. Code that enables the system to interact with external systems to perform an action outside of the knowledge and scope of the model. Request message for PredictionService.CountTokens. Request for counting tokens. Counts the number of tokens in the prompt sent to a model. 
Models may tokenize text differently, so each model may return a different token_count. Optional. Generation config that the model will use to generate the response. Optional. The instances that are the input to the token counting call. Schema is identical to the prediction schema of the underlying model. Optional. The name of the publisher model requested to serve the prediction. Format: projects/{project}/locations/{location}/publishers/*/models/* Optional. The user provided system instructions for the model. Note: only text should be used in parts and content in each part will be in a separate paragraph. Optional. A list of Tools the model may use to generate the next response. A Tool is a piece of code that enables the system to interact with external systems to perform an action, or set of actions, outside of knowledge and scope of the model. Configuration for counting tokens. Optional. The input given to the model as a prompt. This field is ignored when generate_content_request is set. Optional. The overall input given to the Model. This includes the prompt as well as other model steering information like [system instructions](https://ai.google.dev/gemini-api/docs/system-instructions), and/or function declarations for [function calling](https://ai.google.dev/gemini-api/docs/function-calling). Models/Contents and generate_content_requests are mutually exclusive. You can either send Model + Contents or a generate_content_request, but never both. Response message for PredictionService.CountTokens. A response from `CountTokens`. It returns the model's `token_count` for the `prompt`. A response from CountTokens. It returns the model's token_count for the prompt. The total number of billable characters counted across all instances from the request. The number of tokens that the model tokenizes the prompt into. Always non-negative. The total number of tokens counted across all instances from the request. Number of tokens in the cached part of the prompt (the cached content). 
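Because each model may tokenize differently, the count_tokens surface above is how a client checks a prompt against a model's limit before sending it. A minimal sketch, not a verified sample (model alias and environment-variable name are placeholders; member names should be checked against the shipped package):

```csharp
using System;
using Mscc.GenerativeAI;

// Sketch only: assumes an API key in the GOOGLE_API_KEY environment variable.
var googleAi = new GoogleAI(apiKey: Environment.GetEnvironmentVariable("GOOGLE_API_KEY"));
var model = googleAi.GenerativeModel(model: Model.Gemini15Flash);

// Ask the target model itself, since token counts are model-specific.
var count = await model.CountTokens("How does AES-GCM differ from AES-CBC?");
Console.WriteLine($"Prompt uses {count.TotalTokens} tokens.");
```

The response also breaks out cached-content tokens and per-modality details where applicable, as the fields above describe.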
Output only. List of modalities that were processed in the cached content. Output only. List of modalities that were processed in the request input. Config for optional parameters. GCS or BigQuery URI prefix for the output predictions. Example: “gs://path/to/output/data” or “bq://projectId.bqDatasetId.bqTableId”. The user-defined name of this BatchJob. Request to create a . Required. The to create. Required. The name of the where this will be created. Example: Request for CreateFile. Optional. Metadata for the file to create. Response for CreateFile. Metadata for the created file. Request to create a tuned model. The name to display for this model in user interfaces. The display name must be up to 40 characters including spaces. The name of the Model to tune. Example: models/text-bison-001 Tuning tasks that create tuned models. Constructor. Creates a request for a tuned model. Model to use. Name of the tuned model. Dataset for training or validation. Immutable. Hyperparameters controlling the tuning process. If not provided, default values will be used. Response of a newly created tuned model. A fine-tuned model created using ModelService.CreateTunedModel. Optional. Name of the foundation model to tune. Supported values: gemini-1.5-pro-002, gemini-1.5-flash-002, and gemini-1.0-pro-002. Optional. A display name for the tuned model. If not set, a random name is generated. Creates an instance of . Creates a request for tuning a model. Model to use. URI of dataset for training. URI of dataset for validation. Immutable. Hyperparameters controlling the tuning process. If not provided, default values will be used. Thrown when is empty or null. Thrown when is empty or null. Represents the credentials used to authenticate with the API. It de/serializes the content of the client_secret.json file for OAuth 2.0 using either Desktop or Web approach, and supports Service Accounts on Google Cloud Platform. Client secrets for web applications. 
Client secrets for desktop applications. Account used in Google Cloud Platform. Refresh token for the API to retrieve a new access token. Type of account in Google Cloud Platform. Uri of domain. Project ID in Google Cloud Platform. Project ID (quota) in Google Cloud Platform. Represents the content of a client_secret.json file used in Google Cloud Platform to authenticate a user or service account. Client ID. Client secret. List of Callback URLs in case of a web application. Authentication endpoint. URL to an X509 certificate provider. Uri of token. Result for custom code execution metric. Output only. Custom code execution score. Specifies a metric that is populated by evaluating user-defined Python code. Required. Python function. The user is expected to define the following function, e.g.: `def evaluate(instance: dict[str, Any]) -> float:` Please include this function signature in the code snippet. The instance is the evaluation instance; any fields populated in the instance are available to the function as `instance[field_name]`. Example input: `instance = EvaluationInstance(response=EvaluationInstance.InstanceData(text="The answer is 4."), reference=EvaluationInstance.InstanceData(text="4"))` Example converted input: `{'response': {'text': 'The answer is 4.'}, 'reference': {'text': '4'}}` Example Python function: `def evaluate(instance: dict[str, Any]) -> float: if instance['response'] == instance['reference']: return 1.0 return 0.0` CustomCodeExecutionSpec is also supported in Batch Evaluation (EvalDataset RPC) and Tuning Evaluation. Each line in the input jsonl file will be converted to dict[str, Any] and passed to the evaluation function. If the value is false, it means the operation is still in progress. If true, the operation is completed, and either error or response is available. The error result of the operation in case of failure or cancellation. Service-specific metadata associated with the operation.
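The user-defined metric example above, restated as runnable code using the converted plain-dict form of the instance:

```python
from typing import Any

def evaluate(instance: dict[str, Any]) -> float:
    # Exact-match custom metric from the example above: 1.0 when the
    # response equals the reference, otherwise 0.0.
    if instance["response"] == instance["reference"]:
        return 1.0
    return 0.0

# Converted input form, as described above: each populated field of the
# evaluation instance is available as instance[field_name].
instance = {
    "response": {"text": "The answer is 4."},
    "reference": {"text": "4"},
}
```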
It typically contains progress information and common metadata such as create time. Some services might not provide such metadata. Any method that returns a long-running operation should document the metadata type, if any. The server-assigned name, which is only unique within the same service that originally returns it. If you use the default HTTP mapping, the name should be a resource name ending with operations/{unique_id}. The normal, successful response of the operation. If the original method returns no data on success, such as Delete, the response is google.protobuf.Empty. If the original method is standard Get/Create/Update, the response should be the resource. For other methods, the response should have the type XxxResponse, where Xxx is the original method name. For example, if the original method name is TakeSnapshot(), the inferred response type is TakeSnapshotResponse. User provided metadata stored as key-value pairs. Required. The key of the metadata to store. The numeric value of the metadata to store. The StringList value of the metadata to store. The string value of the metadata to store. Spec for custom output. Output only. List of raw output strings. Spec for custom output format configuration. Optional. Whether to return raw output. A collection of DataItems and Annotations on them. Dataset for training or validation. Output only. Timestamp when this Dataset was created. Output only. The number of DataItems in this Dataset. Only apply for non-structured Dataset. The description of the Dataset. Required. The user-defined name of the Dataset. The name can be up to 128 characters long and can consist of any UTF-8 characters. Customer-managed encryption key spec for a Dataset. If set, this Dataset and all sub-resources of this Dataset will be secured by this key. Used to perform consistent read-modify-write updates. If not set, a blind "overwrite" update happens. The labels with user-defined metadata to organize your Datasets. 
Label keys and values can be no longer than 64 characters (Unicode codepoints), can only contain lowercase letters, numeric characters, underscores and dashes. International characters are allowed. No more than 64 user labels can be associated with one Dataset (System labels are excluded). See https://goo.gl/xmQnxf for more information and examples of labels. System reserved label keys are prefixed with "aiplatform.googleapis.com/" and are immutable. Following system labels exist for each Dataset: * "aiplatform.googleapis.com/dataset_metadata_schema": output only, its value is the metadata_schema's title. Required. Additional information about the Dataset. Output only. The resource name of the Artifact that was created in MetadataStore when creating the Dataset. The Artifact resource name pattern is projects/{project}/locations/{location}/metadataStores/{metadata_store}/artifacts/{artifact}. Required. Points to a YAML file stored on Google Cloud Storage describing additional information about the Dataset. The schema is defined as an OpenAPI 3.0.2 Schema Object. The schema files that can be used here are found in gs://google-cloud-aiplatform/schema/dataset/metadata/. Optional. Reference to the public base model last used by the dataset. Only set for prompt datasets. Output only. Identifier. The resource name of the Dataset. Format: projects/{project}/locations/{location}/datasets/{dataset} Output only. Reserved for future use. Output only. Reserved for future use. All SavedQueries belong to the Dataset will be returned in List/Get Dataset response. The annotation_specs field will not be populated except for UI cases which will only use annotation_spec_count. In CreateDataset request, a SavedQuery is created together if this field is set, up to one SavedQuery can be set in CreateDatasetRequest. The SavedQuery should not contain any AnnotationSpec. Output only. Timestamp when this Dataset was last updated. Optional. Inline examples with simple input/output text. 
Distribution computed over a tuning dataset. Output only. Defines the histogram bucket. Output only. The maximum of the population values. Output only. The arithmetic mean of the values in the population. Output only. The median of the values in the population. Output only. The minimum of the population values. Output only. The 5th percentile of the values in the population. Output only. The 95th percentile of the values in the population. Output only. Sum of a given population of values. Dataset bucket used to create a histogram for the distribution given a population of values. Output only. Number of values in the bucket. Output only. Left bound of the bucket. Output only. Right bound of the bucket. Statistics computed over a tuning dataset. Output only. A partial sample of the indices (starting from 1) of the dropped examples. Output only. For each index in dropped_example_indices, the user-facing reason why the example was dropped. Output only. Number of billable characters in the tuning dataset. Output only. Number of tuning characters in the tuning dataset. Output only. Number of examples in the tuning dataset. Output only. Number of tuning steps for this Tuning Job. Output only. Sample user messages in the training dataset uri. Output only. Dataset distributions for the user input tokens. Output only. Dataset distributions for the messages per example. Output only. Dataset distributions for the user output tokens. Custom DateTime JSON converter to serialize and deserialize ISO 8601 format without nanoseconds. Custom DateTime JSON converter to serialize and deserialize ISO 8601 format without nanoseconds. Configuration options that change client network behavior when testing. Request to delete a . Required. The resource name of the to delete. Example: Points to a DeployedModel. Immutable. The ID of the Checkpoint deployed in the DeployedModel. Immutable. An ID of a DeployedModel in the above Endpoint. Immutable. A resource name of an Endpoint. 
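The tuning dataset distribution described above (min, max, mean, median, p5, p95, and sum over a population of values) can be sketched as follows; the nearest-rank percentile method is an assumption, since the service does not document its interpolation:

```python
def dataset_distribution(values):
    # Summary statistics over a population of values, mirroring the
    # distribution fields described above.
    vs = sorted(values)
    n = len(vs)

    def percentile(p):
        # Nearest-rank percentile (an assumption; see lead-in).
        return vs[min(n - 1, int(p / 100 * n))]

    return {
        "min": vs[0],
        "max": vs[-1],
        "mean": sum(vs) / n,
        "sum": sum(vs),
        "median": percentile(50),
        "p5": percentile(5),
        "p95": percentile(95),
    }
```

The histogram buckets (count plus left/right bounds) would be layered on top of the same sorted population.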
Backend response for a Diff get checksums response. For details on the Scotty Diff protocol, visit http://go/scotty-diff-protocol. The object version of the object the checksums are being returned for. The total size of the server object. The chunk size of checksums. Must be a multiple of 256KB. If set, calculate the checksums based on the contents and return them to the caller. Exactly one of these fields must be populated. If checksums_location is filled, the server will return the corresponding contents to the user. If object_location is filled, the server will calculate the checksums based on the content there and return that to the user. For details on the format of the checksums, see http://go/scotty-diff-protocol. Backend response for a Diff download response. For details on the Scotty Diff protocol, visit http://go/scotty-diff-protocol. The original object location. A Diff upload request. For details on the Scotty Diff protocol, visit http://go/scotty-diff-protocol. The object version of the object that is the base version the incoming diff script will be applied to. This field will always be filled in. The location of the new object. Agents must clone the object located here, as the upload server will delete the contents once a response is received. The location of the checksums for the new object. Agents must clone the object located here, as the upload server will delete the contents once a response is received. For details on the format of the checksums, see http://go/scotty-diff-protocol. Backend response for a Diff upload request. For details on the Scotty Diff protocol, visit http://go/scotty-diff-protocol. The object version of the object at the server. Must be included in the end notification response. The version in the end notification response must correspond to the new version of the object that is now stored at the server, after the upload. The location of the original file for a diff upload request. 
Must be filled in if responding to an upload start notification. Backend response for a Diff get version response. For details on the Scotty Diff protocol, visit http://go/scotty-diff-protocol. The object version of the object the checksums are being returned for. The total size of the server object. The input content is encapsulated and uploaded in the request. Statistics for distillation prompt dataset. These statistics do not include the responses sampled from the teacher model. Output only. Statistics computed for the training dataset. Hyperparameters for Distillation. Optional. Adapter size for distillation. Optional. Number of complete passes the model makes over the entire training dataset during training. Optional. Multiplier for adjusting the default learning rate. Tuning Spec for Distillation. The base teacher model that is being distilled. See [Supported models](https://cloud.google.com/vertex-ai/generative-ai/docs/model-reference/tuning#supported_models). Optional. Hyperparameters for Distillation. Deprecated. A path in a Cloud Storage bucket, which will be treated as the root output directory of the distillation pipeline. It is used by the system to generate the paths of output artifacts. The student model that is being tuned, e.g., "google/gemma-2b-1.1-it". Deprecated. Use base_model instead. Deprecated. Cloud Storage path to file containing training dataset for tuning. The dataset must be formatted as a JSONL file. The resource name of the Tuned teacher model. Format: projects/{project}/locations/{location}/models/{model}. Optional. Cloud Storage path to file containing validation dataset for tuning. The dataset must be formatted as a JSONL file. A `Document` is a collection of `Chunk`s. A Document is a collection of Chunks. Output only. The Timestamp of when the Document was created. Optional. User provided custom metadata stored as key-value pairs used for querying. A Document can have a maximum of 20 CustomMetadata. Optional. 
The human-readable display name for the Document. The display name must be no more than 512 characters in length, including spaces. Example: "Semantic Retriever Documentation" Output only. The mime type of the Document. Immutable. Identifier. The Document resource name. The ID (name excluding the "fileSearchStores/*/documents/" prefix) can contain up to 40 characters that are lowercase alphanumeric or dashes (-). The ID cannot start or end with a dash. If the name is empty on create, a unique name will be derived from display_name along with a 12 character random suffix. Example: fileSearchStores/{file_search_store_id}/documents/my-awesome-doc-123a456b789c Output only. The size of raw bytes ingested into the Document. Output only. Current state of the Document. Output only. The Timestamp of when the Document was last updated. The default value. This value is used if the state is omitted. Some `Chunks` of the `Document` are being processed (embedding and vector storage). All `Chunks` of the `Document` are processed and available for querying. Some `Chunks` of the `Document` failed processing. Response for DownloadFile. Parameters specific to media downloads. A boolean to be returned in the response to Scotty. Allows/disallows gzip encoding of the payload content when the server thinks it's advantageous (hence, compression is not guaranteed), which allows Scotty to gzip the response to the client. Determines whether Apiary should skip the inclusion of any Content-Range header on its response to Scotty. A Duration represents a signed, fixed-length span of time represented as a count of seconds and fractions of seconds at nanosecond resolution. It is independent of any calendar and concepts like "day" or "month". It is related to Timestamp in that the difference between two Timestamp values is a Duration and it can be added or subtracted from a Timestamp. Range is approximately +-10,000 years. Seconds of a duration. Nanoseconds of a duration.
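The Duration message above carries whole seconds plus a nanosecond fraction. A minimal sketch of mapping it onto Python's timedelta, which only resolves microseconds, so sub-microsecond precision is truncated:

```python
from datetime import timedelta

def duration_to_timedelta(seconds: int, nanos: int) -> timedelta:
    # Duration carries seconds and nanos; timedelta resolves to
    # microseconds, so the sub-microsecond part of nanos is dropped.
    return timedelta(seconds=seconds, microseconds=nanos // 1000)
```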
Describes the options to customize dynamic retrieval. Describes the options to customize dynamic retrieval. The threshold to be used in dynamic retrieval. If not set, a system default value is used. The mode of the predictor to be used in dynamic retrieval. Always trigger retrieval. Run retrieval only when system decides it is necessary. Edit config object for model versions 006 and greater. All editConfig subfields are optional. If not specified, the default editing mode is inpainting. Optional. Describes the editing mode for the request. One editing mode per request. Optional. Optional. Determines the dilation percentage of the mask provided. 0.03 (3%) is the default value of shortest side. Minimum: 0, Maximum: 1 Optional. Defines whether the detected product should stay fixed or be repositioned. If you set this field, you must also set "editMode": "product-image". Values: reposition - Lets the model move the location of the detected product or object. (default value) fixed - The model maintains the original positioning of the detected product or object If the input image is not square, the model defaults to reposition. Request for image editing. Initializes a new instance of the class. A text description of the edit to apply to the image. The number of generated images. Response for the request to edit an image. Output only. A list of the generated images. List of generated images. A list of reasons why content may have been blocked. Default editing mode. Background swap editing mode. Controlled editing mode. Inpainting insertion editing mode. Inpainting removal editing mode. Outpainting editing mode. Product image editing mode. Style editing mode. A resource representing a batch of EmbedContent requests. Output only. Stats about the batch. Output only. The time at which the batch was created. Required. The user-defined name of this batch. Output only. The time at which the batch processing completed. Required. 
Input configuration of the instances on which batch processing is performed. Required. The name of the Model to use for generating the completion. Format: models/{model}. Output only. Identifier. Resource name of the batch. Format: batches/{batch_id}. Output only. The output of the batch request. Optional. The priority of the batch. Batches with a higher priority value will be processed before batches with a lower priority value. Negative values are allowed. Default is 0. Output only. The state of the batch. Output only. The time at which the batch was last updated. The batch state is unspecified. The service is preparing to run the batch. The batch is in progress. The batch completed successfully. The batch failed. The batch has been cancelled. The batch has expired. The output of a batch request. This is returned in the AsyncBatchEmbedContentResponse or the EmbedContentBatch.output field. Output only. The responses to the requests in the batch. Returned when the batch was built using inlined requests. The responses will be in the same order as the input requests. Output only. The file ID of the file containing the responses. The file will be a JSONL file with a single response per line. The responses will be EmbedContentResponse messages formatted as JSON. The responses will be written in the same order as the input requests. Stats about the batch. Output only. The number of requests that failed to be processed. Output only. The number of requests that are still pending processing. Output only. The number of requests in the batch. Output only. The number of requests that were successfully processed. Optional parameters for the embed_content method. Vertex API only. Whether to silently truncate inputs longer than the max sequence length. If this option is set to false, oversized inputs will lead to an INVALID_ARGUMENT error, similar to other text APIs. Vertex API only. The MIME type of the input. Reduced dimension for the output embedding.
If set, excessive values in the output embedding are truncated from the end. Supported by newer models since 2024 only. You cannot set this value if using the earlier model (models/embedding-001). Type of task for which the embedding will be used. Title for the text. Only applicable when TaskType is RETRIEVAL_DOCUMENT. Request message for PredictionService.EmbedContent. Request containing the Content for the model to embed. Optional. Whether to silently truncate the input content if it's longer than the maximum sequence length. Required. The content to embed. Only the fields will be counted. Required. The model's resource name. This serves as an ID for the Model to use. This name should match a model name returned by the ListModels method. Format: models/{model} Optional. Optional reduced dimension for the output embedding. If set, excessive values in the output embedding are truncated from the end. Supported by newer models since 2024 only. You cannot set this value if using the earlier model (models/embedding-001). Optional. Optional task type for which the embeddings will be used. Not supported on earlier models (models/embedding-001). Optional. An optional title for the text. Only applicable when TaskType is RETRIEVAL_DOCUMENT. Note: Specifying a title for RETRIEVAL_DOCUMENT provides better quality embeddings for retrieval. Response message for PredictionService.EmbedContent. The response to an EmbedContentRequest. Whether the input content was truncated before generating the embedding. Metadata about the response(s). Output only. Generated candidates. Output only. The embeddings for each request, in the same order as provided in the batch request. Output only. The embedding generated from the input content. A list of floats representing the embedding. The embedding values.
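The reduced output dimension behavior described above (excess values truncated from the end of the embedding) can be sketched as follows; whether the service also re-normalizes the truncated vector is model-specific and not assumed here:

```python
def apply_output_dimensionality(values, output_dimensionality=None):
    # Mirrors the documented behavior: when a reduced dimension is set,
    # excess values are dropped from the end of the embedding.
    if output_dimensionality is None:
        return list(values)
    return list(values)[:output_dimensionality]
```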
Request to get a text embedding from the model. Default constructor. Optional. The free-form input text that the model will turn into an embedding. Required. The model name to use with the format model=models/{model}. Optional. The free-form input text that the model will turn into an embedding. The response to an EmbedTextRequest. Output only. The embeddings generated from the input text. Output only. The embedding generated from the input text. A generic empty message that you can re-use to avoid defining duplicated empty messages in your APIs. A typical example is to use it as the request or the response type of an API method. For instance: service Foo { rpc Bar(google.protobuf.Empty) returns (google.protobuf.Empty); } Represents a customer-managed encryption key specification that can be applied to a Vertex AI resource. Required. Resource name of the Cloud KMS key used to protect the resource. The Cloud KMS key must be in the same region as the resource. It must have the format projects/{project}/locations/{location}/keyRings/{key_ring}/cryptoKeys/{crypto_key}. Tool to search public web data, powered by Vertex AI Search and Sec4 compliance. Optional. Sites with confidence level chosen & above this value will be blocked from the search results. Optional. List of domains to be excluded from the search results. The default limit is 2000 domains. Optional. Filter search results to a specific time range. If customers set a start time, they must set an end time (and vice versa). This field is not supported in Vertex AI.
Represents an environment variable present in a Container or Python Module. Required. Name of the environment variable. Must be a valid C identifier. Required. Variables that reference a $(VAR_NAME) are expanded using the previously defined environment variables in the container and any service environment variables. If a variable cannot be resolved, the reference in the input string will be unchanged. The $(VAR_NAME) syntax can be escaped with a double $$, i.e. $$(VAR_NAME). Escaped references will never be expanded, regardless of whether the variable exists or not. The results from an evaluation run performed by the EvaluationService. Output only. Aggregation statistics derived from results of EvaluationService. Output only. Output info for EvaluationService. Evaluate Dataset Run Result for Tuning Job. Output only. The checkpoint id used in the evaluation run. Only populated when evaluating checkpoints. Output only. The error of the evaluation run if any. Output only. Results for EvaluationService. Output only. The resource name of the evaluation run. Format: projects/{project}/locations/{location}/evaluationRuns/{evaluation_run_id}. Output only. The operation ID of the evaluation run. Format: projects/{project}/locations/{location}/operations/{operation_id}. Evaluation Config for Tuning Job. Optional. Autorater config for evaluation. Optional. Configuration options for inference generation and outputs. If not set, default generation parameters are used. Required. The metrics used for evaluation. Required. Config for evaluation output. The dataset used for evaluation. BigQuery source holds the dataset. Cloud storage source holds the dataset. Currently only one Cloud Storage file path is supported. Exact match metric value for an instance. Output only. Exact match score. Spec for exact match metric - returns 1 if prediction and reference exactly match, otherwise 0. An input/output example used to instruct the Model.
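The $(VAR_NAME) expansion rules described above for environment variable values (expand from previously defined variables, leave unresolved references unchanged, treat $$(VAR_NAME) as an escaped literal) can be sketched as:

```python
import re

def expand_env_value(value, defined):
    # Expand $(VAR_NAME) references using previously defined variables:
    # unresolved references stay unchanged, and $$(VAR_NAME) escapes to
    # a literal $(VAR_NAME) that is never expanded.
    def repl(match):
        if match.group(1) is not None:          # $$(VAR) -> literal $(VAR)
            return "$(" + match.group(1) + ")"
        name = match.group(2)
        return defined.get(name, match.group(0))  # unresolved: leave as-is
    return re.sub(r"\$\$\((\w+)\)|\$\((\w+)\)", repl, value)
```

For example, with HOME defined, "path=$(HOME)/bin" expands while "$$(HOME)" stays literal.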
It demonstrates how the model should respond or format its response. Output only. Timestamp when this Example was created. Optional. The display name for Example. Optional. Immutable. Unique identifier of an example. If not specified when upserting new examples, the example_id will be generated. An example of chat history and its expected outcome to be used with GenerateContent. Required. An example of an input Message from the user. Required. An example of what the model should output given the input. Example-based explainability that returns the nearest neighbors from the provided dataset. The Cloud Storage input instances. The Cloud Storage locations that contain the instances to be indexed for approximate nearest neighbor search. The full configuration for the generated index, the semantics are the same as metadata and should match [NearestNeighborSearchConfig](https://cloud.google.com/vertex-ai/docs/explainable-ai/configuring-explanations-example-based#nearest-neighbor-search-config). The number of neighbors to return when querying for examples. Simplified preset configuration, which automatically sets configuration values based on the desired query speed-precision trade-off and modality. The Cloud Storage input instances. The format in which instances are given, if not specified, assume it's JSONL format. Currently only JSONL format is supported. The Cloud Storage location for the input instances. Format unspecified, used when unset. Examples are stored in JSONL files. Code generated by the model that is meant to be executed, and the result returned to the model. Generated when using the [CodeExecution] tool, in which the code will be automatically executed, and a corresponding [CodeExecutionResult] will also be generated. Code generated by the model that is meant to be executed, and the result returned to the model. Only generated when using the tool, in which the code will be automatically executed, and a corresponding will also be generated. 
Code generated by the model that is meant to be executed, and the result returned to the model. Only generated when using the CodeExecution tool, in which the code will be automatically executed, and a corresponding CodeExecutionResult will also be generated. Required. The code to be executed. Required. Programming language of the code. Unspecified language. This value should not be used. Python >= 3.10, with numpy and simpy available. Python is the default language. Metadata describing the Model's input and output for explanation. Points to a YAML file stored on Google Cloud Storage describing the format of the feature attributions. The schema is defined as an OpenAPI 3.0.2 [Schema Object](https://github.com/OAI/OpenAPI-Specification/blob/main/versions/3.0.2.md#schemaObject). AutoML tabular Models always have this field populated by Vertex AI. Note: The URI given on output may be different, including the URI scheme, than the one given on input. The output URI will point to a location where the user only has a read access. Required. Map from feature names to feature input metadata. Keys are the name of the features. Values are the specification of the feature. An empty InputMetadata is valid. It describes a text feature which has the name specified as the key in ExplanationMetadata.inputs. The baseline of the empty feature is chosen by Vertex AI. For Vertex AI-provided Tensorflow images, the key can be any friendly name of the feature. Once specified, featureAttributions are keyed by this key (if not grouped with another feature). For custom images, the key must match with the key in instance. Name of the source to generate embeddings for example based explanations. Required. Map from output names to output metadata. For Vertex AI-provided Tensorflow images, keys can be any user defined string that consists of any UTF-8 characters. For custom images, keys are the name of the output field in the prediction to be explained. Currently only one key is allowed. 
Parameters to configure explaining for Model's predictions. Example-based explanations that return the nearest neighbors from the provided dataset. An attribution method that computes Aumann-Shapley values taking advantage of the model's fully differentiable structure. Refer to this paper for more details: https://arxiv.org/abs/1703.01365 If populated, only returns attributions that have output_index contained in output_indices. It must be an ndarray of integers, with the same shape of the output it's explaining. If not populated, returns attributions for top_k indices of outputs. If neither top_k nor output_indices is populated, returns the argmax index of the outputs. Only applicable to Models that predict multiple outputs (e.g., multi-class Models that predict multiple classes). An attribution method that approximates Shapley values for features that contribute to the label being predicted. A sampling strategy is used to approximate the value rather than considering all subsets of features. Refer to this paper for model details: https://arxiv.org/abs/1306.4265. If populated, returns attributions for top K indices of outputs (defaults to 1). Only applies to Models that predict more than one output (e.g., multi-class Models). When set to -1, returns explanations for all outputs. An attribution method that redistributes Integrated Gradients attribution to segmented regions, taking advantage of the model's fully differentiable structure. Refer to this paper for more details: https://arxiv.org/abs/1906.02825 XRAI currently performs better on natural images, like a picture of a house or an animal. If the images are taken in artificial environments, like a lab or manufacturing line, or from diagnostic equipment, like x-rays or quality-control cameras, use Integrated Gradients instead. Specification of Model explanation. Optional. Metadata describing the Model's input and output for explanation. Required. Parameters that configure explaining of the Model's predictions.
Retrieve from data source powered by external API for grounding. The external API is not owned by Google, but needs to follow the pre-defined API spec. The authentication config to access the API. Deprecated. Please use auth_config instead. The API spec that the external API implements. The authentication config to access the API. Parameters for the elastic search API. The endpoint of the external API. The system will call the API at this endpoint to retrieve the data for grounding. Example: https://acme.com:443/search Parameters for the simple search API. Unspecified API spec. This value should not be used. Simple search API spec. Elastic search API spec. The search parameters to use for the ELASTIC_SEARCH spec. The ElasticSearch index to use. Optional. Number of hits (chunks) to request. When specified, it is passed to Elasticsearch as the num_hits param. The ElasticSearch search template to use. The search parameters to use for SIMPLE_SEARCH spec. Noise sigma by features. Noise sigma represents the standard deviation of the gaussian kernel that will be used to add noise to interpolated inputs prior to computing gradients. Noise sigma per feature. No noise is added to features that are not set. Noise sigma for a single feature. The name of the input feature for which noise sigma is provided. The features are defined in explanation metadata inputs. This represents the standard deviation of the Gaussian kernel that will be used to add noise to the feature prior to computing gradients. Similar to noise_sigma but represents the noise added to the current feature. Defaults to 0.1. URI-based data. A FileData message contains a URI pointing to data of a specific media type. It is used to represent images, audio, and video stored in Google Cloud Storage. URI based data. Optional. The display name of the file. Used to provide a label or filename to distinguish files. This field is only returned in PromptMessage for prompt management.
It is used in the Gemini calls only when server side tools (code_execution, google_search, and url_context) are enabled. Required. URI. Optional. The IANA standard MIME type of the source data. Optional. The human-readable display name for the File. The display name must be no more than 512 characters in length, including spaces. Example: "Welcome Image" Optional. The resource name of the File to create. A file resource of the File API. A file uploaded to the API. Output only. The timestamp of when the File was created. Optional. The human-readable display name for the File. The display name must be no more than 512 characters in length, including spaces. Example: "Welcome Image" Output only. The download URI of the File. Output only. Error status if File processing failed. Output only. The timestamp of when the File will be deleted. Only set if the File is scheduled to expire. Output only. MIME type of the file. Immutable. Identifier. The File resource name. The ID (name excluding the "files/" prefix) can contain up to 40 characters that are lowercase alphanumeric or dashes (-). The ID cannot start or end with a dash. If the name is empty on create, a unique name will be generated. Example: files/123-456 Output only. SHA-256 hash of the uploaded bytes. Output only. Size of the file in bytes. Source of the File. Output only. Processing state of the File. Output only. The timestamp of when the File was last updated. Output only. The URI of the File. Output only. Metadata for a video. Used if source is not specified. Indicates the file is uploaded by the user. Indicates the file is generated by Google. Indicates the file is registered, i.e., a Google Cloud Storage file. The default value. This value is used if the state is omitted. File is being processed and cannot be used for inference yet. File is processed and available for inference. File failed processing. Source of the File. Used if source is not specified. 
Indicates the file is uploaded by the user. Indicates the file is generated by Google. Indicates the file is registered, i.e., a Google Cloud Storage file. The FileSearch tool that retrieves knowledge from Semantic Retrieval corpora. Files are imported to Semantic Retrieval corpora using the ImportFile API. The FileSearch tool that retrieves knowledge from Semantic Retrieval corpora. Files are imported to Semantic Retrieval corpora using the ImportFile API. Optional. The configuration for the retrieval. Required. Semantic retrieval resources to retrieve from. Currently only supports one corpus. In the future we may open up multiple corpora support. Convenience property. Convenience property. Required. The names of the file_search_stores to retrieve from. Example: fileSearchStores/my-file-search-store-123 Optional. Metadata filter to apply to the semantic retrieval documents and chunks. Optional. The number of semantic retrieval chunks to retrieve. Semantic retrieval configuration. Optional. Metadata filter to apply to the semantic retrieval documents and chunks. Optional. The number of semantic retrieval chunks to retrieve. A FileSearchStore is a collection of Documents. A FileSearchStore is a collection of Documents. Output only. The number of documents in the FileSearchStore that are active and ready for retrieval. Output only. The Timestamp of when the FileSearchStore was created. Optional. The human-readable display name for the FileSearchStore. The display name must be no more than 512 characters in length, including spaces. Example: "Docs on Semantic Retriever" Output only. The number of documents in the FileSearchStore that have failed processing. Output only. Immutable. Identifier. The FileSearchStore resource name. It is an ID (name excluding the "fileSearchStores/" prefix) that can contain up to 40 characters that are lowercase alphanumeric or dashes (-). It is output only. The unique name will be derived from display_name along with a 12 character random suffix. 
Example: fileSearchStores/my-awesome-file-search-store-123a456b789c If display_name is not provided, the name will be randomly generated. Output only. The number of documents in the FileSearchStore that are being processed. Output only. The size of raw bytes ingested into the FileSearchStore. This is the total size of all the documents in the FileSearchStore. Output only. The Timestamp of when the FileSearchStore was last updated. RagFile status. Output only. Only set when the state field is ERROR. Output only. RagFile state. RagFile state is unspecified. RagFile resource has been created and indexed successfully. RagFile resource is in a problematic state. See `error_message` field for details. Optional. String for metadata filtering. Optional. Only returns contexts with vector distance smaller than the threshold. Optional. Only returns contexts with vector similarity larger than the threshold. Default value. This value is unused. Natural stop point of the model or provided stop sequence. The maximum number of tokens as specified in the request was reached. The response candidate content was flagged for safety reasons. The response candidate content was flagged for recitation reasons. The response candidate content was flagged for using an unsupported language. Unknown reason. Token generation stopped because the content contains forbidden terms. Token generation stopped for potentially containing prohibited content. Token generation stopped because the content potentially contains Sensitive Personally Identifiable Information (SPII). The function call generated by the model is invalid. Token generation stopped because generated images contain safety violations. Image generation stopped because generated images have other prohibited content. Image generation stopped because of other miscellaneous issues. The model was expected to generate an image, but none was generated. Image generation stopped due to recitation. 
Model generated a tool call but no tools were enabled in the request. Model called too many tools consecutively, thus the system exited execution. Request has at least one thought signature missing. Finished due to malformed response. Tuning Spec for Full Fine Tuning. Optional. Hyperparameters for Full Fine Tuning. Required. Training dataset used for tuning. The dataset can be specified as either a Cloud Storage path to a JSONL file or as the resource name of a Vertex Multimodal Dataset. Optional. Validation dataset used for tuning. The dataset can be specified as either a Cloud Storage path to a JSONL file or as the resource name of a Vertex Multimodal Dataset. A predicted [FunctionCall] returned from the model that contains a string representing the [FunctionDeclaration.name] and a structured JSON object containing the parameters and their values. A predicted FunctionCall returned from the model that contains a string representing the FunctionDeclaration.name with the arguments and their values. A predicted FunctionCall returned from the model that contains a string representing the FunctionDeclaration.name with the arguments and their values. Optional. The partial argument value of the function call. If provided, represents the arguments/fields that are streamed incrementally. Optional. Whether this is the last part of the FunctionCall. If false, another partial message for the current FunctionCall is expected to follow. Optional. The function parameters and values in JSON object format. Optional. Unique identifier of the function call. If populated, the client should execute the function_call and return the response with the matching id. Required. The name of the function to call. Must be a-z, A-Z, 0-9, or contain underscores and dashes, with a maximum length of 64. Function calling config. Configuration for specifying function calling behavior. Optional. When set to true, arguments of a single function call will be streamed out in multiple parts/contents/responses. 
Partial parameter results will be returned in the [FunctionCall.partial_args] field. Optional. A set of function names that, when provided, limits the functions the model will call. This should only be set when the Mode is ANY or VALIDATED. Function names should match [FunctionDeclaration.name]. When set, the model will predict a function call from only the allowed function names. Optional. Specifies the mode in which function calling should execute. If unspecified, the default value will be set to AUTO. Unspecified function calling mode. This value should not be used. Default model behavior; the model decides whether to predict a function call or a natural language response. Model is constrained to always predicting a function call only. If "allowed_function_names" is set, the predicted function call will be limited to any one of "allowed_function_names"; otherwise the predicted function call will be any one of the provided "function_declarations". Model will not predict any function call. Model behavior is the same as when not passing any function declarations. Model decides to predict either a function call or a natural language response, but will validate function calls with constrained decoding. If "allowed_function_names" is set, the predicted function call will be limited to any one of "allowed_function_names"; otherwise the predicted function call will be any one of the provided "function_declarations". Structured representation of a function declaration as defined by the [OpenAPI 3.0 specification](https://spec.openapis.org/oas/v3.0.3). Included in this declaration are the function name, description, parameters and response type. This FunctionDeclaration is a representation of a block of code that can be used as a Tool by the model and executed by the client. Structured representation of a function declaration as defined by the OpenAPI 3.0.3 specification. Included in this declaration are the function name and parameters. 
This FunctionDeclaration is a representation of a block of code that can be used as a Tool by the model and executed by the client. Structured representation of a function declaration as defined by the [OpenAPI 3.0.3 specification](https://spec.openapis.org/oas/v3.0.3). Included in this declaration are the function name and parameters. This FunctionDeclaration is a representation of a block of code that can be used as a Tool by the model and executed by the client. Optional. Specifies the function Behavior. Currently only supported by the BidiGenerateContent method. Required. A brief description of the function. Required. The name of the function. Must be a-z, A-Z, 0-9, or contain underscores, colons, dots, and dashes, with a maximum length of 64. Optional. Describes the parameters to this function. Reflects the OpenAPI 3.0.3 Parameter Object. string Key: the name of the parameter. Parameter names are case sensitive. Schema Value: the Schema defining the type used for the parameter. Optional. Describes the parameters to the function in JSON Schema format. The schema must describe an object where the properties are the parameters to the function. For example: `{ "type": "object", "properties": { "name": { "type": "string" }, "age": { "type": "integer" } }, "additionalProperties": false, "required": ["name", "age"], "propertyOrdering": ["name", "age"] }` This field is mutually exclusive with `parameters`. Optional. Describes the output from this function in JSON Schema format. Reflects the OpenAPI 3.0.3 Response Object. The Schema defines the type used for the response value of the function. Optional. Describes the output from this function in JSON Schema format. The value specified by the schema is the response value of the function. This field is mutually exclusive with `response`. This value is unused. If set, the system will wait to receive the function response before continuing the conversation. If set, the system will not wait to receive the function response. 
Instead, it will attempt to handle function responses as they become available while maintaining the conversation between the user and the model. The result output from a [FunctionCall] that contains a string representing the [FunctionDeclaration.name] and a structured JSON object containing any output from the function is used as context to the model. This should contain the result of a [FunctionCall] made based on model prediction. The result output from a FunctionCall that contains a string representing the FunctionDeclaration.name and a structured JSON object containing any output from the function is used as context to the model. This should contain the result of a FunctionCall made based on model prediction. The result output from a FunctionCall that contains a string representing the FunctionDeclaration.name and a structured JSON object containing any output from the function is used as context to the model. This should contain the result of a FunctionCall made based on model prediction. Optional. The identifier of the function call this response is for. Populated by the client to match the corresponding function call id. Required. The name of the function to call. Must be a-z, A-Z, 0-9, or contain underscores and dashes, with a maximum length of 64. Optional. Ordered Parts that constitute a function response. Parts may have different IANA MIME types. Required. The function response in JSON object format. Callers can use any keys of their choice that fit the function's syntax to return the function output, e.g. "output", "result", etc. In particular, if the function call failed to execute, the response can have an "error" key to return error details to the model. Optional. Specifies how the response should be scheduled in the conversation. Only applicable to NON_BLOCKING function calls; ignored otherwise. Defaults to WHEN_IDLE. Optional. Signals that the function call continues, and more responses will be returned, turning the function call into a generator. 
Only applicable to NON_BLOCKING function calls; ignored otherwise. If set to false, future responses will not be considered. It is allowed to return an empty response with will_continue=False to signal that the function call is finished. This may still trigger the model generation. To avoid triggering the generation and finish the function call, additionally set scheduling to SILENT. This value is unused. Only add the result to the conversation context; do not interrupt or trigger generation. Add the result to the conversation context, and prompt to generate output without interrupting ongoing generation. Add the result to the conversation context, interrupt ongoing generation and prompt to generate output. Raw media bytes for function response. Text should not be sent as raw bytes; use the 'text' field. Raw media bytes for function response. Text should not be sent as raw bytes; use the 'FunctionResponse.response' field. Optional. Display name of the blob. Used to provide a label or filename to distinguish blobs. This field is only returned in PromptMessage for prompt management. It is currently used in the Gemini GenerateContent calls only when server side tools (code_execution, google_search, and url_context) are enabled. Raw bytes for media formats. The IANA standard MIME type of the source data. Examples: - image/png - image/jpeg If an unsupported MIME type is provided, an error will be returned. For a complete list of supported types, see [Supported file formats](https://ai.google.dev/gemini-api/docs/prompting_with_media#supported_file_formats). URI-based data for function response. Optional. Display name of the file data. Used to provide a label or filename to distinguish files. This field is only returned in PromptMessage for prompt management. It is currently used in the Gemini GenerateContent calls only when server side tools (code_execution, google_search, and url_context) are enabled. Required. URI. Required. 
The IANA standard MIME type of the source data. A datatype containing media that is part of a FunctionResponse message. A FunctionResponsePart consists of data which has an associated datatype. A FunctionResponsePart can only contain one of the accepted types in FunctionResponsePart.data. A FunctionResponsePart must have a fixed IANA MIME type identifying the type and subtype of the media if the inline_data field is filled with raw bytes. A datatype containing media that is part of a FunctionResponse message. A FunctionResponsePart consists of data which has an associated datatype. A FunctionResponsePart can only contain one of the accepted types in FunctionResponsePart.data. A FunctionResponsePart must have a fixed IANA MIME type identifying the type and subtype of the media if the inline_data field is filled with raw bytes. A datatype containing media that is part of a FunctionResponse message. A FunctionResponsePart consists of data which has an associated datatype. A FunctionResponsePart can only contain one of the accepted types in FunctionResponsePart.data. A FunctionResponsePart must have a fixed IANA MIME type identifying the type and subtype of the media if the inline_data field is filled with raw bytes. URI-based data. Inline media bytes. The Google Cloud Storage location where the output is to be written. Required. Google Cloud Storage URI to the output directory. If the URI doesn't end with '/', a '/' will be automatically appended. The directory is created if it doesn't exist. The Google Cloud Storage location for the input content. Required. Google Cloud Storage URI(s) to the input file(s). May contain wildcards. For more information on wildcards, see https://cloud.google.com/storage/docs/wildcards. Input example for preference optimization. List of completions for a given prompt. Multi-turn contents that represent the Prompt. Completion and its preference score. Single-turn completion for the given prompt. The score for the given completion. Request to generate a grounded answer from the model. Request to generate a grounded answer from the Model. 
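The trailing-slash rule for the Cloud Storage output directory above can be mirrored client-side; a one-line sketch of the documented normalization:

```python
def normalize_output_uri(uri: str) -> str:
    """Append '/' to a GCS output directory URI if it is missing, per the docs."""
    return uri if uri.endswith("/") else uri + "/"
```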
Default constructor. Required. Style in which answers should be returned. Required. The content of the current conversation with the Model. For single-turn queries, this is a single question to answer. For multi-turn queries, this is a repeated field that contains conversation history and the last Content in the list containing the question. Note: GenerateAnswer only supports queries in English. Passages provided inline with the request. Optional. A list of unique SafetySetting instances for blocking unsafe content. This will be enforced on the GenerateAnswerRequest.contents and GenerateAnswerResponse.candidate. There should not be more than one setting for each SafetyCategory type. The API will block any contents and responses that fail to meet the thresholds set by these settings. This list overrides the default settings for each SafetyCategory specified in the safety_settings. If there is no SafetySetting for a given SafetyCategory provided in the list, the API will use the default safety setting for that category. Harm categories HARM_CATEGORY_HATE_SPEECH, HARM_CATEGORY_SEXUALLY_EXPLICIT, HARM_CATEGORY_DANGEROUS_CONTENT, HARM_CATEGORY_HARASSMENT are supported. Refer to the [guide](https://ai.google.dev/gemini-api/docs/safety-settings) for detailed information on available safety settings. Also refer to the [Safety guidance](https://ai.google.dev/gemini-api/docs/safety-guidance) to learn how to incorporate safety considerations in your AI applications. Content retrieved from resources created via the Semantic Retriever API. Optional. Controls the randomness of the output. Values can range from [0.0,1.0], inclusive. A value closer to 1.0 will produce responses that are more varied and creative, while a value closer to 0.0 will typically result in more straightforward responses from the model. A low temperature (~0.2) is usually recommended for Attributed-Question-Answering use cases. Response from the model for a grounded answer. 
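The request fields above can be pictured as a JSON body. This is an illustrative sketch only: the camelCase wire names and the "ABSTRACTIVE" style value are assumptions, not taken from this document.

```python
# Illustrative GenerateAnswer request body (field names and the answer-style
# enum value are assumed, not confirmed by this reference).
request = {
    "contents": [
        {"role": "user", "parts": [{"text": "What is attribution?"}]}
    ],
    "answerStyle": "ABSTRACTIVE",  # assumed enum value
    "temperature": 0.2,            # low temperature suits Attributed-Question-Answering
}
```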
Response from the model for a grounded answer. Responded text information of the first candidate. A convenience overload to easily access the responded text. The responded text information of the first candidate. Candidate answer from the model. Note: The model *always* attempts to provide a grounded answer, even when the answer is unlikely to be answerable from the given passages. In that case, a low-quality or ungrounded answer may be provided, along with a low answerable_probability. Output only. The model's estimate of the probability that its answer is correct and grounded in the input passages. A low answerable_probability indicates that the answer might not be grounded in the sources. When answerable_probability is low, you may want to: * Display a message to the effect of "We couldn’t answer that question" to the user. * Fall back to a general-purpose LLM that answers the question from world knowledge. The threshold and nature of such fallbacks will depend on individual use cases. 0.5 is a good starting threshold. Output only. Feedback related to the input data used to answer the question, as opposed to the model-generated response to the question. The input data can be one or more of the following: - Question specified by the last entry in GenerateAnswerRequest.content - Conversation history specified by the other entries in GenerateAnswerRequest.content - Grounding sources (GenerateAnswerRequest.semantic_retriever or GenerateAnswerRequest.inline_passages) A resource representing a batch of GenerateContent requests. Output only. Stats about the batch. Output only. The time at which the batch was created. Required. The user-defined name of this batch. Output only. The time at which the batch processing completed. Required. Input configuration of the instances on which batch processing is performed. Required. The name of the Model to use for generating the completion. Format: models/{model}. Output only. Identifier. Resource name of the batch. 
Format: batches/{batch_id}. Output only. The output of the batch request. Optional. The priority of the batch. Batches with a higher priority value will be processed before batches with a lower priority value. Negative values are allowed. Default is 0. Output only. The state of the batch. Output only. The time at which the batch was last updated. The batch state is unspecified. The service is preparing to run the batch. The batch is in progress. The batch completed successfully. The batch failed. The batch has been cancelled. The batch has expired. The output of a batch request. This is returned in the BatchGenerateContentResponse or the GenerateContentBatch.output field. Output only. The responses to the requests in the batch. Returned when the batch was built using inlined requests. The responses will be in the same order as the input requests. Output only. The file ID of the file containing the responses. The file will be a JSONL file with a single response per line. The responses will be GenerateContentResponse messages formatted as JSON. The responses will be written in the same order as the input requests. Optional model configuration parameters. If enabled, audio timestamp will be included in the request to the model. The configuration for automatic function calling. Resource name of a context cache that can be used in subsequent requests. Labels with user-defined metadata to break down billed charges. Configuration for model router requests. Safety settings in the request to block unsafe content in the response. Instructions for the model to steer it toward better performance. For example, “Answer as concisely as possible” or “Don’t use technical terms in your response”. Associates model output to a specific function call. Code that enables the system to interact with external systems to perform an action outside of the knowledge and scope of the model. Request message for [PredictionService.GenerateContent]. Request to generate a completion from the model. 
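Because the file-based batch output described above is JSONL, one GenerateContentResponse per line in request order, it can be read back with a small helper (a sketch, not part of the SDK):

```python
import json
from typing import Iterator

def read_batch_responses(path: str) -> Iterator[dict]:
    """Yield one parsed response per non-empty JSONL line, preserving input order."""
    with open(path, encoding="utf-8") as f:
        for line in f:
            if line.strip():
                yield json.loads(line)
```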
Request to generate a completion from the model. Optional. Settings for prompt and response sanitization using the Model Armor service. If supplied, safety_settings must not be supplied. The ETag of the item. Optional. The labels with user-defined metadata for the request. It is used for billing and reporting only. Label keys and values can be no longer than 63 characters (Unicode codepoints) and can only contain lowercase letters, numeric characters, underscores, and dashes. International characters are allowed. Label values are optional. Label keys must start with a letter. Initializes a new instance of the class. Initializes a new instance of the class. String to process. Optional. Configuration options for model generation and outputs. Optional. A list of unique SafetySetting instances for blocking unsafe content. Optional. A list of Tools the model may use to generate the next response. Optional. Configuration of tools. Thrown when the required argument is null. Initializes a new instance of the class. Optional. Configuration options for model generation and outputs. Optional. A list of unique SafetySetting instances for blocking unsafe content. Optional. A list of Tools the model may use to generate the next response. Optional. Configuration of tools. Thrown when the required argument is null. Initializes a new instance of the class. The media file resource. Optional. Configuration options for model generation and outputs. Optional. A list of unique SafetySetting instances for blocking unsafe content. Optional. A list of Tools the model may use to generate the next response. Optional. Configuration of tools. Thrown when the required argument is null. Initializes a new instance of the class. Optional. Configuration options for model generation and outputs. Optional. A list of unique SafetySetting instances for blocking unsafe content. Optional. A list of Tools the model may use to generate the next response. Optional. Configuration of tools. Thrown when the required argument is null. 
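The label-key constraints above (at most 63 codepoints, starting with a letter; lowercase letters, digits, underscores, and dashes, with international characters allowed) can be checked with a small validator. This is an illustrative reading of the rule, not SDK code:

```python
# Illustrative check of the documented label-key rule.
def is_valid_label_key(key: str) -> bool:
    """<= 63 codepoints, starts with a lowercase letter; lowercase letters,
    digits, underscores, and dashes thereafter (Unicode lowercase allowed)."""
    if not key or len(key) > 63:
        return False
    if not (key[0].isalpha() and key[0].islower()):
        return False
    return all(c.islower() or c.isdigit() or c in "_-" for c in key)
```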
Adds a Part object to the request. Adds a media file or a base64-encoded string to the request. Depending on the flag, either an inline-data or a file-data part will be added to the request. Standard URLs are supported and the resource is downloaded if the flag is set. The URI of the media file. The IANA standard MIME type to check. Flag indicating whether the file shall be used online or read from the local file system. Thrown when the required argument is null. Adds a media file resource to the request. The media file resource. Thrown when the required argument is null. Thrown when the MIME type of the resource is not supported by the API. Adds a Part object to the Content at the specified index. Part object to add to the collection. Zero-based index of the element in the Contents collection. Optional. The name of the content [cached](https://ai.google.dev/gemini-api/docs/caching) to use as context to serve the prediction. Format: cachedContents/{cachedContent} Required. The content of the current conversation with the model. For single-turn queries, this is a single instance. For multi-turn queries like [chat](https://ai.google.dev/gemini-api/docs/text-generation#chat), this is a repeated field that contains the conversation history and the latest request. Optional. Configuration options for model generation and outputs. Required. The name of the Model to use for generating the completion. Format: models/{model}. Optional. A list of unique SafetySetting instances for blocking unsafe content. This will be enforced on the GenerateContentRequest.contents and GenerateContentResponse.candidates. There should not be more than one setting for each SafetyCategory type. The API will block any contents and responses that fail to meet the thresholds set by these settings. This list overrides the default settings for each SafetyCategory specified in the safety_settings. If there is no SafetySetting for a given SafetyCategory provided in the list, the API will use the default safety setting for that category. 
Harm categories HARM_CATEGORY_HATE_SPEECH, HARM_CATEGORY_SEXUALLY_EXPLICIT, HARM_CATEGORY_DANGEROUS_CONTENT, HARM_CATEGORY_HARASSMENT, HARM_CATEGORY_CIVIC_INTEGRITY are supported. Refer to the [guide](https://ai.google.dev/gemini-api/docs/safety-settings) for detailed information on available safety settings. Also refer to the [Safety guidance](https://ai.google.dev/gemini-api/docs/safety-guidance) to learn how to incorporate safety considerations in your AI applications. Optional. Developer set [system instruction(s)](https://ai.google.dev/gemini-api/docs/system-instructions). Currently, text only. Optional. Tool configuration for any Tool specified in the request. Refer to the [Function calling guide](https://ai.google.dev/gemini-api/docs/function-calling#function_calling_mode) for a usage example. Optional. A list of Tools the Model may use to generate the next response. A Tool is a piece of code that enables the system to interact with external systems to perform an action, or set of actions, outside of knowledge and scope of the Model. Supported Tools are Function and code_execution. Refer to the [Function calling](https://ai.google.dev/gemini-api/docs/function-calling) and the [Code execution](https://ai.google.dev/gemini-api/docs/code-execution) guides to learn more. Response message for [PredictionService.GenerateContent]. Response from the model supporting multiple candidates. Ref: https://ai.google.dev/api/rest/v1beta/GenerateContentResponse Response from the model supporting multiple candidate responses. Safety ratings and content filtering are reported for both prompt in GenerateContentResponse.prompt_feedback and for each candidate in finish_reason and in safety_ratings. The API: - Returns either all requested candidates or none of them - Returns no candidates at all only if there was something wrong with the prompt (check prompt_feedback) - Reports feedback on each candidate in finish_reason and safety_ratings. Output only. 
Timestamp when the request is made to the server. A convenience property to get the responded text information of the first candidate. A convenience property to get the function calls. A convenience property to get the responded thinking information of the first candidate. Default constructor. Base constructor to set the instance. Optional. Logger instance used for logging. A convenience overload to easily access the responded text. The responded text information of the first candidate. Candidate responses from the model. Output only. The current model status of this model. Output only. The model version used to generate the response. Returns the prompt's feedback related to the content filters. Output only. response_id is used to identify each response. Output only. Metadata on the generation requests' token usage. Content filter results for a prompt sent in the request. Note: This is sent only in the first stream chunk and only if no candidates were generated due to content violations. Output only. The reason why the prompt was blocked. Output only. A readable message that explains the reason why the prompt was blocked. Output only. A list of safety ratings for the prompt. There is one rating per category. Usage metadata about the content generation request and response. This message provides a detailed breakdown of token usage and other relevant metrics. Output only. A detailed breakdown of the token count for each modality in the cached content. Output only. The number of tokens in the cached content that was used for this request. The total number of tokens in the generated candidates. Output only. A detailed breakdown of the token count for each modality in the generated candidates. The total number of tokens in the prompt. This includes any text, images, or other media provided in the request. When cached_content is set, this also includes the number of tokens in the cached content. Output only. A detailed breakdown of the token count for each modality in the prompt. 
Output only. The number of tokens that were part of the model's generated "thoughts" output, if applicable. Output only. The number of tokens in the results from tool executions, which are provided back to the model as input, if applicable. Output only. A detailed breakdown by modality of the token counts from the results of tool executions, which are provided back to the model as input. The total number of tokens for the entire request. This is the sum of prompt_token_count, candidates_token_count, tool_use_prompt_token_count, and thoughts_token_count. Output only. The traffic type for this request. Unspecified request traffic type. The request was processed using Pay-As-You-Go quota. Type for Priority Pay-As-You-Go traffic. Type for Flex traffic. Type for Provisioned Throughput traffic. A file generated on behalf of a user. A file generated on behalf of a user. The blob reference of the generated file to download. Only set when the GeneratedFiles.get request url has the "?alt=media" query param. Error details if the GeneratedFile ends up in the STATE_FAILED state. MIME type of the generatedFile. Identifier. The name of the generated file. Example: generatedFiles/abc-123 Output only. The state of the GeneratedFile. The default value. This value is used if the state is omitted. Being generated. Generated and is ready for download. Failed to generate the GeneratedFile. An output image. The output image data. Responsible AI filter reason if the image is filtered out of the response. The rewritten prompt used for the image generation if the prompt enhancer is enabled. A generated video. The output video. An embedding vector generated by the model. Output only. The embedding vector generated for the input. Can be either a list of floats or a base64 string encoding a list of floats with C-style layout (Numpy compatible). Output only. Index of the embedding in the list of embeddings. Output only. Always "embedding", required by the SDK. 
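An embedding returned as a base64 string (the float-list encoding mentioned above) can be decoded with the standard library. Float32 little-endian layout is an assumption in this sketch, so verify against the actual SDK behavior:

```python
import base64
import struct

def decode_embedding(b64: str) -> list:
    """Decode a base64-encoded embedding into floats (float32 layout assumed)."""
    raw = base64.b64decode(b64)
    count = len(raw) // 4  # 4 bytes per float32
    return list(struct.unpack(f"<{count}f", raw))
```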
Request for embedding generation. Required. Model to generate the embeddings for. Required. The input to generate embeddings for. Can be a string, or a list of strings. The SDK supports a list of numbers and list of list of numbers, but this is not yet implemented. Optional. The format of the encoding. Must be either "float" or "base64". Optional. Dimensional size of the generated embeddings. Response for embedding generation. Output only. Model used to generate the embeddings. Output only. Always "embedding", required by the SDK. Output only. A list of the requested embeddings. The config for generating images. Request for image generation. Initializes a new instance of the class. The text prompt guides what images the model generates. The number of generated images. Thrown when the is . Thrown when the is less than 1 or greater than 8. Response for image generation. Output only. A list of the generated images. List of generated images. Output only. Model used to generate the images. Output only. Always "image", required by the SDK. Generates a response from the model given an input MessagePrompt. Request to generate a message response from the model. Default constructor. Optional. The number of generated response messages to return. This value must be between [1, 8], inclusive. If unset, this will default to 1. Required. The structured textual input given to the model as a prompt. Given a prompt, the model will return what it predicts is the next message in the discussion. Optional. Controls the randomness of the output. Values can range over [0.0,1.0], inclusive. A value closer to 1.0 will produce responses that are more varied, while a value closer to 0.0 will typically result in less surprising responses from the model. Optional. The maximum number of tokens to consider when sampling. The model uses combined Top-k and nucleus sampling. Top-k sampling considers the set of top_k most probable tokens. Optional. 
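An embedding returned as a base64 string with C-style float layout can be unpacked with the standard library alone. A sketch, assuming little-endian packed float32 values (the "NumPy compatible" layout mentioned above):

```python
import base64
import struct

def decode_embedding(values):
    """Accept either a list of floats or a base64 string of packed
    little-endian float32 values (C-style layout), and return floats."""
    if isinstance(values, list):
        return values
    raw = base64.b64decode(values)
    return list(struct.unpack(f"<{len(raw) // 4}f", raw))

# Round-trip: values chosen to be exactly representable in float32.
floats = [0.5, -1.25, 2.0]
encoded = base64.b64encode(struct.pack("<3f", *floats)).decode()
print(decode_embedding(encoded))  # [0.5, -1.25, 2.0]
```

With NumPy available, `np.frombuffer(raw, dtype=np.float32)` is the equivalent one-liner.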
The maximum cumulative probability of tokens to consider when sampling. The model uses combined Top-k and nucleus sampling. Nucleus sampling considers the smallest set of tokens whose probability sum is at least top_p. The response from the model. This includes candidate messages and conversation history in the form of chronologically-ordered messages. The response from the model. This includes candidate messages and conversation history in the form of chronologically-ordered messages. The response text of the first candidate. A convenience overload to easily access the response text. The response text of the first candidate. Candidate response messages from the model. A set of content filtering metadata for the prompt and response text. This indicates which SafetyCategory(s) blocked a candidate from this response, the lowest HarmProbability that triggered a block, and the HarmThreshold setting for that category. The conversation history used by the model. Request to generate a text completion response from the model. Request to generate a text completion response from the model. Default constructor. Optional. Number of generated responses to return. This value must be between [1, 8], inclusive. If unset, this will default to 1. Optional. The maximum number of tokens to include in a candidate. If unset, this will default to output_token_limit specified in the Model specification. Required. The free-form input text given to the model as a prompt. Given a prompt, the model will generate a TextCompletion response it predicts as the completion of the input text. Optional. A list of unique SafetySetting instances for blocking unsafe content that will be enforced on the GenerateTextRequest.prompt and GenerateTextResponse.candidates. There should not be more than one setting for each SafetyCategory type. The API will block any prompts and responses that fail to meet the thresholds set by these settings. 
This list overrides the default settings for each SafetyCategory specified in the safety_settings. If there is no SafetySetting for a given SafetyCategory provided in the list, the API will use the default safety setting for that category. Harm categories HARM_CATEGORY_DEROGATORY, HARM_CATEGORY_TOXICITY, HARM_CATEGORY_VIOLENCE, HARM_CATEGORY_SEXUAL, HARM_CATEGORY_MEDICAL, HARM_CATEGORY_DANGEROUS are supported in text service. The set of character sequences (up to 5) that will stop output generation. If specified, the API will stop at the first appearance of a stop sequence. The stop sequence will not be included as part of the response. Optional. Controls the randomness of the output. Note: The default value varies by model, see the Model.temperature attribute of the Model returned by the getModel function. Values can range from [0.0,1.0], inclusive. A value closer to 1.0 will produce responses that are more varied and creative, while a value closer to 0.0 will typically result in more straightforward responses from the model. Optional. The maximum number of tokens to consider when sampling. The model uses combined Top-k and nucleus sampling. Top-k sampling considers the set of top_k most probable tokens. Defaults to 40. Note: The default value varies by model, see the Model.top_k attribute of the Model returned by the getModel function. Optional. The maximum cumulative probability of tokens to consider when sampling. The model uses combined Top-k and nucleus sampling. Tokens are sorted based on their assigned probabilities so that only the most likely tokens are considered. Top-k sampling directly limits the maximum number of tokens to consider, while Nucleus sampling limits the number of tokens based on the cumulative probability. Note: The default value varies by model, see the Model.top_p attribute of the Model returned by the getModel function. The response from the model, including candidate completions. The response from the model, including candidate completions. 
The response text of the first candidate. A convenience overload to easily access the response text. The response text of the first candidate. Candidate responses from the model. A set of content filtering metadata for the prompt and response text. This indicates which SafetyCategory(s) blocked a candidate from this response, the lowest HarmProbability that triggered a block, and the HarmThreshold setting for that category. This indicates the smallest change to the SafetySettings that would be necessary to unblock at least 1 response. The blocking is configured by the SafetySettings in the request (or the default SafetySettings of the API). Returns any safety feedback related to content filtering. Number of output videos. Number of output videos. Optional. The aspect ratio for the generated video. 16:9 (landscape) and 9:16 (portrait) are supported. Value: 9:16, or 16:9 Duration of the clip for video generation in seconds. Whether to use the prompt rewriting logic. Frames per second for video generation. Used to override HTTP request options. Optional field in addition to the text content. Negative prompts can be explicitly stated here to help generate the video. The GCS bucket in which to save the generated videos. Whether to allow generating videos of people, and whether to restrict generation to specific ages. Supported values are: dont_allow, allow_adult. "personGeneration": "allow_all" is not available in Imagen 2 Editing and is only available to approved users in Imagen 2 Generation. Values: allow_all: Allow generation of people of all ages. allow_adult (default): Allow generation of adults only. dont_allow: Disables the inclusion of people or faces in images. The PubSub topic on which to publish the video generation progress. The resolution for the generated video. 1280x720, 1920x1080 are supported. Value: 1280x720, or 1920x1080 The RNG seed. If the RNG seed is exactly the same for each request with unchanged inputs, the prediction results will be consistent. 
Otherwise, a random RNG seed will be used each time to produce a different result. A video generation operation. Use the following code to refresh the operation: `operation = client.operations.get(operation)`. The server-assigned name, which is only unique within the same service that originally returns it. If you use the default HTTP mapping, the `name` should be a resource name ending with `operations/{unique_id}`. The generated videos. Service-specific metadata associated with the operation. It typically contains progress information and common metadata such as create time. Some services might not provide such metadata. Any method that returns a long-running operation should document the metadata type, if any. If the value is `false`, it means the operation is still in progress. If `true`, the operation is completed, and either `error` or `response` is available. The error result of the operation in case of failure or cancellation. The normal response of the operation in case of success. An array that contains the object with video details to get information about. Initializes a new instance of the class. Initializes a new instance of the class. The text prompt guides what videos the model generates. The number of generated videos. Thrown when the is . Thrown when the is less than 1 or greater than 8. Response with generated videos. List of the generated videos. Returns whether any videos were filtered due to RAI policies. Returns RAI failure reasons, if any. Configuration for content generation. This message contains all the parameters that control how the model generates content. It allows you to influence the randomness, length, and structure of the output. Configuration options for model generation and outputs. Not all parameters are configurable for every model. Configuration options for model generation and outputs. Not all parameters are configurable for every model. Optional. 
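The `done`/`error`/`response` semantics above imply a standard polling loop: refresh the operation until `done` is true, then surface either the error or the response. A minimal Python sketch against dict-shaped operations; `fake_get` and its field names stand in for the real operations.get call and are hypothetical.

```python
import time

def wait_for_operation(get_operation, operation, poll_seconds=0.0):
    """Poll a long-running operation until `done` is true, then either
    raise its `error` or return its `response`."""
    while not operation.get("done"):
        time.sleep(poll_seconds)
        operation = get_operation(operation["name"])
    if "error" in operation:
        raise RuntimeError(operation["error"].get("message", "operation failed"))
    return operation["response"]

# Stub standing in for the real operations.get call (hypothetical shapes).
_states = [
    {"name": "operations/123", "done": False},
    {"name": "operations/123", "done": True,
     "response": {"generated_videos": ["video-0"]}},
]
def fake_get(name):
    return _states.pop(0)

result = wait_for_operation(fake_get, fake_get("operations/123"))
print(result)  # {'generated_videos': ['video-0']}
```

In production, `poll_seconds` should be a non-zero interval (often with backoff) to avoid hammering the service.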
If enabled, audio timestamps will be included in the request to the model. This can be useful for synchronizing audio with other modalities in the response. Optional. If enabled, the model will detect emotions and adapt its responses accordingly. For example, if the model detects that the user is frustrated, it may provide a more empathetic response. Optional. Config for model selection. Optional. Routing configuration. Optional. Config for model selection. Optional. An internal detail. Use `responseJsonSchema` rather than this field. Optional. Controls the maximum depth of the model's internal reasoning process before it produces a response. If not specified, the default is HIGH. Recommended for Gemini 3 or later models. Using it with earlier models results in an error. Optional. Output schema of the generated response. This is an alternative to response_schema that accepts [JSON Schema](https://json-schema.org/). If set, response_schema must be omitted, but response_mime_type is required. While the full JSON Schema may be sent, not all features are supported. Specifically, only the following properties are supported: - $id - $defs - $ref - $anchor - type - format - title - description - enum (for strings and numbers) - items - prefixItems - minItems - maxItems - minimum - maximum - anyOf - oneOf (interpreted the same as anyOf) - properties - additionalProperties - required The non-standard propertyOrdering property may also be set. Cyclic references are unrolled to a limited degree and, as such, may only be used within non-required properties. (Nullable properties are not sufficient.) If $ref is set on a sub-schema, no other properties, except for those starting with a $, may be set. Optional. Number of generated responses to return. If unset, this will default to 1. Please note that this doesn't work for previous generation models (Gemini 1.0 family) Optional. Enables enhanced civic answers. It may not be available for all models. Optional. 
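Because only a subset of JSON Schema keywords is accepted, it can be worth linting a schema before sending it. A rough Python sketch of such a check against the keyword list above; the traversal handles only the common nesting forms and is illustrative, not exhaustive.

```python
# Keywords the responseJsonSchema field accepts, per the docs above.
SUPPORTED = {"$id", "$defs", "$ref", "$anchor", "type", "format", "title",
             "description", "enum", "items", "prefixItems", "minItems",
             "maxItems", "minimum", "maximum", "anyOf", "oneOf",
             "properties", "additionalProperties", "required",
             "propertyOrdering"}

def check_schema(schema: dict) -> None:
    """Raise ValueError if the schema uses a keyword the API rejects."""
    for key, value in schema.items():
        if key not in SUPPORTED:
            raise ValueError(f"unsupported keyword: {key}")
        if key in ("properties", "$defs"):
            for sub_schema in value.values():
                check_schema(sub_schema)
        elif key in ("items", "additionalProperties") and isinstance(value, dict):
            check_schema(value)
        elif key in ("anyOf", "oneOf", "prefixItems"):
            for sub_schema in value:
                check_schema(sub_schema)

recipe_schema = {
    "type": "object",
    "properties": {
        "name": {"type": "string"},
        "servings": {"type": "integer", "minimum": 1},
    },
    "required": ["name"],
}
check_schema(recipe_schema)  # passes: only supported keywords used
```

A schema using, say, `patternProperties` would fail this check, matching the API's documented restriction.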
Frequency penalty applied to the next token's logprobs, multiplied by the number of times each token has been seen in the response so far. A positive penalty will discourage the use of tokens that have already been used, proportional to the number of times the token has been used: The more a token is used, the more difficult it is for the model to use that token again, increasing the vocabulary of responses. Caution: A _negative_ penalty will encourage the model to reuse tokens proportional to the number of times the token has been used. Small negative values will reduce the vocabulary of a response. Larger negative values will cause the model to start repeating a common token until it hits the max_output_tokens limit. Optional. Config for image generation. An error will be returned if this field is set for models that don't support these config options. Optional. Only valid if response_logprobs=True. This sets the number of top logprobs to return at each decoding step in the Candidate.logprobs_result. The number must be in the range of [0, 20]. Optional. The maximum number of tokens to include in a response candidate. Note: The default value varies by model, see the Model.output_token_limit attribute of the Model returned from the getModel function. Optional. If specified, the media resolution specified will be used. Optional. Presence penalty applied to the next token's logprobs if the token has already been seen in the response. This penalty is binary on/off and not dependent on the number of times the token is used (after the first). Use frequency_penalty for a penalty that increases with each use. A positive penalty will discourage the use of tokens that have already been used in the response, increasing the vocabulary. A negative penalty will encourage the use of tokens that have already been used in the response, decreasing the vocabulary. Optional. An internal detail. Use responseJsonSchema rather than this field. Optional. 
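The distinction between the two penalties is easiest to see in code: frequency scales with the count of prior uses, presence is a one-off hit per seen token. A minimal Python sketch of the arithmetic described above (the exact server-side formula is not specified here; this only mirrors the documented behavior):

```python
def apply_penalties(logprobs, counts, frequency_penalty=0.0, presence_penalty=0.0):
    """Adjust next-token logprobs: frequency penalty scales with how often
    each token already appeared; presence penalty is binary on/off."""
    adjusted = {}
    for token, logprob in logprobs.items():
        n = counts.get(token, 0)
        adjusted[token] = (logprob
                           - frequency_penalty * n
                           - presence_penalty * (1 if n > 0 else 0))
    return adjusted

logprobs = {"the": -1.0, "a": -1.2, "novel": -2.0}
counts = {"the": 3, "a": 1}  # tokens already seen in the response
adjusted = apply_penalties(logprobs, counts,
                           frequency_penalty=0.5, presence_penalty=0.2)
print(adjusted)  # "the" penalized hardest; "novel" untouched
```

With a positive frequency penalty, the heavily repeated token ("the") loses the most probability mass, which is exactly the vocabulary-widening effect the description names.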
If true, export the logprobs results in response. Optional. MIME type of the generated candidate text. Supported MIME types are: text/plain: (default) Text output. application/json: JSON response in the response candidates. text/x.enum: ENUM as a string response in the response candidates. Refer to the [docs](https://ai.google.dev/gemini-api/docs/prompting_with_media#plain_text_formats) for a list of all supported text MIME types. Optional. The requested modalities of the response. Represents the set of modalities that the model can return, and should be expected in the response. This is an exact match to the modalities of the response. A model may have multiple combinations of supported modalities. If the requested modalities do not match any of the supported combinations, an error will be returned. An empty list is equivalent to requesting only text. Optional. Output schema of the generated candidate text. Schemas must be a subset of the [OpenAPI schema](https://spec.openapis.org/oas/v3.0.3#schema) and can be objects, primitives or arrays. If set, a compatible response_mime_type must also be set. Compatible MIME types: application/json: Schema for JSON response. Refer to the [JSON text generation guide](https://ai.google.dev/gemini-api/docs/json-mode) for more details. Optional. Seed used in decoding. If not set, the request uses a randomly generated seed. Optional. The speech generation config. Optional. The set of character sequences (up to 5) that will stop output generation. If specified, the API will stop at the first appearance of a stop_sequence. The stop sequence will not be included as part of the response. Optional. Controls the randomness of the output. Note: The default value varies by model, see the Model.temperature attribute of the Model returned from the getModel function. Values can range from [0.0, 2.0]. Optional. Config for thinking features. An error will be returned if this field is set for models that don't support thinking. Optional. 
The maximum number of tokens to consider when sampling. Gemini models use Top-p (nucleus) sampling or a combination of Top-k and nucleus sampling. Top-k sampling considers the set of top_k most probable tokens. Models running with nucleus sampling don't allow top_k setting. Note: The default value varies by Model and is specified by the Model.top_p attribute returned from the getModel function. An empty top_k attribute indicates that the model doesn't apply top-k sampling and doesn't allow setting top_k on requests. Optional. The maximum cumulative probability of tokens to consider when sampling. The model uses combined Top-k and Top-p (nucleus) sampling. Tokens are sorted based on their assigned probabilities so that only the most likely tokens are considered. Top-k sampling directly limits the maximum number of tokens to consider, while Nucleus sampling limits the number of tokens based on the cumulative probability. Note: The default value varies by Model and is specified by the Model.top_p attribute returned from the getModel function. An empty top_k attribute indicates that the model doesn't apply top-k sampling and doesn't allow setting top_k on requests. Media resolution has not been set. Media resolution set to low (64 tokens). Media resolution set to medium (256 tokens). Media resolution set to high (zoomed reframing with 256 tokens). Config for model selection. Required. Feature selection preference. Unspecified feature selection preference. Prefer higher quality over lower cost. Balanced feature selection preference. Prefer lower cost over higher quality. The configuration for routing the request to a specific model. This can be used to control which model is used for the generation, either automatically or by specifying a model name. In this mode, the model is selected automatically based on the content of the request. In this mode, the model is specified manually. The configuration for automated routing. 
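Combined Top-k and nucleus (Top-p) filtering can be sketched directly from the descriptions above: rank tokens by probability, keep at most top_k of them, then keep the smallest prefix whose cumulative probability reaches top_p. A simplified Python illustration (the real decoder then renormalizes and samples from the kept set):

```python
def filter_top_k_top_p(probs, top_k=40, top_p=0.95):
    """Keep the top_k most probable tokens, then the smallest prefix of
    those whose cumulative probability is at least top_p."""
    ranked = sorted(probs.items(), key=lambda kv: kv[1], reverse=True)[:top_k]
    kept, cumulative = [], 0.0
    for token, p in ranked:
        kept.append(token)
        cumulative += p
        if cumulative >= top_p:
            break
    return kept

probs = {"cat": 0.5, "dog": 0.3, "fish": 0.15, "rock": 0.05}
print(filter_top_k_top_p(probs, top_k=3, top_p=0.9))  # ['cat', 'dog', 'fish']
```

Lowering either parameter shrinks the candidate set: with top_p=0.5 only the single most probable token survives, which is why low values produce more deterministic output.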
When automated routing is specified, the routing will be determined by the pretrained routing model and customer provided model routing preference. The model routing preference. Unspecified model routing preference. The model will be selected to prioritize the quality of the response. The model will be selected to balance quality and cost. The model will be selected to prioritize the cost of the request. The configuration for manual routing. When manual routing is specified, the model will be selected based on the model name provided. The name of the model to use. Only public LLM models are accepted. Generic Metadata shared by all operations. Output only. Time when the operation was created. Output only. Partial failures encountered. E.g. single files that couldn't be read. This field should never exceed 20 entries. Status details field will contain standard Google Cloud error details. Output only. Time when the operation was updated for the last time. If the operation has finished (successfully or not), this is the finish time. Contains information about the source of the models generated from Generative AI Studio. Required. The public base model URI. The Google Drive location for the input content. Required. Google Drive resource IDs. The type and ID of the Google Drive resource. Required. The ID of the Google Drive resource. Required. The type of the Google Drive resource. Unspecified resource type. File resource type. Folder resource type. Tool to retrieve public maps data for grounding, powered by Google. The GoogleMaps Tool that provides geospatial context for the user's query. The GoogleMaps Tool that provides geospatial context for the user's query. Optional. Whether to return a widget context token in the GroundingMetadata of the response. Developers can use the widget context token to render a Google Maps widget with geospatial context related to the places that the model references in the response. 
A `Maps` chunk is a piece of evidence that comes from Google Maps. It contains information about a place, such as its name, address, and reviews. This is used to provide the user with rich, location-based information. This data type is not supported in Gemini API. The URI of the place. The title of the place. The text of the place answer. This Place's resource name, in `places/{place_id}` format. This can be used to look up the place in the Google Maps API. A unique identifier for review. The sources that were used to generate the place answer. This includes review snippets and photos that were used to generate the answer, as well as URIs to flag content. The Status type defines a logical error model that is suitable for different programming environments, including REST APIs and RPC APIs. It is used by [gRPC](https://github.com/grpc). Each Status message contains three pieces of data: error code, error message, and error details. You can find out more about this error model and how to work with it in the [API Design Guide](https://cloud.google.com/apis/design/errors). The status code, which should be an enum value of google.rpc.Code. A list of messages that carry the error details. There is a common set of message types for APIs to use. A developer-facing error message, which should be in English. Any user-facing error message should be localized and sent in the google.rpc.Status.details field, or localized by the client. GoogleSearch tool type. Tool to support Google Search in Model. Powered by Google. GoogleSearch tool type. Tool to support Google Search in Model. Powered by Google. Optional. Sites with confidence level chosen & above this value will be blocked from the search results. Optional. List of domains to be excluded from the search results. The default limit is 2000 domains. Optional. The set of search types to enable. If not set, web search is enabled by default. Optional. Filter search results to a specific time range. 
If customers set a start time, they must set an end time (and vice versa). Tool to retrieve public web data for grounding, powered by Google. Tool to retrieve public web data for grounding, powered by Google. Tool to retrieve public web data for grounding, powered by Google. Optional. Disable using the result from this tool in detecting grounding attribution. This does not affect how the result is given to the model for generation. Creates an instance of DynamicRetrievalConfig. Creates an instance of DynamicRetrievalConfig with Mode and DynamicThreshold. The mode of the predictor to be used in dynamic retrieval. The threshold to be used in dynamic retrieval. If not set, a system default value is used. Specifies the dynamic retrieval configuration for the given source. Represents a whole or partial calendar date, such as a birthday. The time of day and time zone are either specified elsewhere or are insignificant. The date is relative to the Gregorian Calendar. This can represent one of the following: * A full date, with non-zero year, month, and day values. * A month and day, with a zero year (for example, an anniversary). * A year on its own, with a zero month and a zero day. * A year and month, with a zero day (for example, a credit card expiration date). Related types: * google.type.TimeOfDay * google.type.DateTime * google.protobuf.Timestamp Day of a month. Must be from 1 to 31 and valid for the year and month, or 0 to specify a year by itself or a year and month where the day isn't significant. Month of a year. Must be from 1 to 12, or 0 to specify a year without a month and day. Year of the date. Must be from 1 to 9999, or 0 to specify a date without a year. The default value. This value is unused. Represents a user. When set, you must provide email_address for the user. Represents a group. When set, you must provide email_address for the group. Represents access to everyone. No extra information is required. Attribution for a source that contributed to an answer. 
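The partial-date rules above (zero fields mean "not significant") translate into a small classifier. An illustrative Python sketch of the valid combinations; the function name is ours, not part of any API:

```python
def classify_partial_date(year: int, month: int, day: int) -> str:
    """Classify a google.type.Date-style partial date, where a zero field
    means that field isn't significant."""
    if not (0 <= year <= 9999 and 0 <= month <= 12 and 0 <= day <= 31):
        raise ValueError("field out of range")
    if day and not month:
        raise ValueError("a day requires a month")
    if year and month and day:
        return "full date"
    if not year and month and day:
        return "anniversary (month and day)"
    if year and month:
        return "year and month"
    if year:
        return "year only"
    raise ValueError("empty date")

print(classify_partial_date(0, 12, 25))   # anniversary (month and day)
print(classify_partial_date(2027, 4, 0))  # year and month
```

A real implementation would also check the day against the month's length (and leap years), which this sketch omits.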
Attribution for a source that contributed to an answer. Grounding source content that makes up this attribution. Output only. Identifier for the source contributing to this attribution. Output only. Start index into the content. Output only. End index into the content. Output only. Part index into the content. A piece of evidence that supports a claim made by the model. This is used to show a citation for a claim made by the model. When grounding is enabled, the model returns a GroundingChunk that contains a reference to the source of the information. A GroundingChunk represents a segment of supporting evidence that grounds the model's response. It can be a chunk from the web, a retrieved context from a file, or information from Google Maps. Optional. Grounding chunk from image search. Optional. Grounding chunk from Google Maps. Optional. Grounding chunk from context retrieved by the file search tool. Grounding chunk from the web. Information about the sources that support the content of a response. When grounding is enabled, the model returns citations for claims in the response. This object contains the retrieved sources. Metadata returned to client when grounding is enabled. Metadata returned to client when grounding is enabled. Optional. The queries that were executed by the retrieval tools. This field is populated only when the grounding source is a retrieval tool, such as Vertex AI Search. Optional. Output only. A list of URIs that can be used to flag a place or review for inappropriate content. This field is populated only when the grounding source is Google Maps. Optional. Resource name of the Google Maps widget context token that can be used with the PlacesContextElement widget in order to render contextual data. Only populated in the case that grounding with Google Maps is enabled. List of supporting references retrieved from specified grounding source. 
When streaming, this only contains the grounding chunks that have not been included in the grounding metadata of previous responses. List of grounding support. Image search queries used for grounding. Metadata related to retrieval in the grounding flow. Optional. Google search entry for the following-up web searches. Web search queries for the following-up web search. A URI that can be used to flag a place or review for inappropriate content. This is populated only when the grounding source is Google Maps. The URI that can be used to flag the content. The ID of the place or review. Passage included inline with a grounding configuration. Content of the passage. Identifier for the passage for attributing this passage in grounded answers. Identifier for a part within a GroundingPassage. Output only. Index of the part within the GenerateAnswerRequest's GroundingPassage.content. Output only. ID of the passage matching the GenerateAnswerRequest's GroundingPassage.id. A repeated list of passages. List of passages. A collection of supporting references for a segment of the model's response. Grounding support. Optional. Confidence score of the support references. Ranges from 0 to 1. 1 is the most confident. This list must have the same size as the grounding_chunk_indices. Optional. A list of indices (into 'grounding_chunk' in response.candidate.grounding_metadata) specifying the citations associated with the claim. For instance [1,3,4] means that grounding_chunk[1], grounding_chunk[3], grounding_chunk[4] are the retrieved content attributed to the claim. If the response is streaming, the grounding_chunk_indices refer to the indices across all responses. It is the client's responsibility to accumulate the grounding chunks from all responses (while maintaining the same order). Segment of the content this support belongs to. The harm block method is unspecified. The harm block method uses both probability and severity scores. The harm block method uses the probability score. 
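As noted above, in streaming mode the client must accumulate grounding chunks across responses (keeping order) so that grounding_chunk_indices resolve against the combined list. A minimal Python sketch of that accumulation; the dict shapes are illustrative stand-ins for the API's JSON:

```python
def resolve_citations(stream_responses):
    """Accumulate grounding chunks across streamed responses, in order,
    and resolve each support's chunk indices against the combined list."""
    all_chunks, resolved = [], []
    for response in stream_responses:
        metadata = response.get("grounding_metadata", {})
        all_chunks.extend(metadata.get("grounding_chunks", []))
        for support in metadata.get("grounding_supports", []):
            resolved.append([all_chunks[i]
                             for i in support["grounding_chunk_indices"]])
    return all_chunks, resolved

stream = [
    {"grounding_metadata": {"grounding_chunks": ["web:a", "web:b"],
                            "grounding_supports": [{"grounding_chunk_indices": [0]}]}},
    {"grounding_metadata": {"grounding_chunks": ["maps:c"],
                            "grounding_supports": [{"grounding_chunk_indices": [1, 2]}]}},
]
chunks, citations = resolve_citations(stream)
print(chunks)     # ['web:a', 'web:b', 'maps:c']
print(citations)  # [['web:a'], ['web:b', 'maps:c']]
```

The second support's index 2 points at a chunk delivered in the same later response, which is why extending the list before resolving supports matters.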
Threshold is unspecified. Content with NEGLIGIBLE will be allowed. Content with NEGLIGIBLE and LOW will be allowed. Content with NEGLIGIBLE, LOW, and MEDIUM will be allowed. All content will be allowed. Turn off the safety filter. Category is unspecified. **PaLM** - Negative or harmful comments targeting identity and/or protected attribute. **PaLM** - Content that is rude, disrespectful, or profane. **PaLM** - Describes scenarios depicting violence against an individual or group, or general descriptions of gore. **PaLM** - Contains references to sexual acts or other lewd content. **PaLM** - Promotes unchecked medical advice. **PaLM** - Dangerous content that promotes, facilitates, or encourages harmful acts. **Gemini** - Harassment content. **Gemini** - Hate speech and content. **Gemini** - Sexually explicit content. **Gemini** - Dangerous content. **Gemini** - Content that may be used to harm civic integrity. DEPRECATED: use enable_enhanced_civic_answers instead. Probability is unspecified. Content has a negligible chance of being unsafe. Content has a low chance of being unsafe. Content has a medium chance of being unsafe. Content has a high chance of being unsafe. The harm severity is unspecified. The harm severity is negligible. The harm severity is low. The harm severity is medium. The harm severity is high. Message that represents an arbitrary HTTP body. It should only be used for payload formats that can't be represented as JSON, such as raw binary or an HTML page. This message can be used both in streaming and non-streaming API methods in the request as well as the response. It can be used as a top-level request field, which is convenient if one wants to extract parameters from either the URL or HTTP template into the request fields and also want access to the raw HTTP body. Example: message GetResourceRequest { // A unique request id. string request_id = 1; // The raw HTTP body is bound to this field. 
google.api.HttpBody http_body = 2; } service ResourceService { rpc GetResource(GetResourceRequest) returns (google.api.HttpBody); rpc UpdateResource(google.api.HttpBody) returns (google.protobuf.Empty); } Example with streaming methods: service CaldavService { rpc GetCalendar(stream google.api.HttpBody) returns (stream google.api.HttpBody); rpc UpdateCalendar(stream google.api.HttpBody) returns (stream google.api.HttpBody); } Use of this type only changes how the request and response bodies are handled, all other features will continue to work unchanged. The HTTP Content-Type header value specifying the content type of the body. The HTTP request/response body as raw binary. Application specific response metadata. Must be set in the first response for streaming APIs. Element is in the HTTP request query. Element is in the HTTP request header. Element is in the HTTP request path. Element is in the HTTP request body. Element is in the HTTP request cookie. HTTP options to be used in each of the requests. Specifies the version of the API to use. The base URL for the AI platform service endpoint. Additional HTTP headers to be sent with the request. Timeout for the request in milliseconds. Hyperparameters controlling the tuning process. Read more at https://ai.google.dev/docs/model_tuning_guidance Hyperparameters controlling the tuning process. Read more at https://ai.google.dev/docs/model_tuning_guidance Optional: The Adapter size to use for the tuning job. The adapter size influences the number of trainable parameters for the tuning job. A larger adapter size implies that the model can learn more complex tasks, but it requires a larger training dataset and longer training times. Immutable. The batch size hyperparameter for tuning. If not set, a default of 4 or 16 will be used based on the number of training examples. Immutable. The number of training epochs. An epoch is one pass through the training data. If not set, a default of 5 will be used. Optional. Immutable. 
The learning rate hyperparameter for tuning. If not set, a default of 0.001 or 0.0002 will be calculated based on the number of training examples. Optional. Immutable. The learning rate multiplier is used to calculate a final learning_rate based on the default (recommended) value. Actual learning rate := learning_rate_multiplier * default learning rate. The default learning rate is dependent on the base model and dataset size. If not set, a default of 1.0 will be used. An image generated by the model. A base64 encoded string of one (generated) image. (20 MB) The IANA standard MIME type of the image. Exists if storageUri is provided. The Cloud Storage URI where the generated images are stored. The image bytes data. Can contain a value for this field or the `GcsUri` field, but not both. The base64-encoded JSON of the generated image. The URL of the generated image, if response_format is url (default). The prompt that was used to generate the image, if there was any revision to the prompt. Configuration for image generation. This message allows you to control various aspects of image generation, such as the output format, aspect ratio, and whether the model can generate images of people. Config for image generation features. Config for image generation features. Optional. The image output format for generated images. Optional. Controls whether the model can generate people. The default behavior is unspecified. The model will decide whether to generate images of people. Allows the model to generate images of people, including adults and children. Allows the model to generate images of adults, but not children. Prevents the model from generating images of people. Optional. The aspect ratio of the image to generate. Supported aspect ratios: 1:1, 2:3, 3:2, 3:4, 4:3, 9:16, 16:9, 21:9. If not specified, the model will choose a default aspect ratio based on any reference images provided. Optional. Specifies the size of generated images. Supported values are `1K`, `2K`, `4K`. 
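The multiplier relationship stated above is a one-liner, but it is worth making the arithmetic explicit since the default learning rate varies by base model and dataset size:

```python
def effective_learning_rate(default_learning_rate: float,
                            learning_rate_multiplier: float = 1.0) -> float:
    """actual learning rate := learning_rate_multiplier * default learning rate.

    The default rate depends on the base model and dataset size; 0.001 is
    used here purely as an example value from the docs above.
    """
    return learning_rate_multiplier * default_learning_rate

print(effective_learning_rate(0.001, 2.0))  # 0.002
```

With the multiplier left at its default of 1.0, the tuning job simply uses the recommended rate unchanged.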
If not specified, the model will use the default value `1K`. MIME type of the generated image. Compression quality of the generated image (for `image/jpeg` only). The image output format for generated images. Optional. The compression quality of the output image. Optional. The image format that the output should be saved as. The number of generated images. Accepted integer values: 1-8 (v.002), 1-4 (v.005, v.006). Default value: 4. The number of generated images. Accepted integer values: 1-8 (v.002), 1-4 (v.005, v.006). Default value: 4. Optional. Cloud Storage URI for where to store the generated images. Optional. Pseudo random seed for reproducible generated outcome; setting the seed lets you generate deterministic output. Version 006 model only: To use the seed field you must also set "addWatermark": false in the list of parameters. Optional. The text prompt for guiding the response. en (default), de, fr, it, es Optional. Description of what to discourage in the generated images. Optional. For model version 006 and greater use editConfig.guidanceScale. Controls how much the model adheres to the text prompt. Large values increase output and prompt alignment, but may compromise image quality. Values: 0-500 - Default: 60 Optional. Whether to disable the person/face safety filter (so that person/face can be included in the generated images). Deprecated (v.006 only): Use personGeneration instead. Optional. With input prompt, image, mask - backgroundEditing mode enables background editing. Values: backgroundEditing upscale Optional. Sample image size when mode is set to upscale. This field is no longer required when upscaling. Use upscaleConfig.upscaleFactor to set the upscaled image size. 2048 or 4096 Optional. The aspect ratio of the generated image. Value: 1:1, 9:16*, 16:9*, 3:4*, or 4:3* Optional. Whether to enable rounded Responsible AI scores for a list of safety attributes in responses for unfiltered input and output.
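The output-options fields above (MIME type plus a JPEG-only compression quality) can be sketched as a small payload builder. This is an assumption-laden illustration: the field names follow the REST-style casing used in these docs, and the function itself is not part of the library.

```python
def make_output_options(mime_type="image/png", compression_quality=None):
    """Illustrative builder for the image output options described above.

    Per the docs, compression quality applies to image/jpeg only;
    the 0-100 range is an assumption for the sketch.
    """
    opts = {"mimeType": mime_type}
    if compression_quality is not None:
        if mime_type != "image/jpeg":
            raise ValueError("compressionQuality is only valid for image/jpeg")
        if not 0 <= compression_quality <= 100:
            raise ValueError("compressionQuality must be between 0 and 100")
        opts["compressionQuality"] = compression_quality
    return opts
```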
Safety attribute categories: "Death, Harm and Tragedy", "Firearms and Weapons", "Hate", "Health", "Illicit Drugs", "Politics", "Porn", "Religion and Belief", "Toxic", "Violence", "Vulgarity", "War and Conflict". Optional. The safety setting that controls the type of people or face generation allowed. "personGeneration": "allow_all" is not available in Imagen 2 Editing and is only available to approved users in Imagen 2 Generation. Values: allow_all: Allow generation of people of all ages. allow_adult (default): Allow generation of adults only. dont_allow: Disables the inclusion of people or faces in images. Optional. The safety setting that controls safety filter thresholds. Values: block_most: The highest threshold resulting in most requests blocked. block_some (default): The medium threshold that balances blocks for potentially harmful and benign content. block_few: Reduces the number of requests blocked due to safety filters. This setting might increase objectionable content generated by Imagen. Defines whether the image will include a SynthID. For more information, see Identifying AI-generated content with SynthID. Edit config object for model versions 006 and greater. All editConfig subfields are optional. If not specified, the default editing mode is inpainting. Optional. Describes the output image format in an object. Whether to use the prompt rewriting logic. Cloud Storage URI used to store the generated images. MIME type of the generated image. Compression quality of the generated image (for `image/jpeg` only). User specified labels to track billing usage. An array that contains the object with image details to get information about. Initializes a new instance of the class. Initializes a new instance of the class. The text prompt guides what images the model generates. The number of generated images. Thrown when the is . Thrown when the is less than 1 or greater than 8. Output only. A list of the generated images. Optional.
The compression quality of the output image. Optional. The image format that the output should be saved as. A list of languages (lowercase). The quality of the image that will be generated. Standard quality. High definition quality. The type of the reference image. Raw reference type Mask reference type Control reference type Subject reference type Style reference type Image search for grounding and related configurations. OpenAI image generation request A text description of the desired image(s). Required. The name of the `Model` to use for generating the completion. The model name will be prefixed by \"models/\" if no slash appears in it. Optional. Amount of candidate completions to generate. Must be a positive integer. Defaults to 1 if not set. The quality of the image that will be generated. hd creates images with finer details and greater consistency across the image. Optional. The format in which the generated images are returned. Must be one of url or b64_json. URLs are only valid for 60 minutes after the image has been generated. The size of the generated images. Must be one of 256x256, 512x512, or 1024x1024 for dall-e-2. Must be one of 1024x1024, 1792x1024, or 1024x1792 for dall-e-3 models. The style of the generated images. Must be one of vivid or natural. Vivid causes the model to lean towards generating hyper-real and dramatic images. Natural causes the model to produce more natural, less hyper-real looking images. A unique identifier representing your end-user, which can help OpenAI to monitor and detect abuse. The style of the generated images. Vivid causes the model to lean towards generating hyper-real and dramatic images. Natural causes the model to produce more natural, less hyper-real looking images. The number of generated images. Accepted integer values: 1-3 Optional. Cloud Storage URI where to store the generated images. Optional. The seed for random number generator (RNG).
If RNG seed is the same for requests with the same inputs, the prediction results will be the same. Optional. The text prompt for guiding the response. en (default), de, fr, it, es Initializes a new instance of the class. Initializes a new instance of the class. The base64 encoded image to process. The question to ask about the image. The number of predictions. Language of predicted text. Defaults to "en". Optional. Cloud Storage URI where to store the generated predictions. Thrown when the is . Thrown when the is less than 1 or greater than 3. Thrown when the is not supported. List of text strings representing captions, sorted by confidence. Request for ImportFile to import a File API file with a FileSearchStore. Optional. Config for telling the service how to chunk the file. If not provided, the service will use default parameters. Custom metadata to be associated with the file. Required. The name of the File to import. Example: files/abc-123 Raw media bytes sent directly in the request. Text should not be sent as raw bytes. Serialized bytes data of the image or video. You can specify at most 1 image with inlineData. To specify up to 16 images, use fileData. The base64 encoding of the image, PDF, or video to include inline in the prompt. When including media inline, you must also specify MIMETYPE. Size limit: 20MB The IANA standard MIME type of the source data. The media type of the image, PDF, or video specified in the data or fileUri fields. Acceptable values include the following: "image/png", "image/jpeg", "image/heic", "image/heif", "image/webp". application/pdf video/mov video/mpeg video/mp4 video/mpg video/avi video/wmv video/mpegps video/flv Maximum video length: 2 minutes. No limit on image resolution. Optional. The display name of the file. Used to provide a label or filename to distinguish files. This field is only returned in PromptMessage for prompt management.
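The constructor guards described above (a required image, and a prediction count that must be between 1 and 3) can be sketched as a request validator. The function and field names are illustrative assumptions, not the library's actual members.

```python
def validate_caption_request(image_b64, sample_count=1):
    """Illustrative check mirroring the constructor guards described above:
    the image is required, and the number of predictions must be 1-3."""
    if image_b64 is None:
        raise ValueError("image is required")            # docs: thrown when null
    if not 1 <= sample_count <= 3:
        raise ValueError("sampleCount must be between 1 and 3")
    # Payload shape is an assumption for the sketch.
    return {"image": {"bytesBase64Encoded": image_b64},
            "parameters": {"sampleCount": sample_count}}
```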
It is used in the Gemini calls only when server side tools (code_execution, google_search, and url_context) are enabled. The request to be processed in the batch. Optional. The metadata to be associated with the request. Required. The request to be processed in the batch. The requests to be processed in the batch if provided as part of the batch creation request. Required. The requests to be processed in the batch. The response to a single request in the batch. Output only. The error encountered while processing the request. Output only. The metadata associated with the request. Output only. The response to the request. The responses to the requests in the batch. Output only. The responses to the requests in the batch. The request to be processed in the batch. Optional. The metadata to be associated with the request. Required. The request to be processed in the batch. The requests to be processed in the batch if provided as part of the batch creation request. Required. The requests to be processed in the batch. The response to a single request in the batch. Output only. The error encountered while processing the request. Output only. The metadata associated with the request. Output only. The response to the request. The responses to the requests in the batch. Output only. The responses to the requests in the batch. Configures the input to the batch request. The name of the File containing the input requests. The requests to be processed in the batch. Configures the input to the batch request. The name of the File containing the input requests. The requests to be processed in the batch. Feedback related to the input data used to answer the question, as opposed to the model-generated response to the question. Optional. If set, the input was blocked and no candidates are returned. Rephrase the input. Ratings for safety of the input. There is at most one rating per category. An instance of an image with additional metadata. Required. The text prompt for the image. 
The text prompt guides what images the model generates. This field is required for both generation and editing. Optional. Input image for editing. Base64 encoded image (20 MB) Optional. Mask image for mask-based editing. Base64 input image with 1s and 0s where 1 indicates regions to keep (PNG) (20 MB) Optional. A list of reference images for the editing operation. An attribution method that computes the Aumann-Shapley value taking advantage of the model's fully differentiable structure. Refer to this paper for more details: https://arxiv.org/abs/1703.01365 Config for IG with blur baseline. When enabled, a linear path from the maximally blurred image to the input image is created. Using a blurred baseline instead of zero (black image) is motivated by the BlurIG approach explained here: https://arxiv.org/abs/2004.03383 Config for SmoothGrad approximation of gradients. When enabled, the gradients are approximated by averaging the gradients from noisy samples in the vicinity of the inputs. Adding noise can help improve the computed gradients. Refer to this paper for more details: https://arxiv.org/pdf/1706.03825.pdf Required. The number of steps for approximating the path integral. A good value to start is 50 and gradually increase until the sum to diff property is within the desired error range. Valid range of its value is [1, 100], inclusively. A convenience property to get the responded text information of first candidate. Represents a time interval, encoded as a Timestamp start (inclusive) and a Timestamp end (exclusive). The start must be less than or equal to the end. When the start equals the end, the interval is empty (matches no time). When both start and end are unspecified, the interval matches any time. Optional. Exclusive end of the interval. If specified, a Timestamp matching this interval will have to be before the end. Optional. Inclusive start of the interval. If specified, a Timestamp matching this interval will have to be the same or after the start. 
The Jira source for the ImportRagFilesRequest. Required. The Jira queries. JiraQueries contains the Jira queries and corresponding authentication. Required. The SecretManager secret version resource name (e.g. projects/{project}/secrets/{secret}/versions/{version}) storing the Jira API key. See [Manage API tokens for your Atlassian account](https://support.atlassian.com/atlassian-account/docs/manage-api-tokens-for-your-atlassian-account/). A list of custom Jira queries to import. For information about JQL (Jira Query Language), see https://support.atlassian.com/jira-service-management-cloud/docs/use-advanced-search-with-jira-query-language-jql/ Required. The Jira email address. A list of Jira projects to import in their entirety. Required. The Jira server URI. Unspecified language. This value should not be used. Python >= 3.10, with numpy and simpy available. Python is the default language. An object that represents a latitude/longitude pair. This is expressed as a pair of doubles to represent degrees latitude and degrees longitude. Unless specified otherwise, this object must conform to the WGS84 standard. Values must be within normalized ranges. The latitude in degrees. It must be in the range [-90.0, +90.0]. The longitude in degrees. It must be in the range [-180.0, +180.0]. Config for optional parameters. Response with a list of CachedContents. Response with CachedContents list. List of cached contents. A token, which can be sent as page_token to retrieve the next page. If this field is omitted, there are no subsequent pages. Response from containing a paginated list of s. The s are sorted by ascending . The returned s. A token, which can be sent as to retrieve the next page. If this field is omitted, there are no more pages. Response message for MetadataService.ListContexts. The Contexts retrieved from the MetadataStore. A token, which can be sent as ListContextsRequest.page_token to retrieve the next page. 
If this field is not populated, there are no subsequent pages. Response from ListCorpora containing a paginated list of Corpora. The results are sorted by ascending corpus.create_time. The returned corpora. A token, which can be sent as page_token to retrieve the next page. If this field is omitted, there are no more pages. Response from ListDocuments containing a paginated list of Documents. The Documents are sorted by ascending document.create_time. The returned Documents. A token, which can be sent as page_token to retrieve the next page. If this field is omitted, there are no more pages. Response from ListFileSearchStores containing a paginated list of FileSearchStores. The results are sorted by ascending file_search_store.create_time. The returned rag_stores. A token, which can be sent as page_token to retrieve the next page. If this field is omitted, there are no more pages. Response for ListFiles. The list of Files. A token that can be sent as a page_token into a subsequent ListFiles call. Response for ListGeneratedFiles. The list of GeneratedFiles. A token that can be sent as a page_token into a subsequent ListGeneratedFiles call. Configuration for retrieving models. If query_base is set to True in the config or not set (default), the API will return all available base models. If set to False, it will return all tuned models. Response message for ModelService.ListModels Response from ListModel containing a paginated list of Models. The returned Models. A token, which can be sent as page_token to retrieve the next page. If this field is omitted, there are no more pages. The response message for Operations.ListOperations. The standard List next-page token. A list of operations that matches the specified filter in the request. Unordered list. Unreachable resources. Populated when the request sets ListOperationsRequest.return_partial_success and reads across collections. For example, when attempting to list all resources across all supported locations. 
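All of the List* responses above share the same pagination contract: send the returned token back as page_token, and stop when the token is omitted. A generic sketch (the `fetch_page` callable is a placeholder for any of the list calls, not a library function):

```python
def list_all(fetch_page):
    """Illustrative pagination: follow next_page_token until it is omitted.

    `fetch_page(token)` stands in for any List* call above; it returns
    (items, next_page_token_or_None).
    """
    items, token = [], None
    while True:
        page, token = fetch_page(token)
        items.extend(page)
        if not token:            # omitted/empty token => no more pages
            return items
```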
Response from ListPermissions containing a paginated list of permissions. A token, which can be sent as page_token to retrieve the next page. If this field is omitted, there are no more pages. Returned permissions. Response message for VertexRagDataService.ListRagCorpora. Response from `ListRagEngineCorpora` containing a paginated list of `RagEngineCorpora`. A token to retrieve the next page of results. Pass to ListRagCorporaRequest.page_token to obtain that page. List of RagCorpora in the requested page. The returned corpora. Response message for VertexRagDataService.ListRagFiles. A token to retrieve the next page of results. Pass to ListRagFilesRequest.page_token to obtain that page. List of RagFiles in the requested page. The list of files. Response from ListTunedModels containing a paginated list of Models. A token, which can be sent as page_token to retrieve the next page. If this field is omitted, there are no more pages. The returned Models. Session config for the API connection. The generation configuration for the session. The requested modalities of the response. Represents the set of modalities that the model can return. Defaults to AUDIO if not specified. The speech generation configuration. The user provided system instructions for the model. Note: only text should be used in parts and content in each part will be in a separate paragraph. A list of `Tools` the model may use to generate the next response. A `Tool` is a piece of code that enables the system to interact with external systems to perform an action, or set of actions, outside of knowledge and scope of the model. Specification for an LLM based metric. Optional. Optional additional configuration for the metric. Optional. Optional configuration for the judge LLM (Autorater). Required. Template for the prompt sent to the judge model. Dynamically generate rubrics using a predefined spec. Dynamically generate rubrics using this specification. 
Use a pre-defined group of rubrics associated with the input. Refers to a key in the rubric_groups map of EvaluationInstance. Optional. System instructions for the judge model. The log probabilities of the tokens generated by the model. This is useful for understanding the model's confidence in its predictions and for debugging. For example, you can use log probabilities to identify when the model is making a less confident prediction or to explore alternative responses that the model considered. A low log probability can also indicate that the model is "hallucinating" or generating factually incorrect information. Logprobs Result. Length = total number of decoding steps. The chosen candidates may or may not be in top_candidates. Sum of log probabilities for all tokens. Length = total number of decoding steps. A single token and its associated log probability. Candidate for the logprobs token and score. The candidate's log probability. The candidate's token string value. The candidate's token id value. A list of the top candidate tokens and their log probabilities at each decoding step. This can be used to see what other tokens the model considered. The list of candidate tokens, sorted by log probability in descending order. The configuration for manual routing. When manual routing is specified, the model will be selected based on the model name provided. This data type is not supported in Gemini API. The name of the model to use. Only public LLM models are accepted. A grounding chunk from Google Maps. A Maps chunk corresponds to a single place. Sources that provide answers about the features of a given place in Google Maps. The ID of the place, in places/{place_id} format. A user can use this ID to look up that place. Text description of the place answer. Title of the place. URI reference of the place. Configuration for a Mask reference image. Prompts the model to generate a mask instead of you needing to provide one (unless MASK_MODE_USER_PROVIDED is used).
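The LogprobsResult fields above store one chosen-candidate log probability per decoding step; summing them gives the log probability of the whole sequence. A minimal sketch of that arithmetic (the function name is illustrative, not part of the library):

```python
import math

def sequence_log_probability(chosen_logprobs):
    """Illustrative: total sequence log probability and the corresponding
    probability, from one log probability per decoding step."""
    total = sum(chosen_logprobs)
    return total, math.exp(total)
```

A sequence of two tokens each with probability 0.5 therefore has total probability 0.25, which is the kind of confidence signal the docs above describe.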
A list of up to 5 class ids to use for semantic segmentation. Automatically creates an image mask based on specific objects. Dilation percentage of the mask provided. Float between 0 and 1. Optional. Prompts the model to generate a mask instead of you needing to provide one. Consequently, when you provide this parameter you can omit a mask object. Values: background: Automatically generates a mask to all regions except primary object, person, or subject in the image foreground: Automatically generates a mask to the primary object, person, or subject in the image semantic: Use automatic segmentation to create a mask area for one or more of the segmentation classes. Set the segmentation classes using the classes parameter and the corresponding class_id values. You can specify up to 5 classes. Optional. Determines the classes of objects that will be segmented in an automatically generated mask image. If you use this field, you must also set "maskType": "semantic". See Segmentation class IDs The mask mode of a reference image Default mask mode. Automatically generate a mask using background segmentation. Automatically generate a mask using foreground segmentation. Automatically generate a mask using semantic segmentation, and the given mask class. The reference image is a mask image. An MCPServer is a server that can be called by the model to perform actions. It is a server that implements the MCP protocol. An MCPServer is a server that can be called by the model to perform actions. It is a server that implements the MCP protocol. The name of the MCPServer. A transport that can stream HTTP requests and responses. A reference to data stored on the filesystem, on GFS or in blobstore. Original file name. Media data, set if reference_type is INLINE A composite media composed of one or more media objects, set if reference_type is COMPOSITE_MEDIA. The media length field must be set to the sum of the lengths of all composite media objects.
Note: All composite media must have length specified. Parameters for a media download. A unique fingerprint/version id for the media data. Extended content type information provided for Scotty uploads. Scotty-provided SHA1 hash for an upload. Scotty-provided SHA256 hash for an upload. Scotty-provided MD5 hash for an upload. For Scotty uploads only. If a user sends a hash code and the backend has requested that Scotty verify the upload against the client hash, Scotty will perform the check on behalf of the backend and will reject it if the hashes don't match. This is set to true if Scotty performed this verification. MIME type of the data. Set if reference_type is DIFF_UPLOAD_REQUEST. Set if reference_type is DIFF_UPLOAD_RESPONSE. Set if reference_type is DIFF_CHECKSUMS_RESPONSE. Set if reference_type is DIFF_VERSION_RESPONSE. Set if reference_type is DIFF_DOWNLOAD_RESPONSE. Deprecated, use one of explicit hash type fields instead. Algorithm used for calculating the hash. As of 2011/01/21, \"MD5\" is the only possible value for this field. New values may be added at any time. Describes what the field reference contains. Use object_id instead. Time at which the media data was last updated, in milliseconds since UNIX epoch Path to the data, set if reference_type is PATH Blobstore v2 info, set if reference_type is BLOBSTORE_REF, and it refers to a v2 blob. Deprecated, use one of explicit hash type fields instead. These two hash related fields will only be populated on Scotty based media uploads and will contain the content of the hash group in the NotificationRequest: Hex encoded hash value of the uploaded media. Blobstore v1 reference, set if reference_type is BLOBSTORE_REF This should be the byte representation of a blobstore.BlobRef. Since Blobstore is deprecating v1, use blobstore2_info instead. For now, any v2 blob will also be represented in this field as v1 BlobRef. Size of the data, in bytes Reference to a TI Blob, set if reference_type is BIGSTORE_REF. 
|is_potential_retry| is set false only when Scotty is certain that it has not sent the request before. When a client resumes an upload, this field must be set true in agent calls, because Scotty cannot be certain that it has never sent the request before due to potential failure in the session state persistence. For Scotty Uploads: Scotty-provided hashes for uploads For Scotty Downloads: (WARNING: DO NOT USE WITHOUT PERMISSION FROM THE SCOTTY TEAM.) A Hash provided by the agent to be used to verify the data being downloaded. Currently only supported for inline payloads. Further, only crc32c_hash is currently supported. Media id to forward to the operation GetMedia. Can be set if reference_type is GET_MEDIA. A binary data reference for a media download. Serves as a technology-agnostic binary reference in some Google infrastructure. This value is a serialized storage_cosmo.BinaryReference proto. Storing it as bytes is a hack to get around the fact that the cosmo proto (as well as others it includes) doesn't support JavaScript. This prevents us from including the actual type of this field. Media resolution for the input media. Media resolution has not been set. Media resolution set to low. Media resolution set to medium. Media resolution set to high. Media resolution set to ultra high. The base unit of structured text. A Message includes an author and the content of the Message. The author is used to tag messages when they are fed to the model as text. Optional. The author of this Message. This serves as a key for tagging the content of this Message when it is fed to the model as text. The author can be any alphanumeric string. Output only. Citation information for model-generated content in this Message. If this Message was generated as output from the model, this field may be populated with attribution information for any text included in the content. This field is used only on output. Required. The text content of the structured Message. 
All of the structured input text passed to the model as a prompt. A MessagePrompt contains a structured set of fields that provide context for the conversation, examples of user input/model output message pairs that prime the model to respond in different ways, and the conversation history or list of messages representing the alternating turns of the conversation between the user and the model. Optional. Text that should be provided to the model first to ground the response. If not empty, this context will be given to the model first before the examples and messages. When using a context be sure to provide it with every request to maintain continuity. This field can be a description of your prompt to the model to help provide context and guide the responses. Examples: "Translate the phrase from English to French." or "Given a statement, classify the sentiment as happy, sad or neutral." Anything included in this field will take precedence over message history if the total input size exceeds the model's input_token_limit and the input request is truncated. Optional. Examples of what the model should generate. This includes both user input and the response that the model should emulate. These examples are treated identically to conversation messages except that they take precedence over the history in messages: If the total input size exceeds the model's input_token_limit the input will be truncated. Items will be dropped from messages before examples. Required. A snapshot of the recent conversation history sorted chronologically. Turns alternate between two authors. If the total input size exceeds the model's input_token_limit the input will be truncated: The oldest items will be dropped from messages. Metadata for a chunk. Optional. Attributes attached to the data. The keys have semantic conventions and the consumers of the attributes should know how to deserialize the value bytes based on the keys. 
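The truncation order described above (context always kept, examples take precedence over history, and the oldest messages are dropped first) can be sketched as a budget calculation. This is an illustrative model of the documented behavior, not the service's actual algorithm; the per-item token counts are a simplification.

```python
def truncate_prompt(context_tokens, example_tokens, message_tokens, limit):
    """Illustrative truncation per the MessagePrompt docs above: context is
    always kept, examples are kept before messages, and within messages the
    newest items survive (oldest are dropped first).

    `example_tokens` and `message_tokens` are per-item token counts, oldest
    first. Returns (kept_example_count, kept_message_count).
    """
    budget = limit - context_tokens
    kept_ex = kept_msgs = 0
    for t in example_tokens:              # examples take precedence
        if t <= budget:
            kept_ex += 1
            budget -= t
    for t in reversed(message_tokens):    # newest messages first
        if t > budget:
            break                         # everything older is dropped
        kept_msgs += 1
        budget -= t
    return kept_ex, kept_msgs
```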
User provided filter to limit retrieval based on Chunk or Document level metadata values. Example (genre = drama OR genre = action): key = "document.custom_metadata.genre" conditions = [{string_value = "drama", operation = EQUAL}, {string_value = "action", operation = EQUAL}] Required. The Conditions for the given key that will trigger this filter. Multiple Conditions are joined by logical ORs. Required. The key of the metadata to filter on. The metric used for running evaluations. Optional. The aggregation metrics to use. Spec for bleu metric. Spec for a computation based metric. Spec for Custom Code Execution metric. Spec for exact match metric. Spec for an LLM based metric. Spec for pairwise metric. Spec for pointwise metric. The spec for a pre-defined metric. Spec for rouge metric. Unspecified aggregation metric. Average aggregation metric. Not supported for Pairwise metric. Mode aggregation metric. Standard deviation aggregation metric. Not supported for pairwise metric. Variance aggregation metric. Not supported for pairwise metric. Minimum aggregation metric. Not supported for pairwise metric. Maximum aggregation metric. Not supported for pairwise metric. Median aggregation metric. Not supported for pairwise metric. 90th percentile aggregation metric. Not supported for pairwise metric. 95th percentile aggregation metric. Not supported for pairwise metric. 99th percentile aggregation metric. Not supported for pairwise metric. Unspecified modality. Plain text. Image. Video. Audio. Document, e.g. PDF. TABULAR modality Represents token counting info for a single modality. Represents token counting info for a single modality. The modality associated with this token count. Number of tokens. Unspecified modality. Plain text. Image. Video. Audio. Document, e.g. PDF. Always trigger retrieval. Run retrieval only when system decides it is necessary. Default model behavior, model decides to predict either a function call or a natural language response. 
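The filter semantics above (conditions under a single key are joined by logical OR; separate filters are ANDed) can be sketched directly, using the genre example from the docs. Only the EQUAL operation is shown; the dict shapes are assumptions for the sketch.

```python
def matches(metadata, filters):
    """Illustrative evaluation of the metadata filters described above:
    conditions within one filter are ORed, filters themselves are ANDed.

    `filters` is a list of {"key": ..., "conditions": [{"value": ..., "op": ...}]}.
    """
    for f in filters:
        actual = metadata.get(f["key"])
        # At least one condition for this key must hold (logical OR).
        if not any(c["op"] == "EQUAL" and actual == c["value"]
                   for c in f["conditions"]):
            return False
    return True
```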
Model is constrained to always predicting a function call only. If "allowed_function_names" are set, the predicted function call will be limited to any one of "allowed_function_names", else the predicted function call will be any one of the provided "function_declarations". Model will not predict any function call. Model behavior is same as when not passing any function declarations. Model decides to predict either a function call or a natural language response, but will validate function calls with constrained decoding. If "allowed_function_names" are set, the predicted function call will be limited to any one of "allowed_function_names", else the predicted function call will be any one of the provided "function_declarations". Configuration for Model Armor. Model Armor is a Google Cloud service that provides safety and security filtering for prompts and responses. It helps protect your AI applications from risks such as harmful content, sensitive data leakage, and prompt injection attacks. Optional. The resource name of the Model Armor template to use for prompt screening. A Model Armor template is a set of customized filters and thresholds that define how Model Armor screens content. If specified, Model Armor will use this template to check the user's prompt for safety and security risks before it is sent to the model. The name must be in the format projects/{project}/locations/{location}/templates/{template}. Optional. The resource name of the Model Armor template to use for response screening. A Model Armor template is a set of customized filters and thresholds that define how Model Armor screens content. If specified, Model Armor will use this template to check the model's response for safety and security risks before it is returned to the user. The name must be in the format projects/{project}/locations/{location}/templates/{template}. User input field to specify the base model source. 
Currently it only supports specifying the Model Garden models and Genie models. Information about the base model of Genie models. Source information of Model Garden models. Specification of a container for serving predictions. Some fields in this message correspond to fields in the [Kubernetes Container v1 core specification](https://kubernetes.io/docs/reference/generated/kubernetes-api/v1.23/#container-v1-core). Immutable. Specifies arguments for the command that runs when the container starts. This overrides the container's [CMD](https://docs.docker.com/engine/reference/builder/#cmd). Specify this field as an array of executable and arguments, similar to a Docker CMD's "default parameters" form. If you don't specify this field but do specify the command field, then the command from the command field runs without any additional arguments. See the [Kubernetes documentation about how the command and args fields interact with a container's ENTRYPOINT and CMD](https://kubernetes.io/docs/tasks/inject-data-application/define-command-argument-container/#notes). If you don't specify this field and don't specify the command field, then the container's [ENTRYPOINT](https://docs.docker.com/engine/reference/builder/#cmd) and CMD determine what runs based on their default behavior. See the Docker documentation about [how CMD and ENTRYPOINT interact](https://docs.docker.com/engine/reference/builder/#understand-how-cmd-and-entrypoint-interact). In this field, you can reference [environment variables set by Vertex AI](https://cloud.google.com/vertex-ai/docs/predictions/custom-container-requirements#aip-variables) and environment variables set in the env field. You cannot reference environment variables set in the Docker image. In order for environment variables to be expanded, reference them by using the following syntax: $(VARIABLE_NAME). Note that this differs from Bash variable expansion, which does not use parentheses.
If a variable cannot be resolved, the reference in the input string is used unchanged. To avoid variable expansion, you can escape this syntax with $$; for example: $$(VARIABLE_NAME). This field corresponds to the args field of the Kubernetes Containers [v1 core API](https://kubernetes.io/docs/reference/generated/kubernetes-api/v1.23/#container-v1-core). Immutable. Specifies the command that runs when the container starts. This overrides the container's [ENTRYPOINT](https://docs.docker.com/engine/reference/builder/#entrypoint). Specify this field as an array of executable and arguments, similar to a Docker ENTRYPOINT's "exec" form, not its "shell" form. If you do not specify this field, then the container's ENTRYPOINT runs, in conjunction with the args field or the container's [CMD](https://docs.docker.com/engine/reference/builder/#cmd), if either exists. If this field is not specified and the container does not have an ENTRYPOINT, then refer to the Docker documentation about [how CMD and ENTRYPOINT interact](https://docs.docker.com/engine/reference/builder/#understand-how-cmd-and-entrypoint-interact). If you specify this field, then you can also specify the args field to provide additional arguments for this command. However, if you specify this field, then the container's CMD is ignored. See the [Kubernetes documentation about how the command and args fields interact with a container's ENTRYPOINT and CMD](https://kubernetes.io/docs/tasks/inject-data-application/define-command-argument-container/#notes). In this field, you can reference [environment variables set by Vertex AI](https://cloud.google.com/vertex-ai/docs/predictions/custom-container-requirements#aip-variables) and environment variables set in the env field. You cannot reference environment variables set in the Docker image.
In order for environment variables to be expanded, reference them by using the following syntax: $(VARIABLE_NAME). Note that this differs from Bash variable expansion, which does not use parentheses. If a variable cannot be resolved, the reference in the input string is used unchanged. To avoid variable expansion, you can escape this syntax with $$; for example: $$(VARIABLE_NAME). This field corresponds to the command field of the Kubernetes Containers [v1 core API](https://kubernetes.io/docs/reference/generated/kubernetes-api/v1.23/#container-v1-core). Immutable. Deployment timeout. Limit for deployment timeout is 2 hours. Immutable. List of environment variables to set in the container. After the container starts running, code running in the container can read these environment variables. Additionally, the command and args fields can reference these variables. Later entries in this list can also reference earlier entries. For example, the following sets the variable VAR_2 to have the value foo bar:

```json
[
  { "name": "VAR_1", "value": "foo" },
  { "name": "VAR_2", "value": "$(VAR_1) bar" }
]
```

If you switch the order of the variables in the example, then the expansion does not occur. This field corresponds to the `env` field of the Kubernetes Containers [v1 core API](https://kubernetes.io/docs/reference/generated/kubernetes-api/v1.23/#container-v1-core). Immutable. List of ports to expose from the container. Vertex AI sends gRPC prediction requests that it receives to the first port on this list. Vertex AI also sends liveness and health checks to this port. If you do not specify this field, gRPC requests to the container will be disabled. Vertex AI does not use ports other than the first one listed. This field corresponds to the ports field of the Kubernetes Containers v1 core API. Immutable. Specification for Kubernetes readiness probe. Immutable. HTTP path on the container to send health checks to.
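The expansion and escaping rules just described can be sketched in Python; the `expand_vars` name and the regular expression are illustrative, not part of any SDK. Unresolved references are left unchanged, and `$$` escapes expansion:

```python
import re

def expand_vars(value: str, env: dict) -> str:
    """Mimic the expansion rules described above:
    - $(NAME) is replaced by its value from env, if defined;
    - an unresolved reference is left in the string unchanged;
    - $$(NAME) escapes expansion and yields the literal $(NAME)."""
    def replace(match):
        if match.group(0).startswith("$$"):
            return match.group(0)[1:]  # drop one '$'; no expansion happens
        # Fall back to the unchanged reference when the name is unresolved.
        return env.get(match.group(1), match.group(0))
    return re.sub(r"\${1,2}\((\w+)\)", replace, value)
```

For example, with `env = {"VAR_1": "foo"}`, the value `"$(VAR_1) bar"` expands to `"foo bar"`, matching the VAR_2 example above.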
Vertex AI intermittently sends GET requests to this path on the container's IP address and port to check that the container is healthy. Read more about [health checks](https://cloud.google.com/vertex-ai/docs/predictions/custom-container-requirements#health). For example, if you set this field to /bar, then Vertex AI intermittently sends a GET request to the /bar path on the port of your container specified by the first value of this ModelContainerSpec's ports field. If you don't specify this field, it defaults to the following value when you deploy this Model to an Endpoint: /v1/endpoints/ENDPOINT/deployedModels/DEPLOYED_MODEL:predict. The placeholders in this value are replaced as follows: * ENDPOINT: The last segment (following endpoints/) of the Endpoint.name field of the Endpoint where this Model has been deployed. (Vertex AI makes this value available to your container code as the [AIP_ENDPOINT_ID environment variable](https://cloud.google.com/vertex-ai/docs/predictions/custom-container-requirements#aip-variables).) * DEPLOYED_MODEL: DeployedModel.id of the DeployedModel. (Vertex AI makes this value available to your container code as the [AIP_DEPLOYED_MODEL_ID environment variable](https://cloud.google.com/vertex-ai/docs/predictions/custom-container-requirements#aip-variables).) Required. Immutable. URI of the Docker image to be used as the custom container for serving predictions. This URI must identify an image in Artifact Registry or Container Registry. Learn more about the [container publishing requirements](https://cloud.google.com/vertex-ai/docs/predictions/custom-container-requirements#publishing), including permissions requirements for the Vertex AI Service Agent. The container image is ingested upon ModelService.UploadModel, stored internally, and this original path is afterwards not used.
To learn about the requirements for the Docker image itself, see [Custom container requirements](https://cloud.google.com/vertex-ai/docs/predictions/custom-container-requirements#). You can use the URI of one of Vertex AI's [pre-built container images for prediction](https://cloud.google.com/vertex-ai/docs/predictions/pre-built-containers) in this field. Immutable. Invoke route prefix for the custom container. "/*" is the only supported value right now. By setting this field, any non-root route on this model will be accessible via an invoke HTTP call, e.g. "/invoke/foo/bar"; however, the [PredictionService.Invoke] RPC is not supported yet. Only one of predict_route or invoke_route_prefix can be set, and we default to using predict_route if this field is not set. If this field is set, the Model can only be deployed to a dedicated endpoint. Immutable. Specification for Kubernetes liveness probe. Immutable. List of ports to expose from the container. Vertex AI sends any prediction requests that it receives to the first port on this list. Vertex AI also sends [liveness and health checks](https://cloud.google.com/vertex-ai/docs/predictions/custom-container-requirements#liveness) to this port. If you do not specify this field, it defaults to the following value:

```json
[
  { "containerPort": 8080 }
]
```

Vertex AI does not use ports other than the first one listed. This field corresponds to the `ports` field of the Kubernetes Containers [v1 core API](https://kubernetes.io/docs/reference/generated/kubernetes-api/v1.23/#container-v1-core). Immutable. HTTP path on the container to send prediction requests to. Vertex AI forwards requests sent using projects.locations.endpoints.predict to this path on the container's IP address and port. Vertex AI then returns the container's response in the API response.
For example, if you set this field to /foo, then when Vertex AI receives a prediction request, it forwards the request body in a POST request to the /foo path on the port of your container specified by the first value of this ModelContainerSpec's ports field. If you don't specify this field, it defaults to the following value when you deploy this Model to an Endpoint: /v1/endpoints/ENDPOINT/deployedModels/DEPLOYED_MODEL:predict. The placeholders in this value are replaced as follows: * ENDPOINT: The last segment (following endpoints/) of the Endpoint.name field of the Endpoint where this Model has been deployed. (Vertex AI makes this value available to your container code as the [AIP_ENDPOINT_ID environment variable](https://cloud.google.com/vertex-ai/docs/predictions/custom-container-requirements#aip-variables).) * DEPLOYED_MODEL: DeployedModel.id of the DeployedModel. (Vertex AI makes this value available to your container code as the [AIP_DEPLOYED_MODEL_ID environment variable](https://cloud.google.com/vertex-ai/docs/predictions/custom-container-requirements#aip-variables).) Immutable. The amount of the VM memory to reserve as the shared memory for the model in megabytes. Immutable. Specification for Kubernetes startup probe. Represents export format supported by the Model. All formats export to Google Cloud Storage. Output only. The content of this Model that may be exported. Output only. The ID of the export format. The possible format IDs are: * tflite Used for Android mobile devices. * edgetpu-tflite Used for [Edge TPU](https://cloud.google.com/edge-tpu/) devices. * tf-saved-model A TensorFlow model in SavedModel format. * tf-js A [TensorFlow.js](https://www.tensorflow.org/js) model that can be used in the browser and in Node.js using JavaScript. * core-ml Used for iOS mobile devices. * custom-trained A Model that was uploaded or trained by custom code. * genie A tuned Model Garden model. Should not be used. Model artifact and any of its supported files.
Will be exported to the location specified by the `artifactDestination` field of the ExportModelRequest.output_config object. The container image that is to be used when deploying this Model. Will be exported to the location specified by the `imageDestination` field of the ExportModelRequest.output_config object. Contains information about the source of the models generated from Model Garden. Required. The model garden source model resource name. Optional. Whether to avoid pulling the model from the HF cache. Optional. The model garden source model version ID. Contains information about the original Model if this Model is a copy. Output only. The resource name of the Model this Model is a copy of, including the revision. Format: projects/{project}/locations/{location}/models/{model_id}@{version_id} A trained machine learning Model. Information about a Generative Language Model. Ref: https://ai.google.dev/api/rest/v1beta/models Information about a Generative Language Model. Immutable. The path to the directory containing the Model artifact and any of its supporting files. Not required for AutoML Models. Optional. User input field to specify the base model source. Currently it only supports specifying the Model Garden models and Genie models. Optional. Output only. The checkpoints of the model. Input only. The specification of the container that is to be used when deploying this Model. The specification is ingested upon ModelService.UploadModel, and all binaries it contains are copied and stored internally by Vertex AI. Not required for AutoML Models. Output only. Timestamp when this Model was uploaded into Vertex AI. The default checkpoint id of a model version. Output only. The pointers to DeployedModels created from this Model. Note that Model could have been deployed to Endpoints in different Locations. Customer-managed encryption key spec for a Model. If set, this Model and all sub-resources of this Model will be secured by this key.
The default explanation specification for this Model. The Model can be used for requesting explanation after being deployed if it is populated. The Model can be used for batch explanation if it is populated. All fields of the explanation_spec can be overridden by explanation_spec of DeployModelRequest.deployed_model, or explanation_spec of BatchPredictionJob. If the default explanation specification is not set for this Model, this Model can still be used for requesting explanation by setting explanation_spec of DeployModelRequest.deployed_model and for batch explanation by setting explanation_spec of BatchPredictionJob. Immutable. Additional information about the Model; the schema of the metadata can be found in metadata_schema. Unset if the Model does not have any additional information. Output only. The resource name of the Artifact that was created in MetadataStore when creating the Model. The Artifact resource name pattern is projects/{project}/locations/{location}/metadataStores/{metadata_store}/artifacts/{artifact}. Immutable. Points to a YAML file stored on Google Cloud Storage describing additional information about the Model that is specific to it. Unset if the Model does not have any additional information. The schema is defined as an OpenAPI 3.0.2 [Schema Object](https://github.com/OAI/OpenAPI-Specification/blob/main/versions/3.0.2.md#schemaObject). AutoML Models always have this field populated by Vertex AI; if no additional metadata is needed, this field is set to an empty string. Note: The URI given on output will be immutable and probably different, including the URI scheme, than the one given on input. The output URI will point to a location where the user only has read access. Output only. Source of a model. It can either be automl training pipeline, custom training pipeline, BigQuery ML, or saved and tuned from Genie or Model Garden. Output only. If this Model is a copy of another Model, this contains info about the original.
The schemata that describe formats of the Model's predictions and explanations as given and returned via PredictionService.Predict and PredictionService.Explain. Output only. Reserved for future use. Output only. Reserved for future use. Output only. When this Model is deployed, its prediction resources are described by the prediction_resources field of the Endpoint.deployed_models object. Because not all Models support all resource configuration types, the configuration types this Model supports are listed here. If no configuration types are listed, the Model cannot be deployed to an Endpoint and does not support online predictions (PredictionService.Predict or PredictionService.Explain). Such a Model can serve predictions by using a BatchPredictionJob, if it has at least one entry each in supported_input_storage_formats and supported_output_storage_formats. Output only. The formats in which this Model may be exported. If empty, this Model is not available for export. Output only. The formats this Model supports in BatchPredictionJob.input_config. If PredictSchemata.instance_schema_uri exists, the instances should be given as per that schema. The possible formats are: * jsonl The JSON Lines format, where each instance is a single line. Uses GcsSource. * csv The CSV format, where each instance is a single comma-separated line. The first line in the file is the header, containing comma-separated field names. Uses GcsSource. * tf-record The TFRecord format, where each instance is a single record in tfrecord syntax. Uses GcsSource. * tf-record-gzip Similar to tf-record, but the file is gzipped. Uses GcsSource. * bigquery Each instance is a single row in BigQuery. Uses BigQuerySource. * file-list Each line of the file is the location of an instance to process, uses gcs_source field of the InputConfig object. If this Model doesn't support any of these formats it means it cannot be used with a BatchPredictionJob. 
However, if it has supported_deployment_resources_types, it could serve online predictions by using PredictionService.Predict or PredictionService.Explain. Output only. The formats this Model supports in BatchPredictionJob.output_config. If both PredictSchemata.instance_schema_uri and PredictSchemata.prediction_schema_uri exist, the predictions are returned together with their instances. In other words, the prediction has the original instance data first, followed by the actual prediction content (as per the schema). The possible formats are: * jsonl The JSON Lines format, where each prediction is a single line. Uses GcsDestination. * csv The CSV format, where each prediction is a single comma-separated line. The first line in the file is the header, containing comma-separated field names. Uses GcsDestination. * bigquery Each prediction is a single row in a BigQuery table, uses BigQueryDestination. If this Model doesn't support any of these formats it means it cannot be used with a BatchPredictionJob. However, if it has supported_deployment_resources_types, it could serve online predictions by using PredictionService.Predict or PredictionService.Explain. Output only. The resource name of the TrainingPipeline that uploaded this Model, if any. Output only. Timestamp when this Model was most recently updated. User provided version aliases so that a model version can be referenced via alias (i.e. projects/{project}/locations/{location}/models/{model_id}@{version_alias}) instead of the auto-generated version id (i.e. projects/{project}/locations/{location}/models/{model_id}@{version_id}). The format is [a-z][a-z0-9-]{0,126}[a-z0-9] to distinguish from version_id. A default version alias will be created for the first version of the model, and there must be exactly one default version alias for a model. Output only. Timestamp when this version was created. The description of this version. Output only. Timestamp when this version was most recently updated. Should not be used.
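The jsonl input format described above can be illustrated with a short Python sketch; the instance fields shown are hypothetical, since the actual per-line schema is defined by the model's PredictSchemata.instance_schema_uri:

```python
import json

# Hypothetical instances; the fields each line must contain are
# defined by the model's PredictSchemata.instance_schema_uri.
instances = [
    {"content": "What is the capital of France?"},
    {"content": "Summarize this paragraph."},
]

# jsonl: one JSON-encoded instance per line.
with open("instances.jsonl", "w", encoding="utf-8") as f:
    for instance in instances:
        f.write(json.dumps(instance) + "\n")
```

The resulting file can be placed in Cloud Storage and referenced through GcsSource when creating a BatchPredictionJob.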
Resources that are dedicated to the DeployedModel, and that need a higher degree of manual configuration. Resources that are to a large degree decided by Vertex AI, and require only a modest additional configuration. Resources that can be shared by multiple DeployedModels. A pre-configured DeploymentResourcePool is required. The version Id of the model (Vertex AI). Output only. The state of the tuned model. Required. The tuning task that creates the tuned model. Optional. TunedModel to use as the starting point for training the new model. The name of the base model, pass this to the generation request. The ETag of the item. Optional. The labels with user-defined metadata for the request. It is used for billing and reporting only. Label keys and values can be no longer than 63 characters (Unicode codepoints) and can only contain lowercase letters, numeric characters, underscores, and dashes. International characters are allowed. Label values are optional. Label keys must start with a letter. Required. The name of the base model, pass this to the generation request. Examples: * gemini-1.5-flash A short description of the model. The human-readable name of the model. E.g. "Gemini 1.5 Flash". The name can be up to 128 characters long and can consist of any UTF-8 characters. Maximum number of input tokens allowed for this model. The maximum temperature this model can use. Required. The resource name of the Model. Refer to [Model variants](https://ai.google.dev/gemini-api/docs/models/gemini#model-variations) for all allowed values. Format: models/{model} with a {model} naming convention of: * "{base_model_id}-{version}" Examples: * models/gemini-1.5-flash-001 Maximum number of output tokens available for this model. The model's supported generation methods. The corresponding API method names are defined as Pascal case strings, such as generateMessage and generateContent. Controls the randomness of the output. Values can range over [0.0,max_temperature], inclusive.
A higher value will produce responses that are more varied, while a value closer to 0.0 will typically result in less surprising responses from the model. This value specifies the default to be used by the backend while making the call to the model. Whether the model supports thinking. For Top-k sampling. Top-k sampling considers the set of top_k most probable tokens. This value specifies the default to be used by the backend while making the call to the model. If empty, indicates the model doesn't use top-k sampling, and top_k isn't allowed as a generation parameter. For [Nucleus sampling](https://ai.google.dev/gemini-api/docs/prompting-strategies#top-p). Nucleus sampling considers the smallest set of tokens whose probability sum is at least top_p. This value specifies the default to be used by the backend while making the call to the model. Required. The version number of the model. This represents the major version (1.0 or 1.5). Unspecified model routing preference. The model will be selected to prioritize the quality of the response. The model will be selected to balance quality and cost. The model will be selected to prioritize the cost of the request. Detailed description of the source information of the model. If this Model is a copy of another Model. If true then source_type pertains to the original. Type of the model source. Should not be used. The Model is uploaded by automl training pipeline. The Model is uploaded by user or custom training pipeline. The Model is registered and synced from BigQuery ML. The Model is saved or tuned from Model Garden. The Model is saved or tuned from Genie. The Model is uploaded by text embedding finetuning pipeline. The Model is saved or tuned from Marketplace. The status of the underlying model. This is used to indicate the stage of the underlying model and the retirement time if applicable. A message explaining the model status. The stage of the underlying model. The time at which the model will be retired. Unspecified model stage.
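A minimal sketch of how the top_k and top_p (nucleus) controls described above restrict a token distribution; the function name and the plain-list representation are illustrative, not how any backend actually implements sampling:

```python
import math

def filter_distribution(logits, top_k=None, top_p=None):
    """Apply top-k, then nucleus (top-p) filtering to a list of logits and
    return the renormalised probabilities of the surviving token indices."""
    # Softmax over the raw logits.
    peak = max(logits)
    exps = [math.exp(l - peak) for l in logits]
    total = sum(exps)
    probs = [e / total for e in exps]
    # Consider tokens in order of decreasing probability.
    order = sorted(range(len(probs)), key=lambda i: probs[i], reverse=True)
    if top_k is not None:
        order = order[:top_k]  # keep only the top_k most probable tokens
    if top_p is not None:
        kept, cumulative = [], 0.0
        for i in order:  # smallest set whose probability sum is at least top_p
            kept.append(i)
            cumulative += probs[i]
            if cumulative >= top_p:
                break
        order = kept
    mass = sum(probs[i] for i in order)
    return {i: probs[i] / mass for i in order}
```

With logits [2.0, 1.0, 0.0], top_p=0.5 keeps only the most probable token, since its probability (about 0.67) already reaches the threshold.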
The underlying model is still subject to heavy tuning. Models in this stage are for experimental purposes only. Models in this stage are more mature than experimental models. Models in this stage are considered stable and ready for production use. If the model is in this stage, it means that this model is on the path to deprecation in the near future. Only existing customers can use this model. Models in this stage are deprecated. These models cannot be used. Models in this stage are retired. These models cannot be used. Configuration for a multi-speaker text-to-speech request. The configuration for the multi-speaker setup. Required. All the enabled speaker voices. This is a copy of the tech.blob.ObjectId proto, which could not be used directly here due to transitive closure issues with JavaScript support; see http://b/8801763. The name of the object. The name of the bucket to which this object belongs. Generation of the object. Generations are monotonically increasing across writes, allowing them to be compared to determine which generation is newer. If this is omitted in a request, then you are requesting the live object. See http://go/bigstore-versions The server-assigned name, which is only unique within the same service that originally returns it. If you use the default HTTP mapping, the `name` should be a resource name ending with `operations/{unique_id}`. If the value is `false`, it means the operation is still in progress. If `true`, the operation is completed, and either `error` or `response` is available. The error result of the operation in case of failure or cancellation. Service-specific metadata associated with the operation. It typically contains progress information and common metadata such as create time. Some services might not provide such metadata. Any method that returns a long-running operation should document the metadata type, if any. The normal, successful response of the operation.
If the original method returns no data on success, such as `Delete`, the response is `google.protobuf.Empty`. If the original method is standard `Get`/`Create`/`Update`, the response should be the resource. For other methods, the response should have the type `XxxResponse`, where `Xxx` is the original method name. For example, if the original method name is `TakeSnapshot()`, the inferred response type is `TakeSnapshotResponse`. This resource represents a long-running operation that is the result of a network API call. This resource represents a long-running operation that is the result of a network API call. If the value is false, it means the operation is still in progress. If true, the operation is completed, and either error or response is available. The error result of the operation in case of failure or cancellation. Service-specific metadata associated with the operation. It typically contains progress information and common metadata such as create time. Some services might not provide such metadata. Any method that returns a long-running operation should document the metadata type, if any. The server-assigned name, which is only unique within the same service that originally returns it. If you use the default HTTP mapping, the name should be a resource name ending with operations/{unique_id}. The normal, successful response of the operation. If the original method returns no data on success, such as Delete, the response is google.protobuf.Empty. If the original method is standard Get/Create/Update, the response should be the resource. For other methods, the response should have the type XxxResponse, where Xxx is the original method name. For example, if the original method name is TakeSnapshot(), the inferred response type is TakeSnapshotResponse. This resource represents a long-running operation that is the result of a network API call. The normal, successful response of the operation. Defines the valid operators that can be applied to a key-value pair. 
The default value. This value is unused. Supported by numeric. Supported by numeric. Supported by numeric and string. Supported by numeric. Supported by numeric. Supported by numeric and string. Supported by string only when value type for the given key has a stringListValue. Supported by string only when value type for the given key has a stringListValue. Unspecified status. This value should not be used. Code execution completed successfully. `output` contains the stdout, if any. Code execution failed. `output` contains the stderr and stdout, if any. Code execution ran for too long, and was cancelled. There may or may not be a partial `output` present. Config for evaluation output. Cloud storage destination for evaluation output. Describes the info for output of EvaluationService. Output only. The full path of the Cloud Storage directory created, into which the evaluation results and aggregation results are written. Optional. The IANA standard MIME type of the image. Values: image/jpeg, image/png Optional. The compression quality of the output image if encoding in image/jpeg. Spec for pairwise metric result. Output only. Spec for custom output. Output only. Explanation for pairwise metric score. Output only. Pairwise metric choice. Unspecified prediction choice. Baseline prediction wins. Candidate prediction wins. Winner cannot be determined. Spec for pairwise metric. Optional. The field name of the baseline response. Optional. The field name of the candidate response. Optional. CustomOutputFormatConfig allows customization of metric output. When this config is set, the default output is replaced with the raw output string. If a custom format is chosen, the pairwise_choice and explanation fields in the corresponding metric result will be empty. Required. Metric prompt template for pairwise metric. Optional. System instructions for pairwise metric. Not specified, should not be used. String type. Number type. Integer type. Boolean type. Array type. Object type.
Null type. A datatype containing media that is part of a multi-part Content message. A Part consists of data which has an associated datatype. A Part can only contain one of the accepted types in Part.data. For media types that are not text, Part must have a fixed IANA MIME type identifying the type and subtype of the media if inline_data or file_data field is filled with raw bytes. A datatype containing media that is part of a multi-part Content message. A part of a turn in a conversation with the model with a fixed MIME type. It has one of the following mutually exclusive fields: 1. text 2. inline_data 3. file_data 4. functionResponse 5. functionCall Exactly one field within a Part should be set, representing the specific type of content being conveyed. Using multiple fields within the same `Part` instance is considered invalid. A datatype containing media that is part of a multi-part Content message. A Part consists of data which has an associated datatype. A Part can only contain one of the accepted types in Part.data. A Part must have a fixed IANA MIME type identifying the type and subtype of the media if the inline_data field is filled with raw bytes. The text content of the part. Optional. The inline data content of the part. This can be used to include images, audio, or video in a request. The ETag of the item. Result of executing the ExecutableCode. Code generated by the model that is meant to be executed. URI based data. A predicted FunctionCall returned from the model that contains a string representing the FunctionDeclaration.name with the arguments and their values. The result output of a FunctionCall that contains a string representing the FunctionDeclaration.name and a structured JSON object containing any output from the function is used as context to the model. Optional. Media resolution for the input media. Custom metadata associated with the Part. 
Agents using genai.Part as content representation may need to keep track of the additional information. For example, it can be the name of a file/source from which the Part originates or a way to multiplex multiple Part streams. Optional. Indicates if the part is a thought from the model. Optional. An opaque signature for the thought so it can be reused in subsequent requests. Optional. Video metadata. The metadata should only be specified while the video data is presented in inline_data or file_data. Partial argument value of the function call. Optional. Represents a boolean value. Required. A JSON Path (RFC 9535) to the argument being streamed. https://datatracker.ietf.org/doc/html/rfc9535. e.g. "$.foo.bar[0].data". Optional. Represents a null value. Optional. Represents a double value. Optional. Represents a string value. Optional. Whether this is not the last part of the same json_path. If true, another PartialArg message for the current json_path is expected to follow. Null value. Tuning spec for Partner models. Hyperparameters for tuning. The accepted hyper_parameters and their valid range of values will differ depending on the base model. Required. Cloud Storage path to file containing training dataset for tuning. The dataset must be formatted as a JSONL file. Optional. Cloud Storage path to file containing validation dataset for tuning. The dataset must be formatted as a JSONL file. The Permission resource grants a user, group, or the rest of the world access to a PaLM API resource (e.g. a tuned model, corpus). A role is a collection of permitted operations that allows users to perform specific actions on PaLM API resources. To make them available to users, groups, or service accounts, you assign roles. When you assign a role, you grant permissions that the role contains. There are three concentric roles. Each role is a superset of the previous role's permitted operations: - reader can use the resource (e.g. tuned model, corpus) for inference; - writer has reader's permissions and additionally can edit and share; - owner has writer's permissions and additionally can delete. Optional. Immutable. The email address of the user or group to which this permission refers. Field is not set when permission's grantee type is EVERYONE. Optional. Immutable. The type of the grantee. Output only. Identifier. The permission name. A unique name will be generated on create. Examples: tunedModels/{tuned_model}/permissions/{permission} corpora/{corpus}/permissions/{permission} Output only. Required. The role granted by this permission. The default value. This value is unused. Represents a user. When set, you must provide email_address for the user. Represents a group. When set, you must provide email_address for the group. Represents access to everyone. No extra information is required. The default value. This value is unused. Owner can use, update, share and delete the resource. Writer can use, update and share the resource. Reader can use the resource. The default behavior is unspecified. The model will decide whether to generate images of people. Allows the model to generate images of people, including adults and children. Allows the model to generate images of adults, but not children. Prevents the model from generating images of people. Specifies your Pinecone instance. This is the name used to create the Pinecone index that's used with the RAG corpus. Collection of sources that provide answers about the features of a given place in Google Maps. Each PlaceAnswerSources message corresponds to a specific place in Google Maps. The Google Maps tool uses these sources to answer questions about features of the place (e.g. "does Bar Foo have Wifi" or "is Foo Bar wheelchair accessible?"). Currently we only support review snippets as sources. Snippets of reviews that are used to generate answers about the features of a given place in Google Maps. Spec for pointwise metric result.
Output only. Spec for custom output. Output only. Explanation for pointwise metric score. Output only. Pointwise metric score. Spec for pointwise metric. Optional. CustomOutputFormatConfig allows customization of metric output. By default, metrics return a score and explanation. When this config is set, the default output is replaced with either: - The raw output string. - A parsed output based on a user-defined schema. If a custom format is chosen, the score and explanation fields in the corresponding metric result will be empty. Required. Metric prompt template for pointwise metric. Optional. System instructions for pointwise metric. Represents a network port in a container. The number of the port to expose on the pod's IP address. Must be a valid port number, between 1 and 65535 inclusive. Configuration for a prebuilt voice. The configuration for the prebuilt speaker to use. The name of the preset voice to use. The spec for a pre-defined metric. Required. The name of a pre-defined metric, such as "instruction_following_v1" or "text_quality_v1". Optional. The parameters needed to run the pre-defined metric. Request message for PredictionService.PredictLongRunning. Request message for [PredictionService.PredictLongRunning]. Required. The instances that are the input to the prediction call. Optional. The parameters that govern the prediction call. Request message for PredictionService.Predict. Request message for PredictionService.Predict. Optional. The user labels for Imagen billing usage only. Only Imagen supports labels. For other use cases, it will be ignored. Required. The instances that are the input to the prediction call. Optional. The parameters that govern the prediction call. Response message for PredictionService.Predict. Response message for [PredictionService.Predict]. ID of the Endpoint's DeployedModel that served this prediction. Output only. Request-level metadata returned by the model. 
The metadata type will be dependent upon the model implementation. Output only. The resource name of the Model which is deployed as the DeployedModel that this prediction hits. Output only. The display name of the Model which is deployed as the DeployedModel that this prediction hits. Output only. The version ID of the Model which is deployed as the DeployedModel that this prediction hits. The outputs of the prediction call. Contains the schemata used in Model's predictions and explanations via PredictionService.Predict, PredictionService.Explain and BatchPredictionJob. Immutable. Points to a YAML file stored on Google Cloud Storage describing the format of a single instance, which are used in PredictRequest.instances, ExplainRequest.instances and BatchPredictionJob.input_config. The schema is defined as an OpenAPI 3.0.2 [Schema Object](https://github.com/OAI/OpenAPI-Specification/blob/main/versions/3.0.2.md#schemaObject). AutoML Models always have this field populated by Vertex AI. Note: The URI given on output will be immutable and probably different, including the URI scheme, than the one given on input. The output URI will point to a location where the user only has a read access. Immutable. Points to a YAML file stored on Google Cloud Storage describing the parameters of prediction and explanation via PredictRequest.parameters, ExplainRequest.parameters and BatchPredictionJob.model_parameters. The schema is defined as an OpenAPI 3.0.2 [Schema Object](https://github.com/OAI/OpenAPI-Specification/blob/main/versions/3.0.2.md#schemaObject). AutoML Models always have this field populated by Vertex AI, if no parameters are supported, then it is set to an empty string. Note: The URI given on output will be immutable and probably different, including the URI scheme, than the one given on input. The output URI will point to a location where the user only has a read access. Immutable. 
Points to a YAML file stored on Google Cloud Storage describing the format of a single prediction produced by this Model, which are returned via PredictResponse.predictions, ExplainResponse.explanations, and BatchPredictionJob.output_config. The schema is defined as an OpenAPI 3.0.2 [Schema Object](https://github.com/OAI/OpenAPI-Specification/blob/main/versions/3.0.2.md#schemaObject). AutoML Models always have this field populated by Vertex AI. Note: The URI given on output will be immutable and probably different, including the URI scheme, than the one given on input. The output URI will point to a location where the user only has a read access. Statistics computed for datasets used for preference optimization. Output only. A partial sample of the indices (starting from 1) of the dropped examples. Output only. For each index in dropped_example_indices, the user-facing reason why the example was dropped. Output only. Dataset distributions for scores variance per example. Output only. Dataset distributions for scores. Output only. Number of billable tokens in the tuning dataset. Output only. Number of examples in the tuning dataset. Output only. Number of tuning steps for this Tuning Job. Output only. Sample user examples in the training dataset. Output only. Dataset distributions for the user input tokens. Output only. Dataset distributions for the user output tokens. Hyperparameters for Preference Optimization. Optional. Adapter size for preference optimization. Optional. Weight for KL Divergence regularization. Optional. Number of complete passes the model makes over the entire training dataset during training. Optional. Multiplier for adjusting the default learning rate. Tuning Spec for Preference Optimization. Optional. If set to true, disable intermediate checkpoints for Preference Optimization and only the last checkpoint will be exported. Otherwise, enable intermediate checkpoints for Preference Optimization. Default is false. Optional. 
Hyperparameters for Preference Optimization. Required. Cloud Storage path to file containing training dataset for preference optimization tuning. The dataset must be formatted as a JSONL file. Optional. Cloud Storage path to file containing validation dataset for preference optimization tuning. The dataset must be formatted as a JSONL file. Preset configuration for example-based explanations. The modality of the uploaded model, which automatically configures the distance measurement and feature normalization for the underlying example index and queries. If your model does not precisely fit one of these types, it is okay to choose the closest type. Preset option controlling parameters for speed-precision trade-off when querying for examples. If omitted, defaults to PRECISE. Should not be set. Added as a recommended best practice for enums. IMAGE modality. TEXT modality. TABULAR modality. More precise neighbors as a trade-off against slower response. Faster response as a trade-off against less precise neighbors. A pre-tuned model for continuous tuning. Output only. The name of the base model this PreTunedModel was tuned from. Optional. The source checkpoint id. If not specified, the default checkpoint will be used. The resource name of the Model. E.g., a model resource name with a specified version id or alias: projects/{project}/locations/{location}/models/{model}@{version_id} projects/{project}/locations/{location}/models/{model}@{alias} Or, omit the version id to use the default version: projects/{project}/locations/{location}/models/{model} Config for proactivity features. Optional. If enabled, the model can reject responding to the last prompt. For example, this allows the model to ignore out of context speech or to stay silent if the user did not make a request yet. Probe describes a health check to be performed against a container to determine whether it is alive or ready to receive traffic. ExecAction probes the health of a container by executing a command.
Number of consecutive failures before the probe is considered failed. Defaults to 3. Minimum value is 1. Maps to Kubernetes probe argument 'failureThreshold'. GrpcAction probes the health of a container by sending a gRPC request. HttpGetAction probes the health of a container by sending an HTTP GET request. Number of seconds to wait before starting the probe. Defaults to 0. Minimum value is 0. Maps to Kubernetes probe argument 'initialDelaySeconds'. How often (in seconds) to perform the probe. Defaults to 10 seconds. Minimum value is 1. Must be less than timeout_seconds. Maps to Kubernetes probe argument 'periodSeconds'. Number of consecutive successes before the probe is considered successful. Defaults to 1. Minimum value is 1. Maps to Kubernetes probe argument 'successThreshold'. TcpSocketAction probes the health of a container by opening a TCP socket connection. Number of seconds after which the probe times out. Defaults to 1 second. Minimum value is 1. Must be greater than or equal to period_seconds. Maps to Kubernetes probe argument 'timeoutSeconds'. ExecAction specifies a command to execute. Command is the command line to execute inside the container; the working directory for the command is root ('/') in the container's filesystem. The command is simply exec'd; it is not run inside a shell, so traditional shell instructions ('|', etc) won't work. To use a shell, you need to explicitly call out to that shell. Exit status of 0 is treated as live/healthy and non-zero is unhealthy. GrpcAction checks the health of a container using a gRPC service. Port number of the gRPC service. Number must be in the range 1 to 65535. Service is the name of the service to place in the gRPC HealthCheckRequest. See https://github.com/grpc/grpc/blob/master/doc/health-checking.md. If this is not specified, the default behavior is defined by gRPC. HttpGetAction describes an action based on HTTP Get requests. Host name to connect to, defaults to the model serving container's IP.
You probably want to set "Host" in httpHeaders instead. Custom headers to set in the request. HTTP allows repeated headers. Path to access on the HTTP server. Number of the port to access on the container. Number must be in the range 1 to 65535. Scheme to use for connecting to the host. Defaults to HTTP. Acceptable values are "HTTP" or "HTTPS". HttpHeader describes a custom header to be used in HTTP probes. The header field name. This will be canonicalized upon output, so case-variant names will be understood as the same header. The header field value. TcpSocketAction probes the health of a container by opening a TCP socket connection. Optional: Host name to connect to, defaults to the model serving container's IP. Number of the port to access on the container. Number must be in the range 1 to 65535. An image of the product. A set of the feedback metadata the prompt specified in GenerateContentRequest.content. Output only. A readable block reason message. Optional. If set, the prompt was blocked and no candidates are returned. Rephrase the prompt. Ratings for safety of the prompt. There is at most one rating per category. Request for querying a . Required. Query string to perform semantic search. Optional. The maximum number of Chunks to return. The service may return fewer Chunks. If unspecified, at most 10 Chunks will be returned. The maximum specified result count is 100. Response from containing a list of relevant chunks. The relevant chunks. Request for querying a . Required. Query string to perform semantic search. Optional. The maximum number of Chunks to return. The service may return fewer Chunks. If unspecified, at most 10 Chunks will be returned. The maximum specified result count is 100. Response from containing a list of relevant chunks. The returned relevant chunks. A RagChunk includes the content of a chunk of a RagFile, and associated metadata. If populated, represents where the chunk starts and ends in the document.
The content of the chunk. Represents where the chunk starts and ends in the document. Page where chunk starts in the document. Inclusive. 1-indexed. Page where chunk ends in the document. Inclusive. 1-indexed. A RagCorpus is a RagFile container and a project can have multiple RagCorpora. Output only. RagCorpus state. Optional. The corpus type config of the RagCorpus. Output only. Timestamp when this RagCorpus was created. Optional. The description of the RagCorpus. Required. The display name of the RagCorpus. The name can be up to 128 characters long and can consist of any UTF-8 characters. Optional. Immutable. The CMEK key name used to encrypt at-rest data related to this Corpus. Only applicable to RagManagedDb option for Vector DB. This field can only be set at corpus creation time, and cannot be updated or deleted. Output only. The resource name of the RagCorpus. Optional. Immutable. The embedding model config of the RagCorpus. Output only. Number of RagFiles in the RagCorpus. NOTE: This field is not populated in the response of VertexRagDataService.ListRagCorpora. Optional. Immutable. The Vector DB config of the RagCorpus. Output only. Reserved for future use. Output only. Reserved for future use. Output only. Timestamp when this RagCorpus was last updated. Optional. Immutable. The config for the Vector DBs. Optional. Immutable. The config for the Vertex AI Search. The config for the corpus type of the RagCorpus. Optional. Config for the document corpus. Optional. Config for the memory corpus. Config for the document corpus. Config for the memory corpus. The LLM parser to use for the memory corpus. Config for the embedding model to use for RAG. Configuration for hybrid search. The Vertex AI Prediction Endpoint that either refers to a publisher model or an endpoint that is hosting a 1P fine-tuned text embedding model. Endpoints hosting non-1P fine-tuned text embedding models are currently not supported. This is used for dense vector search. 
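The hybrid search configuration that follows pairs this dense embedding endpoint with a sparse component scored by BM25, whose two tunables match the parameters documented below: b (document-length normalization, range [0, 1], default 0.75) and k1 (term-frequency saturation, range [1.2, 3], default 1.2). A self-contained sketch of BM25 scoring, for illustration only (not SDK code):

```python
import math

def bm25_score(query_terms, doc, corpus, k1=1.2, b=0.75):
    """Score one tokenized document against a query with BM25.

    k1 controls term-frequency saturation, b controls document-length
    normalization; defaults mirror the documented API defaults.
    """
    avgdl = sum(len(d) for d in corpus) / len(corpus)
    n = len(corpus)
    score = 0.0
    for term in query_terms:
        df = sum(1 for d in corpus if term in d)       # document frequency
        idf = math.log(1 + (n - df + 0.5) / (df + 0.5))
        tf = doc.count(term)                            # term frequency
        denom = tf + k1 * (1 - b + b * len(doc) / avgdl)
        if denom > 0:
            score += idf * tf * (k1 + 1) / denom
    return score

corpus = [["hybrid", "search"], ["dense", "vector"], ["sparse", "vector"]]
print(bm25_score(["vector"], ["dense", "vector"], corpus))
```

A larger b penalizes long documents more aggressively; a larger k1 lets repeated terms keep raising the score before saturating.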
Config for hybrid search. Required. The Vertex AI Prediction Endpoint that hosts the embedding model for dense embedding generations. Optional. The configuration for sparse embedding generation. This field is optional; the default behavior depends on the vector database choice on the RagCorpus. Configuration for sparse embedding generation. Use BM25 scoring algorithm. Message for BM25 parameters. Optional. The parameter to control document length normalization. It determines how much the document length affects the final score. b is in the range of [0, 1]. The default value is 0.75. Optional. The parameter to control term frequency saturation. It determines the scaling between the matching term frequency and final score. k1 is in the range of [1.2, 3]. The default value is 1.2. Optional. Use multilingual tokenizer if set to true. Config representing a model hosted on Vertex Prediction Endpoint. Required. The endpoint resource name. Format: projects/{project}/locations/{location}/publishers/{publisher}/models/{model} or projects/{project}/locations/{location}/endpoints/{endpoint} Output only. The resource name of the model that is deployed on the endpoint. Present only when the endpoint is not a publisher model. Pattern: projects/{project}/locations/{location}/models/{model} Output only. Version ID of the model that is deployed on the endpoint. Present only when the endpoint is not a publisher model. A RagFile contains user data for chunking, embedding and indexing. Output only. Timestamp when this RagFile was created. Optional. The description of the RagFile. Output only. The RagFile is encapsulated and uploaded in the UploadRagFile request. Required. The display name of the RagFile. The name can be up to 128 characters long and can consist of any UTF-8 characters. Output only. State of the RagFile. Output only. Google Cloud Storage location of the RagFile. It does not support wildcards in the Cloud Storage uri for now. Output only. Google Drive location.
Supports importing individual files as well as Google Drive folders. The RagFile is imported from a Jira query. Output only. The resource name of the RagFile. Output only. The type of the RagFile. The RagFile is imported from a SharePoint source. Output only. The size of the RagFile in bytes. The RagFile is imported from a Slack channel. Output only. Timestamp when this RagFile was last updated. Output only. The metadata for metadata search. The user_metadata needs to be in JSON format. RagFile type is unspecified. RagFile type is TXT. RagFile type is PDF. Specifies the LLM parsing for RagFiles. The prompt to use for parsing. If not specified, a default prompt will be used. The maximum number of requests the job is allowed to make to the LLM model per minute in this project. Consult https://cloud.google.com/vertex-ai/generative-ai/docs/quotas and your document size to set an appropriate value here. If this value is not specified, max_parsing_requests_per_min will be used by the indexing pipeline job as the global limit. The maximum number of requests the job is allowed to make to the LLM model per minute. Consult https://cloud.google.com/vertex-ai/generative-ai/docs/quotas and your document size to set an appropriate value here. If unspecified, a default value of 5000 QPM would be used. The name of an LLM model used for parsing. Format: * projects/{project_id}/locations/{location}/publishers/{publisher}/models/{model} A query to retrieve relevant contexts. Optional. The retrieval config for the query. Optional. Configurations for hybrid search results ranking. Optional. The number of contexts to retrieve. Optional. The query in text format to get relevant contexts. Configurations for hybrid search results ranking. Optional. Alpha value controls the weight between dense and sparse vector search results. The range is [0, 1], while 0 means sparse vector search only and 1 means dense vector search only.
The default value is 0.5 which balances sparse and dense vector search equally. The definition of the Rag resource. Optional. RagCorpora resource name. Format: projects/{project}/locations/{location}/ragCorpora/{ragCorpus} Optional. ragFileId. The files should be in the same ragCorpus set in ragCorpus field. Specifies the context retrieval config. Optional. Config for filters. Optional. Config for Hybrid Search. Optional. Config for ranking and reranking. Optional. The number of contexts to retrieve. Config for filters. Optional. String for metadata filtering. Optional. Only returns contexts with vector distance smaller than the threshold. Optional. Only returns contexts with vector similarity larger than the threshold. Config for Hybrid Search. Optional. Alpha value controls the weight between dense and sparse vector search results. The range is [0, 1], while 0 means sparse vector search only and 1 means dense vector search only. The default value is 0.5 which balances sparse and dense vector search equally. Config for ranking and reranking. Optional. Config for LlmRanker. Optional. Config for Rank Service. Config for LlmRanker. Optional. The model name used for ranking. See [Supported models](https://cloud.google.com/vertex-ai/generative-ai/docs/model-reference/inference#supported-models). Config for Rank Service. Optional. The model name of the rank service. Format: semantic-ranker-512@latest Response from `ListRagStores` containing a paginated list of `RagStores`. The results are sorted by ascending `rag_store.create_time`. The returned rag_stores. A token, which can be sent as `page_token` to retrieve the next page. If this field is omitted, there are no more pages. A `RagStore` is a collection of `Document`s. Output only. Immutable. Identifier. The `RagStore` resource name. It is an ID (name excluding the "ragStores/" prefix) that can contain up to 40 characters that are lowercase alphanumeric or dashes (-). It is output only. 
The unique name will be derived from `display_name` along with a 12 character random suffix. Example: `ragStores/my-awesome-rag-store-123a456b789c` If `display_name` is not provided, the name will be randomly generated. Optional. The human-readable display name for the `RagStore`. The display name must be no more than 512 characters in length, including spaces. Example: "Docs on Semantic Retriever" Output only. The number of documents in the Ragstore that are active and ready for retrieval. Output only. The number of documents in the Ragstore that have failed processing. Output only. The number of documents in the Ragstore that are being processed. Output only. The size in bytes of the Ragstore. This is the total size of all the documents in the Ragstore. Output only. The Timestamp of when the `RagStore` was created. Output only. The Timestamp of when the `RagStore` was last updated. Config for the Vector DB to use for RAG. Authentication config for the chosen Vector DB. The config for the Pinecone. Optional. Immutable. The embedding model config of the Vector DB. The config for the RAG-managed Vector DB. The config for the RAG-managed Vertex Vector Search 2.0. The config for the Vertex Feature Store. The config for the Vertex Vector Search. The config for the Weaviate. The config for the Pinecone. Pinecone index name. This value cannot be changed after it's set. The config for the default RAG-managed Vector DB. Performs an ANN search on RagCorpus. Use this if you have a lot of files (> 10K) in your RagCorpus and want to reduce the search latency. Performs a KNN search on RagCorpus. Default choice if not specified. Config for ANN search. RagManagedDb uses a tree-based structure to partition data and facilitate faster searches. As a tradeoff, it requires longer indexing time and manual triggering of index rebuild via the ImportRagFiles and UpdateRagCorpus API. Number of leaf nodes in the tree-based structure. 
Each leaf node contains groups of closely related vectors along with their corresponding centroid. Recommended value is 10 * sqrt(num of RagFiles in your RagCorpus). Default value is 500. The depth of the tree-based structure. Only depth values of 2 and 3 are supported. Recommended value is 2 if you have O(10K) files in the RagCorpus; set this to 3 if you have more than that. Default value is 2. Config for KNN search. The config for the RAG-managed Vertex Vector Search 2.0. Output only. The resource name of the Vector Search 2.0 Collection that RAG created for the corpus. Only populated after the corpus is successfully created. Format: projects/{project}/locations/{location}/collections/{collection_id} The config for the Vertex Feature Store. The resource name of the FeatureView. Format: projects/{project}/locations/{location}/featureOnlineStores/{feature_online_store}/featureViews/{feature_view} The config for the Vertex Vector Search. The resource name of the Index. Format: projects/{project}/locations/{location}/indexes/{index} The resource name of the Index Endpoint. Format: projects/{project}/locations/{location}/indexEndpoints/{index_endpoint} The config for the Weaviate. The corresponding collection this corpus maps to. This value cannot be changed after it's set. Weaviate DB instance HTTP endpoint. e.g. 34.56.78.90:8080. Vertex RAG only supports HTTP connection to Weaviate. This value cannot be changed after it's set. Raw output. Output only. Raw output string. Configures the realtime input behavior in BidiGenerateContent. Optional. Defines what effect activity has. Optional. If not set, automatic activity detection is enabled by default. If automatic voice detection is disabled, the client must send activity signals. Optional. Defines which input is included in the user's turn. If unspecified, the default behavior is `START_OF_ACTIVITY_INTERRUPTS`. If true, start of activity will interrupt the model's response (also called "barge in").
The model's current response will be cut off in the moment of the interruption. This is the default behavior. The model's response will not be interrupted. If unspecified, the default behavior is `TURN_INCLUDES_ONLY_ACTIVITY`. The user's turn only includes activity since the last turn, excluding inactivity (e.g. silence on the audio stream). This is the default behavior. The user's turn includes all realtime input since the last turn, including inactivity (e.g. silence on the audio stream). Configuration for recontextualizing an image. The number of sampling steps. A higher value has better image quality, while a lower value has better latency. Request for recontextualizing an image. ID of the model to use. For a list of models, see Google models (https://cloud.google.com/vertex-ai/generative-ai/docs/learn/models). A set of source input(s) for image recontextualization. Configuration for image recontextualization. The output images response. List of generated images. A set of source input(s) for image recontextualization. A text prompt for guiding the model during image recontextualization. Not supported for Virtual Try-On. Image of the person or subject who will be wearing the product(s). A list of product images. Abstract class that represents a Reference image that is sent to the API. Required. The reference image for the editing operation. Required. The id of the reference image. Required. The type of the reference image. Only set by the SDK. Configuration for the mask reference image. Configuration for the control reference image. Configuration for the style reference image. Configuration for the subject reference image. Describes what the field reference contains. Reference contains a GFS path or a local path. Reference points to a blobstore object. This could be either a v1 blob_ref or a v2 blobstore2_info. Clients should check blobstore2_info first, since v1 is being deprecated. Data is included into this proto buffer.
Data should be accessed from the current service using the operation GetMedia. The content for this media object is stored across multiple partial media objects under the composite_media field. Reference points to a bigstore object. Indicates the data is stored in diff_version_response. Indicates the data is stored in diff_checksums_response. Indicates the data is stored in diff_download_response. Indicates the data is stored in diff_upload_request. Indicates the data is stored in diff_upload_response. Indicates the data is stored in cosmo_binary_reference. Informs Scotty to generate a response payload with the size specified in the length field. The contents of the payload are generated by Scotty and are undefined. This is useful for testing download speeds between the user and Scotty without involving a real payload source. Note: range is not supported when using arbitrary_bytes. Request for RegisterFiles. Required. The Google Cloud Storage URIs to register. Example: gs://bucket/object. Response for RegisterFiles. The registered files to be used when calling GenerateContent. The information for a chunk relevant to a query. associated with the query. relevance to the query. associated with the chunk. The configuration for the replicated voice to use. Optional. The mimetype of the voice sample. The only currently supported value is audio/wav. This represents 16-bit signed little-endian wav data, with a 24kHz sampling rate. mime_type will default to audio/wav if not set. Optional. The sample of the custom voice. Provides options for individual API requests. Gets or sets the options for this request. Gets or sets the timeout for this specific request. If set to a positive value, the request will be cancelled if it exceeds this duration. This is achieved by linking a CancellationToken to the request. A value of TimeSpan.Zero (the default) or a negative value means no per-request timeout will be applied beyond any default client timeout. 
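The per-request timeout above is typically paired with the retry strategy these options define: an initial delay, a delay multiplier, a maximum retry count, an overall timeout, and the set of retryable HTTP status codes. A rough sketch of that control flow, with hypothetical names and illustrative defaults (not the SDK's implementation):

```python
import time

RETRYABLE = {429, 500, 503}  # illustrative retryable status codes

def send_with_retry(send, initial_delay=1.0, multiplier=2.0,
                    max_retries=3, overall_timeout=30.0):
    """Retry `send()` on retryable status codes with exponential backoff.

    `send` is a zero-argument callable returning an HTTP status code.
    Gives up when retries are exhausted or the overall deadline would pass.
    """
    deadline = time.monotonic() + overall_timeout
    delay = initial_delay
    for attempt in range(max_retries + 1):
        status = send()
        if status not in RETRYABLE:
            return status
        if attempt == max_retries or time.monotonic() + delay > deadline:
            return status
        time.sleep(delay)
        delay *= multiplier  # exponential backoff

responses = iter([503, 200])
print(send_with_retry(lambda: next(responses), initial_delay=0.01))  # → 200
```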
The base URL to use for the request. If not set, the default base URL for the model will be used. Gets or sets the version of the API. Gets or sets the proxy to use for the request. Gets or sets the headers to use for the request. Initializes a new instance of the class Initializes a new instance of the class Optional. Refer to [retry docs](https://googleapis.dev/python/google-api-core/latest/retry.html) for details. Optional. In seconds (or provide a [TimeToDeadlineTimeout](https://googleapis.dev/python/google-api-core/latest/timeout.html) object). Optional. The base URL to use for the request. Optional. The version of the API. Optional. Proxy settings to use for the request. Initializes a new instance of the class Defines the retry strategy for a request. The initial delay before the first retry, in seconds. The multiplier for the delay between retries. The maximum number of retries. The overall timeout for the retry logic. The HTTP status codes that should trigger a retry. Defines the format of the response. Required. Type of the response. Can be either: - "text": Format the response as text. - "json_object": Format the response as a JSON object. - "json_schema": Format the response as a JSON object following the given schema. Optional. The JSON schema to follow. Only used if type is "json_schema". Schema for the response. Required. Name of the object type represented by the schema. Optional. Description of the object represented by the schema. Optional. Whether the schema validation is strict. If true, the model will fail if the schema is not valid. NOTE: This parameter is currently ignored. Optional. The JSON schema to follow. Default value. Default value. Indicates the model should return text. Indicates the model should return images. Indicates the model should return audio. Defines a retrieval tool that the model can call to access external knowledge. Defines a retrieval tool that the model can call to access external knowledge. Optional. Deprecated.
This option is no longer supported. Use data source powered by external API for grounding. Set to use data source powered by Vertex AI Search. Set to use data source powered by Vertex RAG store. User data is uploaded via the VertexRagDataService. Retrieval config. Retrieval config. Optional. The language code of the user. Language code for content. Use language tags defined by [BCP47](https://www.rfc-editor.org/rfc/bcp/bcp47.txt). Optional. The location of the user. Metadata related to the retrieval grounding source. This is part of the GroundingMetadata returned when grounding is enabled. Metadata related to retrieval in the grounding flow. Optional. Score indicating how likely information from google search could help answer the prompt. The score is in the range [0, 1], where 0 is the least likely and 1 is the most likely. This score is only populated when google search grounding and dynamic retrieval is enabled. It will be compared to the threshold to determine whether to trigger google search. The semantic retrieval resource to retrieve from. Required. The name of the semantic retrieval resource to retrieve from. Example: `ragStores/my-rag-store-123` Chunk from context retrieved by the file search tool. Optional. Name of the FileSearchStore containing the document. Example: fileSearchStores/123 Optional. Text of the chunk. Optional. Title of the document. Optional. URI reference of the semantic retrieval document. Encapsulates a snippet of a user review that answers a question about the features of a specific place in Google Maps. Encapsulates a snippet of a user review that answers a question about the features of a specific place in Google Maps. The ID of the review snippet. A link that corresponds to the user review on Google Maps. A link that corresponds to the user review on Google Maps. The ID of the review snippet. Title of the review. Rouge metric value for an instance. Output only. Rouge score. 
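The ROUGE spec described next computes the recall of n-grams in a prediction against a reference, yielding a score in [0, 1]. A minimal ROUGE-N recall sketch (whitespace tokenization, no stemming; illustrative only):

```python
from collections import Counter

def rouge_n_recall(prediction, reference, n=1):
    """Fraction of reference n-grams that also appear in the prediction."""
    def ngrams(tokens):
        return Counter(tuple(tokens[i:i + n]) for i in range(len(tokens) - n + 1))
    ref = ngrams(reference.split())
    pred = ngrams(prediction.split())
    overlap = sum(min(count, pred[gram]) for gram, count in ref.items())
    total = sum(ref.values())
    return overlap / total if total else 0.0

# 3 of the 6 reference unigrams are recovered by the prediction.
print(rouge_n_recall("the cat sat", "the cat sat on the mat"))  # → 0.5
```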
Spec for rouge score metric - calculates the recall of n-grams in prediction as compared to reference - returns a score ranging between 0 and 1. Optional. Supported rouge types are rougen[1-9], rougeL, and rougeLsum. Optional. Whether to split summaries while using rougeLsum. Optional. Whether to use stemmer to compute rouge score. The configuration for routing the request to a specific model. This can be used to control which model is used for the generation, either automatically or by specifying a model name. This data type is not supported in Gemini API. In this mode, the model is selected automatically based on the content of the request. In this mode, the model is specified manually. Specification for how rubrics should be generated. Configuration for the model used in rubric generation. Configs including sampling count and base model can be specified here. Flipping is not supported for rubric generation. Template for the prompt used to generate rubrics. The details should be updated based on the most-recent recipe requirements. The type of rubric content to be generated. Optional. An optional, pre-defined list of allowed types for generated rubrics. If this field is provided, it implies include_rubric_type should be true, and the generated rubric types should be chosen from this ontology. The content type to generate is not specified. Generate rubrics based on properties. Generate rubrics in an NL question answer format. Generate rubrics in a unit test format. Safety feedback for an entire request. This field is populated if content in the input and/or response is blocked due to safety settings. SafetyFeedback may not exist for every HarmCategory. Each SafetyFeedback will return the safety settings used by the request as well as the lowest HarmProbability that should be allowed in order to return a result. Safety rating evaluated from content. Safety settings applied to the request. Enum that controls the safety filter level for objectionable content. 
Content with NEGLIGIBLE will be allowed. Content with NEGLIGIBLE and LOW will be allowed. Content with NEGLIGIBLE, LOW, and MEDIUM will be allowed. All content will be allowed. A safety rating for a piece of content. The safety rating contains the harm category and the harm probability level. A safety rating for a piece of content. The safety rating contains the harm category and the harm probability level. Ref: https://ai.google.dev/api/rest/v1beta/SafetyRating Safety rating for a piece of content. The safety rating contains the category of harm and the harm probability level in that category for a piece of content. Content is classified for safety across a number of harm categories and the probability of the harm classification is included here. Output only. The overwritten threshold for the safety category of Gemini 2.0 image out. If minors are detected in the output image, the threshold of each safety category will be overwritten if user sets a lower threshold. Output only. The probability score of harm for this category. Output only. The severity of harm for this category. Output only. The severity score of harm for this category. The harm block threshold is unspecified. Block content with a low harm probability or higher. Block content with a medium harm probability or higher. Block content with a high harm probability. Do not block any content, regardless of its harm probability. Turn off the safety filter entirely. Was this content blocked because of this rating? Required. The category for this rating. Required. The probability of harm for this content. A safety setting that affects the safety-blocking behavior. A SafetySetting consists of a harm category and a threshold for that category. A safety setting that affects the safety-blocking behavior. A SafetySetting consists of a harm category and a threshold for that category. Represents a safety setting that can be used to control the model's behavior. 
It instructs the model to avoid certain responses given safety measurements based on category. Ref: https://ai.google.dev/api/rest/v1beta/SafetySetting Safety setting, affecting the safety-blocking behavior. Passing a safety setting for a category changes the allowed probability that content is blocked. Optional. The method for blocking content. If not specified, the default behavior is to use the probability score. Required. The category for this setting. Required. Controls the probability threshold at which harm is blocked. An attribution method that approximates Shapley values for features that contribute to the label being predicted. A sampling strategy is used to approximate the value rather than considering all subsets of features. Required. The number of feature permutations to consider when approximating the Shapley values. Valid range of its value is [1, 50], inclusively. A SavedQuery is a view of the dataset. It references a subset of annotations by problem type and filters. Output only. Filters on the Annotations in the dataset. Output only. Number of AnnotationSpecs in the context of the SavedQuery. Output only. Timestamp when this SavedQuery was created. Required. The user-defined name of the SavedQuery. The name can be up to 128 characters long and can consist of any UTF-8 characters. Used to perform a consistent read-modify-write update. If not set, a blind "overwrite" update happens. Some additional information about the SavedQuery. Output only. Resource name of the SavedQuery. Required. Problem type of the SavedQuery. Allowed values: * IMAGE_CLASSIFICATION_SINGLE_LABEL * IMAGE_CLASSIFICATION_MULTI_LABEL * IMAGE_BOUNDING_POLY * IMAGE_BOUNDING_BOX * TEXT_CLASSIFICATION_SINGLE_LABEL * TEXT_CLASSIFICATION_MULTI_LABEL * TEXT_EXTRACTION * TEXT_SENTIMENT * VIDEO_CLASSIFICATION * VIDEO_OBJECT_TRACKING Output only. If the Annotations belonging to the SavedQuery can be used for AutoML training. Output only. Timestamp when SavedQuery was last updated. 
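The block thresholds described above map onto sets of allowed harm-probability levels. A sketch of that mapping as plain data, with a lookup helper; the enum spellings follow the public HarmBlockThreshold and HarmProbability values and should be checked against your API version:

```python
# Harm probabilities a response may carry and still be returned, keyed by
# HarmBlockThreshold. BLOCK_NONE allows everything regardless of probability;
# OFF (not listed) additionally turns the safety filter itself off.
ALLOWED_PROBABILITIES = {
    "BLOCK_LOW_AND_ABOVE":    {"NEGLIGIBLE"},
    "BLOCK_MEDIUM_AND_ABOVE": {"NEGLIGIBLE", "LOW"},
    "BLOCK_ONLY_HIGH":        {"NEGLIGIBLE", "LOW", "MEDIUM"},
    "BLOCK_NONE":             {"NEGLIGIBLE", "LOW", "MEDIUM", "HIGH"},
}

def is_blocked(probability: str, threshold: str) -> bool:
    """True when content rated at `probability` is blocked under `threshold`."""
    return probability not in ALLOWED_PROBABILITIES[threshold]
```

A request then carries one such setting per harm category, e.g. `{"category": "HARM_CATEGORY_HARASSMENT", "threshold": "BLOCK_MEDIUM_AND_ABOVE"}`.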
All parameters related to queuing and scheduling of custom jobs. Optional. Indicates if the job should retry for internal errors after the job starts running. If true, overrides Scheduling.restart_job_on_worker_restart to false. Optional. This is the maximum duration that a job will wait for the requested resources to be provisioned if the scheduling strategy is set to [Strategy.DWS_FLEX_START]. If set to 0, the job will wait indefinitely. The default is 24 hours. Optional. Restarts the entire CustomJob if a worker gets restarted. This feature can be used by distributed training jobs that are not resilient to workers leaving and joining a job. Optional. This determines which type of scheduling strategy to use. Optional. The maximum job running time. The default is 7 days. Strategy will default to STANDARD. Deprecated. Regular on-demand provisioning strategy. Deprecated. Low cost by making potential use of spot resources. Standard provisioning strategy uses regular on-demand resources. Spot provisioning strategy uses spot resources. Flex Start strategy uses DWS to queue for resources. Specifies how the response should be scheduled in the conversation. This value is unused. Only add the result to the conversation context, do not interrupt or trigger generation. Add the result to the conversation context, and prompt to generate output without interrupting ongoing generation. Add the result to the conversation context, interrupt ongoing generation and prompt to generate output. Defines the schema of input and output data. This is a subset of the [OpenAPI 3.0 Schema Object](https://spec.openapis.org/oas/v3.0.3#schema-object). The Schema object allows the definition of input and output data types. These types can be objects, but also primitives and arrays. Represents a select subset of an [OpenAPI 3.0 schema object](https://spec.openapis.org/oas/v3.0.3#schema). Optional. If type is OBJECT, specifies how to handle properties not defined in properties. 
If it is a boolean false, no additional properties are allowed. If it is a schema, additional properties are allowed if they conform to the schema. Optional. defs provides a map of schema definitions that can be reused by ref elsewhere in the schema. Only allowed at root level of the schema. Optional. Allows referencing another schema definition to use in place of this schema. The value must be a valid reference to a schema in defs. For example, the following schema defines a reference to a schema node named "Pet": type: object properties: pet: ref: #/defs/Pet defs: Pet: type: object properties: name: type: string The value of the "pet" property is a reference to the schema node named "Pet". See details in https://json-schema.org/understanding-json-schema/structuring Converts a JsonElement representing a JSON Schema to a Schema object. The JsonElement containing the JSON Schema A Schema object representing the JSON Schema Creates a Schema from a string. Supports either: - A JSON object string representing a JSON Schema, or - A simple OpenAPI type name (string, number, integer, boolean, array, object, null). If the string is a JSON string containing a nested JSON object, it will be parsed recursively. JSON schema text or a simple type name Thrown when value is null Thrown when value is whitespace-only or cannot be parsed into a Schema Builds a Schema from a .NET object using Json.Schema generation, then maps it into the internal Schema model. Builds a Schema from a .NET type using Json.Schema generation, then maps it into the internal Schema model. Any type. Builds a Schema from a .NET type using Json.Schema generation, then maps it into the internal Schema model. The file name of the assembly's XML comment file. Any type in the assembly. Builds a Schema from a .NET type using Json.Schema generation, then maps it into the internal Schema model. The type to generate. 
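The "Pet" example above, rebuilt as a dictionary, with a tiny pointer resolver showing how a ref value such as `#/defs/Pet` finds its definition under the root-level defs map. The resolver is illustrative only, not the library's implementation:

```python
# The schema from the documentation example: an object whose "pet"
# property is defined by reference to the root-level defs entry "Pet".
pet_schema = {
    "type": "object",
    "properties": {"pet": {"ref": "#/defs/Pet"}},
    "defs": {
        "Pet": {"type": "object", "properties": {"name": {"type": "string"}}}
    },
}

def resolve_ref(root: dict, pointer: str) -> dict:
    """Follow a '#/defs/Name'-style pointer from the schema root."""
    node = root
    for key in pointer.lstrip("#/").split("/"):
        node = node[key]
    return node
```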
Builds a Schema from a .NET type using Json.Schema generation, then maps it into the internal Schema model. The type to generate. The to use. Builds a parameters from a delegate's signature. Skips framework parameters such as that should not be exposed to the model. The delegate whose parameters will be used to construct the schema. A representing the parameters of the delegate as an object with named properties, or null if there are no user-exposed parameters. Builds a response from a delegate's return type. Returns null for void, Task, or ValueTask without a result. The delegate whose return type will be used to generate the schema. A representing the return type of the delegate, or null if the return type is void, Task, or ValueTask without a result. Optional. The value should be validated against any (one or more) of the subschemas in the list. Optional. Default value of the field. Per JSON Schema, this field is intended for documentation generators and doesn't affect validation. Thus it's included here and ignored so that developers who send schemas with a default field don't get unknown-field errors. Optional. A brief description of the parameter. This could contain examples of use. Parameter description may be formatted as Markdown. Optional. Possible values of the element of Type.STRING with enum format. For example, we can define an Enum Direction as: {type:STRING, format:enum, enum:["EAST", "NORTH", "SOUTH", "WEST"]} Optional. Example of the object. Will only be populated when the object is the root. Optional. The format of the data. Any value is allowed, but most do not trigger any special functionality. Optional. Schema of the elements of Type.ARRAY. Optional. Maximum number of the elements for Type.ARRAY. Optional. Maximum length of the Type.STRING Optional. Maximum number of the properties for Type.OBJECT. Optional. Maximum value of the Type.INTEGER and Type.NUMBER Optional. Minimum number of the elements for Type.ARRAY. Optional. 
SCHEMA FIELDS FOR TYPE STRING Minimum length of the Type.STRING Optional. Minimum number of the properties for Type.OBJECT. Optional. SCHEMA FIELDS FOR TYPE INTEGER and NUMBER Minimum value of the Type.INTEGER and Type.NUMBER Optional. Indicates if the value may be null. Optional. Pattern of the Type.STRING to restrict a string to a regular expression. Optional. Properties of Type.OBJECT. Optional. The order of the properties. Not a standard field in the OpenAPI spec. Used to determine the order of the properties in the response. Optional. Required properties of Type.OBJECT. Optional. The title of the schema. Required. Data type. Response for list models. Output only. A list of the requested embeddings. Output only. Always "list", required by the SDK. An entry point for displaying Google Search results. A SearchEntryPoint is populated when the grounding source for a model's response is Google Search. It provides information that you can use to display the search results in your application. Google search entry point. Optional. Web content snippet that can be embedded in a web page or an app webview. Optional. Base64 encoded JSON representing an array of (search term, search url) tuples. Different types of search that can be enabled on the GoogleSearch tool. Optional. Setting this field enables image search. Image bytes are returned. Optional. Setting this field enables web search. Only text results are returned. A segment of the content. Segment of the content. End index in the given Part, measured in bytes. Offset from the start of the Part, exclusive, starting at zero. The index of a Part object within its parent Content object. Start index in the given Part, measured in bytes. Offset from the start of the Part, inclusive, starting at zero. The text corresponding to the segment from the response. Identifier for a Chunk retrieved via Semantic Retriever specified in the GenerateAnswerRequest using SemanticRetrieverConfig. Output only. Name of the Chunk containing the attributed text. 
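Because Segment offsets are byte offsets (start inclusive, end exclusive), a segment must be sliced from the encoded bytes of the Part's text, not from the string itself. A small sketch, assuming UTF-8 encoding:

```python
def segment_text(part_text: str, start_index: int, end_index: int) -> str:
    """Extract a Segment's text from its Part using byte offsets.

    start_index is inclusive, end_index exclusive, both measured in bytes
    from the start of the Part. UTF-8 encoding is assumed here.
    """
    data = part_text.encode("utf-8")
    return data[start_index:end_index].decode("utf-8")
```

Note that for non-ASCII text the byte offsets do not coincide with character indices, which is why slicing the string directly would be wrong.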
Example: corpora/123/documents/abc/chunks/xyz Output only. Name of the source matching the request's SemanticRetrieverConfig.source. Example: corpora/123 or corpora/123/documents/abc Configuration for retrieving grounding content from a Corpus or Document created using the Semantic Retriever API. Optional. Maximum number of relevant Chunks to retrieve. Optional. Filters for selecting Documents and/or Chunks from the resource. Optional. Minimum relevance score for retrieved relevant Chunks. Required. Query to use for matching Chunks in the given resource by similarity. Required. Name of the resource for retrieval. Example: corpora/123 or corpora/123/documents/abc. Session resumption configuration. This message is included in the session configuration as BidiGenerateContentSetup.session_resumption. If configured, the server will send SessionResumptionUpdate messages. The handle of a previous session. If not present then a new session is created. Session handles come from SessionResumptionUpdate.token values in previous connections. The SharePointSources to pass to ImportRagFiles. The SharePoint sources. An individual SharePointSource. The Application ID for the app registered in Microsoft Azure Portal. The application must also be configured with MS Graph permissions "Files.ReadAll", "Sites.ReadAll" and BrowserSiteLists.Read.All. The application secret for the app registered in Azure. The ID of the drive to download from. The name of the drive to download from. Output only. The SharePoint file id. Output only. The ID of the SharePoint folder to download from. The path of the SharePoint folder to download from. The name of the SharePoint site to download from. This can be the site name or the site id. Unique identifier of the Azure Active Directory Instance. The Slack source for the ImportRagFilesRequest. Required. The Slack channels. SlackChannels contains the Slack channels and corresponding access token. Required. 
The SecretManager secret version resource name (e.g. projects/{project}/secrets/{secret}/versions/{version}) storing the Slack channel access token that has access to the slack channel IDs. See: https://api.slack.com/tutorials/tracks/getting-a-token. Required. The Slack channel IDs. SlackChannel contains the Slack channel ID and the time range to import. Required. The Slack channel ID. Optional. The ending timestamp for messages to import. Optional. The starting timestamp for messages to import. The SlidingWindow method operates by discarding content at the beginning of the context window. The resulting context will always begin at the start of a USER role turn. System instructions and any BidiGenerateContentSetup.prefix_turns will always remain at the beginning of the result. The target number of tokens to keep. The default value is trigger_tokens/2. Discarding parts of the context window causes a temporary latency increase so this value should be calibrated to avoid frequent compression operations. Config for SmoothGrad approximation of gradients. When enabled, the gradients are approximated by averaging the gradients from noisy samples in the vicinity of the inputs. Adding noise can help improve the computed gradients. Refer to this paper for more details: https://arxiv.org/pdf/1706.03825.pdf This is similar to noise_sigma, but provides additional flexibility. A separate noise sigma can be provided for each feature, which is useful if their distributions are different. No noise is added to features that are not set. If this field is unset, noise_sigma will be used for all features. This is a single float value and will be used to add noise to all the features. Use this field when all features are normalized to have the same distribution: scale to range [0, 1], [-1, 1] or z-scoring, where features are normalized to have 0-mean and 1-variance. Learn more about [normalization](https://developers.google.com/machine-learning/data-prep/transform/normalization). 
For best results, the recommended value is about 10% to 20% of the standard deviation of the input feature. Refer to section 3.2 of the SmoothGrad paper: https://arxiv.org/pdf/1706.03825.pdf. Defaults to 0.1. If the distribution is different per feature, set feature_noise_sigma instead for each feature. The number of gradient samples to use for approximation. The higher this number, the more accurate the gradient is, but the runtime complexity increases by this factor as well. Valid range of its value is [1, 50]. Defaults to 3. Used if source is not specified. Indicates the file is uploaded by the user. Indicates the file is generated by Google. Indicates the file is a registered file, i.e. a Google Cloud Storage file. Configuration for a single speaker in a multi-speaker setup. The configuration for a single speaker in a multi-speaker setup. Required. The name of the speaker to use. Should be the same as in the prompt. Required. The configuration for the voice to use. Configuration for speech generation. Config for speech generation and transcription. Optional. The IETF [BCP-47](https://www.rfc-editor.org/rfc/bcp/bcp47.txt) language code that the user configured the app to use. Used for speech recognition and synthesis. Valid values are: de-DE, en-AU, en-GB, en-IN, en-US, es-US, fr-FR, hi-IN, pt-BR, ar-XA, es-ES, fr-CA, id-ID, it-IT, ja-JP, tr-TR, vi-VN, bn-IN, gu-IN, kn-IN, ml-IN, mr-IN, ta-IN, te-IN, nl-NL, ko-KR, cmn-CN, pl-PL, ru-RU, and th-TH. Optional. The configuration for the multi-speaker setup. It is mutually exclusive with the voice_config field. The configuration in case of single-voice output. The default value. This value is used if the state is omitted. Some `Chunks` of the `Document` are being processed (embedding and vector storage). All `Chunks` of the `Document` are processed and available for querying. Some `Chunks` of the `Document` failed processing. The batch state is unspecified. The service is preparing to run the batch. The batch is in progress. 
The batch completed successfully. The batch failed. The batch has been cancelled. The batch has expired. File is being processed and cannot be used for inference yet. File is processed and available for inference. File failed processing. Being generated. Generated and is ready for download. The model is being created. A state used by systems like Vertex AI Pipelines to indicate that the underlying data item represented by this Artifact is being created. A state indicating that the Artifact should exist, unless something external to the system deletes it. The job state is unspecified. The job has been just created or resumed and processing has not yet begun. The service is preparing to run the job. The job is in progress. The job completed successfully. The job failed. The job is being cancelled. From this state the job may only go to either `JOB_STATE_SUCCEEDED`, `JOB_STATE_FAILED` or `JOB_STATE_CANCELLED`. The job has been cancelled. The job has been stopped, and can be resumed. The job has expired. The job is being updated. Only jobs in the `RUNNING` state can be updated. After updating, the job goes back to the `RUNNING` state. The job is partially succeeded, some results may be missing due to errors. This state is not supposed to happen. RagCorpus resource entry is initialized, but hasn't done validation. RagCorpus is in a problematic situation. See `error_message` field for details. The evaluation run is running. The evaluation run has succeeded. The evaluation run has been cancelled. The evaluation run is performing inference. The evaluation run is performing rubric generation. The Execution is new. The Execution has finished running. The Execution completed through Cache hit. State when the featureOnlineStore configuration is not being updated and the fields reflect the current configuration of the featureOnlineStore. The featureOnlineStore is usable in this state. The state of the featureOnlineStore configuration when it is being updated. 
During an update, the fields reflect either the original configuration or the updated configuration of the featureOnlineStore. The featureOnlineStore is still usable in this state. The default behavior of whether to enable the monitoring. EntityType-level config: disabled. Feature-level config: inherited from the configuration of EntityType this Feature belongs to. Explicitly enables import features analysis. EntityType-level config: by default enables import features analysis for all Features under it. Feature-level config: enables import features analysis regardless of the EntityType-level config. Explicitly disables import features analysis. EntityType-level config: by default disables import features analysis for all Features under it. Feature-level config: disables import features analysis regardless of the EntityType-level config. Indicates that a specific NasTrial has been requested, but it has not yet been suggested by the service. Indicates that the NasTrial should stop according to the service. Indicates that the NasTrial should not be attempted again. The service will set a NasTrial to INFEASIBLE when it's done but missing the final_measurement. Should not be used. The PSC service automation is successful. The PSC service automation has failed. The PROVISIONING state indicates the persistent resources is being created. The REBOOTING state indicates the persistent resource is being rebooted (PR is not available right now but is expected to be ready again later). The pipeline state is unspecified. The pipeline has been created or resumed, and processing has not yet begun. The service is preparing to run the pipeline. The pipeline is in progress. The pipeline completed successfully. The pipeline failed. The pipeline is being cancelled. From this state, the pipeline may only go to either PIPELINE_STATE_SUCCEEDED, PIPELINE_STATE_FAILED or PIPELINE_STATE_CANCELLED. The pipeline has been cancelled. The pipeline has been stopped, and can be resumed. 
Specifies Task cancel is in pending state. Specifies task is being cancelled. Specifies task was skipped due to cache hit. Specifies that the task was not triggered because the task's trigger policy is not satisfied. The trigger policy is specified in the `condition` field of PipelineJob.pipeline_spec. Runtime resources are being allocated for the sandbox environment. Sandbox runtime is ready for serving. Sandbox runtime is halted, performing tear down tasks. Sandbox has terminated with underlying runtime failure. Sandbox runtime has been deleted. The schedule is paused. No new runs will be created until the schedule is resumed. Already started runs will be allowed to complete. The Schedule is completed. No new runs will be scheduled. Already started runs will be allowed to complete. Schedules in completed state cannot be paused or resumed. The study is stopped due to an internal error. The Status type defines a logical error model that is suitable for different programming environments, including REST APIs and RPC APIs. It is used by [gRPC](https://github.com/grpc). Each Status message contains three pieces of data: error code, error message, and error details. You can find out more about this error model and how to work with it in the [API Design Guide](https://cloud.google.com/apis/design/errors). The type defines a logical error model that is suitable for different programming environments, including REST APIs and RPC APIs. It is used by [gRPC](https://github.com/grpc). Each message contains three pieces of data: error code, error message, and error details. You can find out more about this error model and how to work with it in the [API Design Guide](https://cloud.google.com/apis/design/errors). The status code, which should be an enum value of google.rpc.Code. A list of messages that carry the error details. There is a common set of message types for APIs to use. A developer-facing error message, which should be in English. 
Any user-facing error message should be localized and sent in the google.rpc.Status.details field, or localized by the client. A ContentsExample to be used with GenerateContent alongside information required for storage and retrieval with Example Store. Required. The example to be used with GenerateContent. Optional. (Optional) the search key used for retrieval. If not provided at upload-time, the search key will be generated from contents_example.contents using the method provided by search_key_generation_method. The generated search key will be included in retrieved examples. Optional. The method used to generate the search key from contents_example.contents. This is ignored when uploading an example if search_key is provided. Options for generating the search key from the conversation history. Use only the last entry of the conversation history (contents_example.contents) as the search key. Configuration for using only the last entry of the conversation history as the search key. A transport that can stream HTTP requests and responses. Optional: Fields for authentication headers, timeouts, etc., if needed. Timeout for SSE read operations. Whether to close the client session when the transport closes. HTTP timeout for regular operations. The full URL for the MCPServer endpoint. Example: "https://api.example.com/mcp" Options for streaming requests. Optional. If set, include usage statistics in the response. User provided string values assigned to a single metadata key. The string values of the metadata to store. Configuration for a Style reference image. A text description of the style to use for the generated image. Configuration for a Subject reference image. The subject type of a subject reference image. Subject description for the image. Enum representing the subject type of a subject reference image. Default subject type. Person subject type. Animal subject type. Product subject type. Hyperparameters for SFT. Optional. Adapter size for tuning. 
Optional. Batch size for tuning. This feature is only available for open source models. Optional. Number of complete passes the model makes over the entire training dataset during training. Optional. Learning rate for tuning. Mutually exclusive with learning_rate_multiplier. This feature is only available for open source models. Optional. Multiplier for adjusting the default learning rate. Mutually exclusive with learning_rate. This feature is only available for 1P models. Dataset distribution for Supervised Tuning. Output only. Sum of a given population of values that are billable. Output only. Defines the histogram bucket. Output only. The maximum of the population values. Output only. The arithmetic mean of the values in the population. Output only. The median of the values in the population. Output only. The minimum of the population values. Output only. The 5th percentile of the values in the population. Output only. The 95th percentile of the values in the population. Output only. Sum of a given population of values. Dataset bucket used to create a histogram for the distribution given a population of values. Output only. Number of values in the bucket. Output only. Left bound of the bucket. Output only. Right bound of the bucket. Tuning data statistics for Supervised Tuning. Output only. For each index in truncated_example_indices, the user-facing reason why the example was dropped. Output only. Number of billable characters in the tuning dataset. Output only. Number of billable tokens in the tuning dataset. Output only. The number of examples in the dataset that have been dropped. An example can be dropped for reasons including: too many tokens, contains an invalid image, contains too many images, etc. Output only. Number of tuning characters in the tuning dataset. Output only. A partial sample of the indices (starting from 1) of the dropped examples. Output only. Number of examples in the tuning dataset. Output only. 
Number of tuning steps for this Tuning Job. Output only. Sample user messages in the training dataset uri. Output only. Dataset distributions for the user input tokens. Output only. Dataset distributions for the messages per example. Output only. Dataset distributions for the user output tokens. Tuning Spec for Supervised Tuning for first party models. Optional. Evaluation Config for Tuning Job. Optional. If set to true, disable intermediate checkpoints for SFT and only the last checkpoint will be exported. Otherwise, enable intermediate checkpoints for SFT. Default is false. Optional. Hyperparameters for SFT. Required. Training dataset used for tuning. The dataset can be specified as either a Cloud Storage path to a JSONL file or as the resource name of a Vertex Multimodal Dataset. Tuning mode. Optional. Validation dataset used for tuning. The dataset can be specified as either a Cloud Storage path to a JSONL file or as the resource name of a Vertex Multimodal Dataset. Unset value, which will default to one of the other enum values. Specifies the given text is a query in a search/retrieval setting. Specifies the given text is a document from the corpus being searched. Specifies the given text will be used for STS. Specifies that the given text will be classified. Specifies that the embeddings will be used for clustering. Specifies that the given text will be used for question answering. Specifies that the given text will be used for fact verification. Specifies that the given text will be used for code retrieval. Output text returned from a model. Output only. Citation information for model-generated output in this TextCompletion. This field may be populated with attribution information for any text included in the output. Output only. The generated text returned from the model. Ratings for the safety of a response. There is at most one rating per category. Required. The prompt text. Text given to the model as a prompt. 
The Model will use this TextPrompt to generate a text completion. Required. The prompt text. Config for thinking features. Indicates whether to include thoughts in the response. If true, thoughts are returned only when available. The number of thoughts tokens that the model should generate. Optional. Controls the maximum depth of the model's internal reasoning process before it produces a response. If not specified, the default is HIGH. Recommended for Gemini 3 or later models. Using it with earlier models results in an error. Unspecified thinking level. Little to no thinking. Low thinking level. Medium thinking level. High thinking level. Tokens info with a list of tokens and the corresponding list of token ids. Optional. Optional fields for the role from the corresponding Content. A list of token ids from the input. A list of tokens from the input. Tool details that the model may use to generate response. A Tool is a piece of code that enables the system to interact with external systems to perform an action, or set of actions, outside of knowledge and scope of the model. A Tool object should contain exactly one type of Tool (e.g. FunctionDeclaration, Retrieval or GoogleSearchRetrieval). Tool details that the model may use to generate response. A Tool is a piece of code that enables the system to interact with external systems to perform an action, or set of actions, outside of knowledge and scope of the model. Optional. Tool to support searching public web data, powered by Vertex AI Search and Sec4 compliance. Optional. If specified, Vertex AI will use Parallel.ai to search for information to answer user queries. The search results will be grounded on Parallel.ai and presented to the model for response generation. Optional. Retrieval tool type. System will always execute the provided retrieval tool(s) to get external knowledge to answer the prompt. Retrieval results are presented to the model for generation. Optional. 
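The thinking options above appear in the generation config of a request. A payload sketch in REST-style camelCase; the field spellings `thinkingBudget` and `thinkingLevel` are assumptions from the public REST surface, and per the note above the level variant applies only to Gemini 3 or later:

```python
# Thought-token budget variant (earlier models): caps the number of
# thought tokens and asks for thoughts to be included when available.
gen_config_budget = {
    "thinkingConfig": {"includeThoughts": True, "thinkingBudget": 1024}
}

# Thinking-level variant (Gemini 3 or later); defaults to HIGH when omitted.
THINKING_LEVELS = {"LOW", "MEDIUM", "HIGH"}
gen_config_level = {"thinkingConfig": {"thinkingLevel": "HIGH"}}
```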
Enables the model to execute code as part of generation. Optional. Tool to support the model interacting directly with the computer. If enabled, it automatically populates computer-use specific Function Declarations. Optional. FileSearch tool type. Tool to retrieve knowledge from Semantic Retrieval corpora. Optional. A list of FunctionDeclarations available to the model that can be used for function calling. The model or system does not execute the function. Instead, the defined function may be returned as a FunctionCall with arguments to the client side for execution. The model may decide to call a subset of these functions by populating FunctionCall in the response. The next conversation turn may contain a FunctionResponse with the Content.role "function" generation context for the next model turn. Optional. Tool that allows grounding the model's response with geospatial context related to the user's query. Optional. GoogleSearch tool type. Tool to support Google Search in Model. Powered by Google. Optional. Retrieval tool that is powered by Google search. Optional. MCP Servers to connect to. Optional. Tool to support URL context retrieval. Tool that executes code generated by the model, and automatically returns the result to the model. See also [ExecutableCode] and [CodeExecutionResult], which are input and output to this tool. Tool to support computer use. Required. The environment being operated. Optional. By default, [predefined functions](https://cloud.google.com/vertex-ai/generative-ai/docs/computer-use#supported-actions) are included in the final model call. Some of them can be explicitly excluded from being automatically included. This can serve two purposes: 1. Using a more restricted / different action space. 2. Improving the definitions / instructions of predefined functions. Tool config. This config is shared for all tools provided in the request. The Tool configuration containing parameters for specifying Tool use in the request. Optional. 
Function calling config. Optional. Retrieval config. GoogleSearch tool type. Tool to support Google Search in Model. Powered by Google. Optional. Sites with confidence level chosen & above this value will be blocked from the search results. Optional. List of domains to be excluded from the search results. The default limit is 2000 domains. Example: ["amazon.com", "facebook.com"]. ParallelAiSearch tool type. A tool that uses the Parallel.ai search engine for grounding. ParallelAiSearch tool type. A tool that uses the Parallel.ai search engine for grounding. Optional. The API key for ParallelAiSearch. If an API key is not provided, the system will attempt to verify access by checking for an active Parallel.ai subscription through the Google Cloud Marketplace. See https://docs.parallel.ai/search/search-quickstart for more details. Optional. Custom configs for ParallelAiSearch. This field can be used to pass any parameter from the Parallel.ai Search API. See the Parallel.ai documentation for the full list of available parameters and their usage: https://docs.parallel.ai/api-reference/search-beta/search Currently only source_policy, excerpts, max_results, mode, fetch_policy can be set via this field. For example: { "source_policy": { "include_domains": ["google.com", "wikipedia.org"], "exclude_domains": ["example.com"] }, "fetch_policy": { "max_age_seconds": 3600 } } A list of `Tool`s that can be used by the model to improve its abilities. Initializes a new instance of the class. Initializes a new instance of the class with a list of delegates as functions. The delegates to be added as functions. Thrown when the delegates parameter is null. Adds the Google Search Retrieval tool to the list of tools. Adds the Code Execution tool to the list of tools. Adds the URL Context tool to the list of tools. Adds the Google Maps tool to the list of tools. Whether to return a token and enable the Google Maps widget (default is false). Adds the Computer Use tool to the list of tools. 
The environment being operated. Adds the tool to the list of tools. List of s to query. Adds a function to the list of function declarations. The name of the function. The description of the function. Adds a function to the list of function declarations. The delegate to be added as a function. Adds a function to the list of function declarations. The name of the function. The delegate to be added as a function. Adds a function to the list of function declarations. The name of the function. The description of the function. The delegate to be added as a function. Removes a function from the list of function declarations by name. The name of the function to remove. True if the function was removed, false otherwise. Removes a function from the list of function declarations by delegate. The delegate of the function to remove. True if the function was removed, false otherwise. Clears all functions from the list of function declarations. Gets the list of available function declarations. Invokes a function by name with the provided arguments. The name of the function to invoke. The arguments to pass to the function. The result of the function invocation. Candidates with top log probabilities at each decoding step. Sorted by log probability in descending order. Unspecified request traffic type. The request was processed using Pay-As-You-Go quota. Type for Provisioned Throughput traffic. Dataset for training or validation. Optional. Inline examples. Request to transfer the ownership of the tuned model. Required. The email address of the user to whom the tuned model is being transferred. Response from TransferOwnership. Optional. Config for telling the service how to chunk the data. If not provided, the service will use default parameters. TunedModelCheckpoint for the Tuned Model of a Tuning Job. The ID of the checkpoint. The Endpoint resource name that the checkpoint is deployed to. Format: projects/{project}/locations/{location}/endpoints/{endpoint}. 
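The function-declaration workflow documented above follows a registry pattern: functions are added, removed, and invoked by name, with the model returning a FunctionCall that the client executes and answers with a FunctionResponse. A language-neutral Python sketch of that registry follows; all names here are illustrative assumptions, not the library's C# API.

```python
# Illustrative sketch of an add/remove/invoke-by-name function registry,
# mirroring the behaviour described above. Names are assumptions, not
# the Mscc.GenerativeAI C# surface.

class FunctionRegistry:
    def __init__(self):
        self._functions = {}

    def add_function(self, name, func, description=""):
        """Register a callable under a name (the 'function declaration')."""
        self._functions[name] = {"func": func, "description": description}

    def remove_function(self, name):
        """Remove a declaration by name; True if it existed, False otherwise."""
        return self._functions.pop(name, None) is not None

    def invoke(self, name, **args):
        """Invoke a declared function with the arguments the model supplied."""
        if name not in self._functions:
            raise KeyError(f"No function declared with name '{name}'")
        return self._functions[name]["func"](**args)

registry = FunctionRegistry()
registry.add_function("add", lambda a, b: a + b, "Adds two numbers.")
result = registry.invoke("add", a=2, b=3)  # would become the FunctionResponse payload
```

The result of the invocation is what the client would send back to the model in the next turn as the function's response content.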
The epoch of the checkpoint. The step of the checkpoint. The Model Registry Model and Online Prediction Endpoint associated with this TuningJob. A fine-tuned model created using ModelService.CreateTunedModel. Output only. The checkpoints associated with this TunedModel. This field is only populated for tuning jobs that enable intermediate checkpoints. Output only. A resource name of an Endpoint. Format: projects/{project}/locations/{location}/endpoints/{endpoint}. Output only. The resource name of the TunedModel. Format: projects/{project}/locations/{location}/models/{model}@{version_id} When tuning from a base model, the version ID will be 1. For continuous tuning, if the provided tuned_model_display_name is set and different from parent model's display name, the tuned model will have a new parent model with version 1. Otherwise the version id will be incremented by 1 from the last version ID in the parent model. E.g., projects/{project}/locations/{location}/models/{model}@{last_version_id + 1} Immutable. The name of the Model to tune. Example: models/gemini-1.5-flash-001 Output only. The timestamp when this model was created. Optional. A short description of this model. Optional. The name to display for this model in user interfaces. The display name must be up to 40 characters including spaces. Output only. The tuned model name. A unique name will be generated on create. Example: tunedModels/az2mb0bpw6i If display_name is set on create, the id portion of the name will be set by concatenating the words of the display_name with hyphens and adding a random portion for uniqueness. Example: * display_name = Sentence Translator * name = tunedModels/sentence-translator-u3b7m Optional. List of project numbers that have read access to the tuned model. Output only. The state of the tuned model. Optional. Controls the randomness of the output. Values can range over [0.0,1.0], inclusive. 
A value closer to 1.0 will produce responses that are more varied, while a value closer to 0.0 will typically result in less surprising responses from the model. This value specifies default to be the one used by the base model while creating the model. Optional. For Top-k sampling. Top-k sampling considers the set of top_k most probable tokens. This value specifies default to be used by the backend while making the call to the model. This value specifies default to be the one used by the base model while creating the model. Optional. For Nucleus sampling. Nucleus sampling considers the smallest set of tokens whose probability sum is at least top_p. This value specifies default to be the one used by the base model while creating the model. Optional. TunedModel to use as the starting point for training the new model. Required. The tuning task that creates the tuned model. Output only. The timestamp when this model was updated. The default value. This value is unused. The model is being created. The model is ready to be used. The model failed to be created. Tuned model as a source for training a new model. Output only. The name of the base Model this TunedModel was tuned from. Example: models/gemini-1.5-flash-001 Immutable. The name of the TunedModel to use as the starting point for training the new model. Example: tunedModels/my-tuned-model The structured datatype containing multi-part content of an example message. This is a subset of the Content proto used during model inference with limited type support. A `Content` includes a `role` field designating the producer of the `Content` and a `parts` field containing multi-part data that contains the content of the message turn. The structured datatype containing multi-part content of an example message. This is a subset of the Content proto used during model inference with limited type support. 
A Content includes a role field designating the producer of the Content and a parts field containing multi-part data that contains the content of the message turn. Ordered `Parts` that constitute a single message. Parts may have different MIME types. Optional. The producer of the content. Must be either 'user' or 'model'. Useful to set for multi-turn conversations, otherwise can be left blank or unset. Ordered Parts that constitute a single message. Parts may have different MIME types. The ETag of the item. Initializes a new instance of the class. Initializes a new instance of the class. String to process. Initializes a new instance of the class. File to process. The tuning data statistic values for TuningJob. Output only. Statistics for distillation prompt dataset. These statistics do not include the responses sampled from the teacher model. Output only. Statistics for preference optimization. The SFT Tuning data stats. A single example for tuning. A single example for tuning. Required. The expected model output. Optional. Text model input. A set of tuning examples. Can be training or validation data. The examples. Example input can be for text or discuss, but all examples in a set must be of the same type. Content examples. For multiturn conversations. Represents a TuningJob that runs with Google owned models. The base model that is being tuned. See [Supported models](https://cloud.google.com/vertex-ai/generative-ai/docs/model-reference/tuning#supported_models). Output only. Time when the TuningJob was created. Optional. The user-provided path to custom model weights. Set this field to tune a custom model. The path must be a Cloud Storage directory that contains the model weights in .safetensors format along with associated model metadata files. If this field is set, the base_model field must still be set to indicate which base model the custom model is derived from. This feature is only available for open source models. Optional. 
The description of the TuningJob. Tuning Spec for Distillation. Customer-managed encryption key options for a TuningJob. If this is set, then all resources created by the TuningJob will be encrypted with the provided encryption key. Output only. Time when the TuningJob entered any of the following JobStates: JOB_STATE_SUCCEEDED, JOB_STATE_FAILED, JOB_STATE_CANCELLED, JOB_STATE_EXPIRED. Output only. Only populated when job's state is JOB_STATE_FAILED or JOB_STATE_CANCELLED. Output only. Evaluation runs for the Tuning Job. Output only. The Experiment associated with this TuningJob. Tuning Spec for Full Fine Tuning. Optional. The labels with user-defined metadata to organize TuningJob and generated resources such as Model and Endpoint. Label keys and values can be no longer than 64 characters (Unicode codepoints), can only contain lowercase letters, numeric characters, underscores and dashes. International characters are allowed. See https://goo.gl/xmQnxf for more information and examples of labels. Output only. Identifier. Resource name of a TuningJob. Format: projects/{project}/locations/{location}/tuningJobs/{tuning_job} Optional. Cloud Storage path to the directory where tuning job outputs are written to. This field is only available and required for open source models. Tuning Spec for open sourced and third party Partner models. Output only. The resource name of the PipelineJob associated with the TuningJob. Format: projects/{project}/locations/{location}/pipelineJobs/{pipeline_job}. The pre-tuned model for continuous tuning. Tuning Spec for Preference Optimization. Output only. Reserved for future use. Output only. Reserved for future use. The service account that the tuningJob workload runs as. If not specified, the Vertex AI Secure Fine-Tuned Service Agent in the project will be used. 
See https://cloud.google.com/iam/docs/service-agents#vertex-ai-secure-fine-tuning-service-agent Users starting the pipeline must have the iam.serviceAccounts.actAs permission on this service account. Output only. Time when the TuningJob for the first time entered the JOB_STATE_RUNNING state. Output only. The detailed state of the job. Tuning Spec for Supervised Fine Tuning. Output only. The tuned model resources associated with this TuningJob. Optional. The display name of the TunedModel. The name can be up to 128 characters long and can consist of any UTF-8 characters. For continuous tuning, tuned_model_display_name will by default use the same display name as the pre-tuned model. If a new display name is provided, the tuning job will create a new model instead of a new version. Output only. The tuning data statistics associated with this TuningJob. Output only. The detail state of the tuning job (while the overall JobState is running). Output only. Time when the TuningJob was most recently updated. Tuning Spec for Veo Tuning. The job state is unspecified. The job has been just created or resumed and processing has not yet begun. The service is preparing to run the job. The job is in progress. The job completed successfully. The job failed. The job is being cancelled. From this state the job may only go to either `JOB_STATE_SUCCEEDED`, `JOB_STATE_FAILED` or `JOB_STATE_CANCELLED`. The job has been cancelled. The job has been stopped, and can be resumed. The job has expired. The job is being updated. Only jobs in the `RUNNING` state can be updated. After updating, the job goes back to the `RUNNING` state. The job is partially succeeded, some results may be missing due to errors. Default tuning job state. Tuning job is waiting for job quota. Tuning job is validating the dataset. Tuning job is waiting for hardware capacity. Tuning job is running. Tuning job is doing some post processing steps. Default tuning job state. Tuning job is waiting for job quota. 
Tuning job is validating the dataset. Tuning job is waiting for hardware capacity. Tuning job is running. Tuning job is doing some post processing steps. Tuning mode is unspecified. Full fine-tuning mode. PEFT adapter tuning mode. A tuning example with multiturn input. Each Content represents a turn in the conversation. Optional. Developer set system instructions. Currently, text only. A datatype containing data that is part of a multi-part `TuningContent` message. This is a subset of the Part used for model inference, with limited type support. A `Part` consists of data which has an associated datatype. A `Part` can only contain one of the accepted types in `Part.data`. A datatype containing data that is part of a multi-part TuningContent message. This is a subset of the Part used for model inference, with limited type support. A Part consists of data which has an associated datatype. A Part can only contain one of the accepted types in Part.data. Inline text. Record for a single tuning step. Output only. The timestamp when this metric was computed. Output only. The epoch this step was part of. Output only. The mean loss of the training examples for this step. Output only. The tuning step. Tuning tasks that create tuned models. Output only. The timestamp when tuning this model completed. Immutable. Hyperparameters controlling the tuning process. If not provided, default values will be used. Output only. Metrics collected during tuning. Output only. The timestamp when tuning this model started. Required. Input only. Immutable. The model training data. Defines which input is included in the user's turn. If unspecified, the default behavior is `TURN_INCLUDES_ONLY_ACTIVITY`. The user's turn only includes activity since the last turn, excluding inactivity (e.g. silence on the audio stream). This is the default behavior. The user's turn includes all realtime input since the last turn, including inactivity (e.g. silence on the audio stream). Request to update a . Required. 
The to update. Required. The list of fields to update. Currently, this only supports updating and . Instance to upload a local file to create a File resource. Optional. Metadata for the file to create. Information about an uploaded file via File API. Ref: https://ai.google.dev/api/rest/v1beta/files Metadata for the created file. Request for UploadToFileSearchStore. Optional. Config for telling the service how to chunk the data. If not provided, the service will use default parameters. Custom metadata to be associated with the data. Optional. Display name of the created document. Optional. MIME type of the data. If not provided, it will be inferred from the uploaded content. Request for `UploadToRagStore`. Optional. Config for telling the service how to chunk the data. If not provided, the service will use default parameters. Custom metadata to be associated with the data. Optional. Display name of the created document. Optional. MIME type of the data. If not provided, it will be inferred from the uploaded content. Optional. When upscaling, the factor to which the image will be upscaled. If not specified, the upscale factor will be determined from the longer side of the input image and sampleImageSize. The factor to upscale the image (x2 or x4). Upscale factor 2 Upscale factor 4 Configuration for upscaling an image. The level of compression if the output_mime_type is image/jpeg. The image format that the output should be saved as. Request for image upscaling. Configuration for image upscaling. Response for the request to upscale an image. Output only. A list of the generated images. List of generated images. Tool to support URL context. Tool to support URL context retrieval. Tool to support URL context retrieval. Metadata returned when the model uses the url_context tool to get information from a user-provided URL. Metadata related to url context retrieval tool. List of url context. The metadata for a single URL retrieval. Context of a single url retrieval. 
Retrieved url by the tool. Status of the url retrieval. Context of a single url retrieval. Retrieved url by the tool. Metadata related to url context retrieval tool. List of url retrieval contexts. Default value. This value is unused. Url retrieval was successful. Url retrieval failed due to an error. Url retrieval failed because the content is behind a paywall. Url retrieval failed because the content is unsafe. Usage metadata about the content generation request and response. This message provides a detailed breakdown of token usage and other relevant metrics. Usage metadata about the content generation request and response. This message provides a detailed breakdown of token usage and other relevant metrics. This data type is not supported in Gemini API. Metadata on the generation request's token usage. Output only. The traffic type for this request. Unspecified request traffic type. Type for Pay-As-You-Go traffic. Type for Priority Pay-As-You-Go traffic. Type for Flex traffic. Type for Provisioned Throughput traffic. Number of text characters. Number of images. Duration of video in seconds. Duration of audio in seconds. Output only. List of modalities of the cached content in the request input. Number of tokens in the cached part of the prompt (the cached content). Total number of tokens across all the generated response candidates. Output only. List of modalities that were returned in the response. Number of tokens in the prompt. When cached_content is set, this is still the total effective prompt size, meaning this includes the number of tokens in the cached content. Output only. List of modalities that were processed in the request input. Output only. Number of tokens of thoughts for thinking models. Output only. Number of tokens present in tool-use prompt(s). Output only. List of modalities that were processed for tool-use request inputs. Total token count for the generation request (prompt + response candidates). Hyperparameters for Veo. Optional. 
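The usage-metadata fields above state that the total token count covers the prompt plus all response candidates, with thought and tool-use tokens reported as separate fields. A minimal sketch of that accounting follows; the field names are assumptions, and whether thought tokens fold into the total is not stated above, so the sketch implements only the quoted formula.

```python
# Sketch of the documented token accounting:
#   total = prompt tokens + tokens across all generated candidates.
# Thought and tool-use token counts are separate fields per the docs.
# Dictionary keys are illustrative, not the library's property names.

def total_token_count(prompt_tokens: int, candidates_tokens: int) -> int:
    """Total for the generation request (prompt + response candidates)."""
    return prompt_tokens + candidates_tokens

usage = {
    "prompt_token_count": 120,     # includes any cached-content tokens
    "candidates_token_count": 48,  # across all generated candidates
    "thoughts_token_count": 32,    # reported separately for thinking models
}
total = total_token_count(usage["prompt_token_count"],
                          usage["candidates_token_count"])
```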
Number of complete passes the model makes over the entire training dataset during training. Optional. Multiplier for adjusting the default learning rate. Optional. The tuning task. Either I2V or T2V. Optional. The ratio of Google internal dataset to use in the training mixture, in range of [0, 1). If 0.2, it means 20% of Google internal dataset and 80% of user dataset will be used for training. If not set, the default value is 0.1. Default value. This value is unused. Tuning task for image to video. Tuning task for text to video. Tuning task for reference to video. Tuning Spec for Veo Model Tuning. Optional. Hyperparameters for Veo. Required. Training dataset used for tuning. The dataset can be specified as either a Cloud Storage path to a JSONL file or as the resource name of a Vertex Multimodal Dataset. Optional. Validation dataset used for tuning. The dataset can be specified as either a Cloud Storage path to a JSONL file or as the resource name of a Vertex Multimodal Dataset. Retrieve from Vertex AI Search datastore or engine for grounding. datastore and engine are mutually exclusive. See https://cloud.google.com/products/agent-builder Specifications that define the specific DataStores to be searched, along with configurations for those data stores. This is only considered for Engines with multiple data stores. It should only be set if engine is used. Optional. Fully-qualified Vertex AI Search data store resource ID. Format: projects/{project}/locations/{location}/collections/{collection}/dataStores/{dataStore} Optional. Fully-qualified Vertex AI Search engine resource ID. Format: projects/{project}/locations/{location}/collections/{collection}/engines/{engine} Optional. Filter strings to be passed to the search API. Optional. Number of search results to return per query. The default value is 10. The maximum allowed value is 10. Config for the Vertex AI Search. Vertex AI Search Serving Config resource full name. 
For example, projects/{project}/locations/{location}/collections/{collection}/engines/{engine}/servingConfigs/{serving_config} or projects/{project}/locations/{location}/collections/{collection}/dataStores/{data_store}/servingConfigs/{serving_config}. Define data stores within engine to filter on in a search call and configurations for those data stores. For more information, see https://cloud.google.com/generative-ai-app-builder/docs/reference/rpc/google.cloud.discoveryengine.v1#datastorespec Full resource name of DataStore. Format: projects/{project}/locations/{location}/collections/{collection}/dataStores/{dataStore} Optional. Filter specification to filter documents in the data store specified by data_store field. For more information on filtering, see [Filtering](https://cloud.google.com/generative-ai-app-builder/docs/filter-search-metadata) The Vertex AI Feature Store FeatureView that the RAG corpus maps to. The embedding model to use for the RAG corpus. This value can't be changed after it's set. If you leave it empty, we use `text-embedding-004` as the embedding model. Retrieve from Vertex RAG Store for grounding. Optional. Deprecated. Please use rag_resources instead. Optional. The representation of the rag source. It can be used to specify corpus only or ragfiles. Currently, only one corpus or multiple files from one corpus are supported. In the future we may open up multiple corpora support. Optional. The retrieval config for the Rag query. Optional. Number of top k results to return from the selected corpora. Optional. Currently only supported for Gemini Multimodal Live API. In Gemini Multimodal Live API, if store_context bool is specified, Gemini will leverage it to automatically memorize the interactions between the client and Gemini, and retrieve context when needed to augment the response generation for users' ongoing and future interactions. Optional. Only return results with vector distance smaller than the threshold. 
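The Vertex RAG Store retrieval settings above combine two knobs: a top-k cut on the number of results and a vector-distance threshold ("only return results with vector distance smaller than the threshold"). A hedged Python sketch of how the two interact follows; all names are illustrative, and this is not the service's implementation.

```python
# Illustrative sketch of the two retrieval knobs described above:
# drop results whose vector distance meets or exceeds the threshold,
# then return at most top_k of the rest, closest first.
# Not the Vertex AI implementation; names are assumptions.

def retrieve(results, top_k=10, vector_distance_threshold=None):
    """results: list of (chunk_text, distance) pairs; smaller = closer."""
    if vector_distance_threshold is not None:
        results = [r for r in results if r[1] < vector_distance_threshold]
    results = sorted(results, key=lambda r: r[1])
    return results[:top_k]

hits = [("a", 0.9), ("b", 0.2), ("c", 0.5), ("d", 0.7)]
top = retrieve(hits, top_k=2, vector_distance_threshold=0.8)
# keeps b, c, d under the threshold, then the closest two: b and c
```

The threshold prunes weak matches before the top-k cut, so a tight threshold can return fewer than top_k results.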
The definition of the Rag resource. Optional. RagCorpora resource name. Format: projects/{project}/locations/{location}/ragCorpora/{rag_corpus} Optional. rag_file_id. The files should be in the same rag_corpus set in rag_corpus field. Vertex Vector Search instance. This is the resource name of the Vector Search index that's used with the RAG corpus. Format: projects/{project}/locations/{location}/indexes/{index} The resource name of the Vector Search Index Endpoint. Format: projects/{project}/locations/{location}/indexEndpoints/{index_endpoint} A generated video. Path to another storage. Video bytes. Video encoding, for example "video/mp4". Metadata for a video . Metadata for a video File. Starting offset of a video. Ending offset of a video. Should be larger than the . Duration of the video. Provides metadata for a video, including the start and end offsets for clipping and the frame rate. Metadata describes the input video content. Deprecated: Use GenerateContentRequest.processing_options instead. Metadata describes the input video content. Optional. The end offset of the video. Optional. The frame rate of the video sent to the model. If not specified, the default value will be 1.0. The fps range is (0.0, 24.0]. Optional. The start offset of the video. Represents the result from a vision generative model. Configuration for a voice. The configuration for the voice to use. Optional. The configuration for a replicated voice. This enables users to replicate a voice from an audio sample. The configuration for the prebuilt voice to use. Specifies your Weaviate instance. The Weaviate instance's HTTP endpoint. The Weaviate collection that the RAG corpus maps to. Chunk from the web. Output only. Title of the chunk. Output only. URI reference of the chunk. A `Web` chunk is a piece of evidence that comes from a web page. It contains the URI of the web page, the title of the page, and the domain of the page. 
This is used to provide the user with a link to the source of the information. The domain of the web page that contains the evidence. This can be used to filter out low-quality sources. This field is not supported in Gemini API. The URI of the web page that contains the evidence. The title of the web page that contains the evidence. Standard web search for grounding and related configurations. Configuration for a white space chunking algorithm [white space delimited]. Maximum number of overlapping tokens between two adjacent chunks. Maximum number of tokens per chunk. Tokens are defined as words for this chunking algorithm. Note: we are defining tokens as words split by whitespace as opposed to the output of a tokenizer. The context window of the latest gemini embedding model as of 2025-04-17 is currently 8192 tokens. We assume that the average word is 5 characters. Therefore, we set the upper limit to 2**9, which is 512 words, or 2560 tokens, assuming worst case a character per token. This is a conservative estimate meant to prevent context window overflow. An explanation method that redistributes Integrated Gradients attributions to segmented regions, taking advantage of the model's fully differentiable structure. Refer to this paper for more details: https://arxiv.org/abs/1906.02825 Supported only by image Models. Config for XRAI with blur baseline. When enabled, a linear path from the maximally blurred image to the input image is created. Using a blurred baseline instead of zero (black image) is motivated by the BlurIG approach explained here: https://arxiv.org/abs/2004.03383 Config for SmoothGrad approximation of gradients. When enabled, the gradients are approximated by averaging the gradients from noisy samples in the vicinity of the inputs. Adding noise can help improve the computed gradients. Refer to this paper for more details: https://arxiv.org/pdf/1706.03825.pdf Required. The number of steps for approximating the path integral. 
A good value to start is 50; gradually increase it until the sum-to-diff property is met within the desired error range. Valid range of its value is [1, 100], inclusively. Converts a given string to snake case. The string to be converted to snake case. The resulting snake case string. Converts a given string to camel case. The string to be converted to camel case. Whether to remove whitespace or not. Whether to preserve the leading underscore or not. The resulting camel case string. Converts the specified string to PascalCase. The string to convert. The PascalCase version of the string. Splits a given camel case string into separate words using the specified separator. The camel case string to be split. The separator to be used. By default, a single space is used. The resulting string with words separated by the specified separator. Converts a string to kebab-case, with words separated by hyphens. The input string to be converted to kebab-case. A kebab-case representation of the input string. Extension method to convert a given string to title case. The string to convert to title case. A new string with each word in title case. Convert a string to Train Case string Insert any character before all upper case characters in a string string Insert a space before any upper case character in a string string Replace specific characters found in a string See: https://stackoverflow.com/a/7265786/7986443 string Replace all whitespace in a string See: https://stackoverflow.com/questions/6219454/efficient-way-to-remove-all-whitespace-from-string string Extension method to check if all the letters in the input string are uppercase. The string to check for uppercase letters. True if all the letters in the input string are uppercase, otherwise false. 
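The case-conversion helpers documented above (snake case, camel case, PascalCase, kebab-case) follow standard transformations. A language-neutral Python sketch of the common cases follows; the library's C# extension methods may handle acronyms, whitespace, and leading underscores differently, so treat these as assumptions.

```python
import re

# Illustrative sketches of the documented case conversions.
# The library's C# extension methods may treat acronyms and
# underscores differently; these cover the common cases only.

def to_snake_case(s: str) -> str:
    """camelCase/PascalCase -> snake_case."""
    return re.sub(r"(?<=[a-z0-9])([A-Z])", r"_\1", s).lower()

def to_camel_case(s: str) -> str:
    """snake_case -> camelCase."""
    head, *rest = s.split("_")
    return head.lower() + "".join(w.capitalize() for w in rest)

def to_kebab_case(s: str) -> str:
    """camelCase/PascalCase -> kebab-case (words separated by hyphens)."""
    return to_snake_case(s).replace("_", "-")
```

The same word-boundary split (a lowercase letter or digit followed by an uppercase letter) also underlies the "split camel case into separate words" and Train Case helpers described above.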
Convert SnakeCase to CamelCase See: https://www.codegrepper.com/code-examples/csharp/camelCase+and+snakeCase string Convert the first character in a string to lower case See: https://stackoverflow.com/questions/21755757/first-character-of-string-lowercase-c-sharp/21755933 string Convert the first character in a string to upper case string The `CorporaModel` class provides methods for interacting with a corpus of documents. Initializes a new instance of the class. Initializes a new instance of the class. Optional. The to use for creating HttpClient instances. Optional. Logger instance used for logging Creates an empty `Corpus`. Options for the request. A cancellation token that can be used by other objects or threads to receive notice of cancellation. Updates a `Corpus`. Options for the request. A cancellation token that can be used by other objects or threads to receive notice of cancellation. Lists all `Corpora` owned by the user. The maximum number of Corpora to return (per page). A page token, received from a previous List call. Provide the pageToken returned by one request as an argument to the next request to retrieve the next page. Options for the request. A cancellation token that can be used by other objects or threads to receive notice of cancellation. Gets information about a specific `Corpus`. Name of the corpus. Options for the request. A cancellation token that can be used by other objects or threads to receive notice of cancellation. Deletes a `Corpus`. Options for the request. A cancellation token that can be used by other objects or threads to receive notice of cancellation. Performs semantic search over a `Corpus`. Options for the request. A cancellation token that can be used by other objects or threads to receive notice of cancellation. Initializes a new instance of the class. Initializes a new instance of the class. Optional. The to use for creating HttpClient instances. Optional. Logger instance used for logging Creates an empty . Required. 
The name of the where this exists. Example: `ragStores/my-rag-store-123` Options for the request. A cancellation token that can be used by other objects or threads to receive notice of cancellation. Thrown when the is or empty. Deletes a . Required. The name of the . Required. The name of the where this exists. Example: `ragStores/my-rag-store-123` Options for the request. A cancellation token that can be used by other objects or threads to receive notice of cancellation. Thrown when the is or empty. Thrown when the is or empty. Gets information about a specific . Required. The name of the . Required. The name of the where this exists. Example: `ragStores/my-rag-store-123` Options for the request. A cancellation token that can be used by other objects or threads to receive notice of cancellation. Thrown when the is or empty. Thrown when the is or empty. Lists all s in a . Required. The name of the where this exists. Example: `ragStores/my-rag-store-123` The maximum number of items to return (per page). A page token, received from a previous List call. Provide the pageToken returned by one request as an argument to the next request to retrieve the next page. Options for the request. A cancellation token that can be used by other objects or threads to receive notice of cancellation. Thrown when the is or empty. Performs semantic search over a . Required. The name of the . Required. The name of the where this exists. Example: `ragStores/my-rag-store-123` Options for the request. A cancellation token that can be used by other objects or threads to receive notice of cancellation. Thrown when the is or empty. Thrown when the is or empty. Thrown when the functionality is not supported by the model. Thrown when the request fails to execute. Updates a . Required. The name of the . Required. The name of the where this exists. Example: `ragStores/my-rag-store-123` Options for the request. 
A cancellation token that can be used by other objects or threads to receive notice of cancellation. Initializes a new instance of the class. Initializes a new instance of the class. The default constructor attempts to read .env file and environment variables. Sets default values, if available. Optional. The to use for creating HttpClient instances. Optional. Logger instance used for logging Generates a model response given an input `GenerateContentRequest`. Refer to the [text generation guide](https://ai.google.dev/gemini-api/docs/text-generation) for detailed usage information. Input capabilities differ between models, including tuned models. Refer to the [model guide](https://ai.google.dev/gemini-api/docs/models/gemini) and [tuning guide](https://ai.google.dev/gemini-api/docs/model-tuning) for details. Options for the request. A cancellation token that can be used by other objects or threads to receive notice of cancellation. Generates a [streamed response](https://ai.google.dev/gemini-api/docs/text-generation?lang=python#generate-a-text-stream) from the model given an input `GenerateContentRequest`. Options for the request. A cancellation token that can be used by other objects or threads to receive notice of cancellation. Generates embeddings from the model given an input. Initializes a new instance of the class. Initializes a new instance of the class. Optional. The to use for creating HttpClient instances. Optional. Logger instance used for logging Generates embeddings from the model given an input. Required. The request to send to the API. Options for the request. A cancellation token that can be used by other objects or threads to receive notice of cancellation. Thrown when the is . Initializes a new instance of the class. Initializes a new instance of the class. Optional. The to use for creating HttpClient instances. Optional. Logger instance used for logging Creates an empty . Required. The `FileSearchStore`. Options for the request. 
A cancellation token that can be used by other objects or threads to receive notice of cancellation. Thrown when the request is null. Thrown when the functionality is not supported by the model. Thrown when the request fails to execute. Deletes a . Required. Immutable. The name of the `FileSearchStore` to delete. Example: `fileSearchStores/my-file-search-store-123` Optional. If set to true, any `Chunk`s and objects related to this `Document` will also be deleted. If false (the default), a `FAILED_PRECONDITION` error will be returned if `Document` contains any `Chunk`s. Options for the request. A cancellation token that can be used by other objects or threads to receive notice of cancellation. Thrown when the is or empty. Thrown when the functionality is not supported by the model. Thrown when the request fails to execute. Gets information about a specific . Required. Immutable. The name of the `FileSearchStore` to get. Example: `fileSearchStores/my-file-search-store-123` Options for the request. A cancellation token that can be used by other objects or threads to receive notice of cancellation. Thrown when the is or empty. Thrown when the functionality is not supported by the model. Thrown when the request fails to execute. Lists all s owned by the user. The maximum number of items to return (per page). A page token, received from a previous List call. Provide the pageToken returned by one request as an argument to the next request to retrieve the next page. Options for the request. A cancellation token that can be used by other objects or threads to receive notice of cancellation. Thrown when the functionality is not supported by the model. Thrown when the request fails to execute. Imports a from File Service to a . Required. Immutable. The name of the to import the file into. Example: `fileSearchStores/my-file-search-store-123` Options for the request. 
A cancellation token that can be used by other objects or threads to receive notice of cancellation. Thrown when the is or empty. Imports a from File Service to a . Required. Immutable. The name of the to import the file into. Example: `fileSearchStores/my-file-search-store-123` The from the Files API. Custom metadata to be associated with the file. Options for the request. A cancellation token that can be used by other objects or threads to receive notice of cancellation. Thrown when the is or empty. Thrown when the is . Imports a from File Service to a . Required. Immutable. The name of the to import the file into. Example: `fileSearchStores/my-file-search-store-123` Name of the . Custom metadata to be associated with the file. Options for the request. A cancellation token that can be used by other objects or threads to receive notice of cancellation. An operation of the imported file. Thrown when the is or empty. Thrown when the is or empty. Uploads data to a , preprocesses and chunks before storing it in a Document. Name of the File Search Store. URI or path to the file to upload. A name displayed for the uploaded file. Configuration settings for the uploaded file. Flag indicating whether to use resumable upload. Options for the request. A cancellation token that can be used by other objects or threads to receive notice of cancellation. An operation of the uploaded file. Thrown when the is null or empty. Thrown when the file is not found. Thrown when the file size exceeds the maximum allowed size. Thrown when the file upload fails. Thrown when the request fails to execute. Thrown when the MIME type of the URI is not supported by the API. Uploads data to a , preprocesses and chunks before storing it in a Document. Name of the File Search Store. Stream to upload. A name displayed for the uploaded file. The MIME type of the stream content. Configuration settings for the uploaded file. Flag indicating whether to use resumable upload. Options for the request. 
A cancellation token that can be used by other objects or threads to receive notice of cancellation. An operation of the uploaded file. Thrown when the is null or empty. Thrown when the size exceeds the maximum allowed size. Thrown when the upload fails. Thrown when the request fails to execute. Thrown when the is not supported by the API. Compatibility method with Google SDK. Use for convenience. Name of the File Search Store. URI or path to the file to upload. Configuration settings for the uploaded file. Flag indicating whether to use resumable upload. Options for the request. A cancellation token that can be used by other objects or threads to receive notice of cancellation. An operation of the uploaded file. Compatibility method with Google SDK. Use for convenience. Name of the File Search Store. Stream to upload. A name displayed for the uploaded file. The MIME type of the stream content. Configuration settings for the uploaded file. Flag indicating whether to use resumable upload. Options for the request. A cancellation token that can be used by other objects or threads to receive notice of cancellation. An operation of the uploaded file. Thrown when the is null or empty. Thrown when the size exceeds the maximum allowed size. Thrown when the upload fails. Thrown when the request fails to execute. Thrown when the is not supported by the API. Initializes a new instance of the class. Initializes a new instance of the class. Optional. The IHttpClientFactory to use for creating HttpClient instances. Optional. Logger instance used for logging Lists the metadata for Files owned by the requesting project. The maximum number of Files to return (per page). A page token, received from a previous ListFiles call. Provide the pageToken returned by one request as an argument to the next request to retrieve the next page. Options for the request. A cancellation token that can be used by other objects or threads to receive notice of cancellation. List of files in File API. 
Thrown when the functionality is not supported by the model. Thrown when the request fails to execute. Gets the metadata for the given File. Required. The resource name of the file to get. This name should match a file name returned by the ListFiles method. Format: files/file-id. Options for the request. A cancellation token that can be used by other objects or threads to receive notice of cancellation. Metadata for the given file. Thrown when the is or empty. Thrown when the functionality is not supported by the model. Thrown when the request fails to execute. Deletes a file. Required. The resource name of the file to delete. This name should match a file name returned by the ListFiles method. Format: files/file-id. Options for the request. A cancellation token that can be used by other objects or threads to receive notice of cancellation. If successful, the response body is empty. Thrown when the is or empty. Thrown when the functionality is not supported by the model. Thrown when the request fails to execute. Registers Google Cloud Storage files with the FileService. The user is expected to provide Google Cloud Storage URIs and will receive a File resource for each URI in return. Note that the files are not copied, just registered with File API. If one file fails to register, the whole request fails. Request with GCS URIs to register. Options for the request. A cancellation token that can be used by other objects or threads to receive notice of cancellation. Thrown when the is . A custom JSON converter factory for enums that allows for flexible parsing. It can handle enums represented as strings (including snake_case and kebab-case) and integers. Determines whether this converter can convert the specified type. The type to check. true if the type is an enum; otherwise, false. Creates a converter for the specified type. The type to convert. The JSON serializer options. A new instance of the generic . Reads and converts the JSON to type . The to read from. 
The type to convert. The JSON serializer options. The converted enum value. Thrown if the JSON value cannot be converted to the enum type. Writes a specified value as JSON. The to write to. The value to write. The JSON serializer options. Merges a sequence of objects into a single one. The first candidate of each response is taken; their collections are concatenated in the order the responses are given. For duplicated metadata fields the value from the *last* response that provides a non-null value overwrites earlier ones (“last value wins”). Initializes a new instance of the class. Initializes a new instance of the class. Optional. The IHttpClientFactory to use for creating HttpClient instances. Optional. Logger instance used for logging Lists the generated files owned by the requesting project. The maximum number of Files to return (per page). A page token, received from a previous ListFiles call. Provide the pageToken returned by one request as an argument to the next request to retrieve the next page. Options for the request. A cancellation token that can be used by other objects or threads to receive notice of cancellation. List of files in File API. Thrown when the functionality is not supported by the model. Thrown when the request fails to execute. Checks whether the API key satisfies the required conditions. API key for the Gemini API. Thrown when the is null. Thrown when the is empty. Thrown when the has extra whitespace at the start or end, doesn't start with 'AIza', or has the wrong length. Checks if the functionality is supported by the model. Model to use. Message to use. Thrown when the functionality is not supported by the model. Checks if the IANA standard MIME type is supported by the model. See for a list of supported image data and video format MIME types. See for a list of supported audio format MIME types. The IANA standard MIME type to check. Thrown when the is not supported by the API. Checks if the IANA standard MIME type is supported by the model. 
See for a list of supported image data and video format MIME types. See for a list of supported audio format MIME types. See also for a list of supported MIME types for document processing. Ref: https://developer.mozilla.org/en-US/docs/Web/HTTP/MIME_types/Common_types The IANA standard MIME type to check. Thrown when the is not supported by the API. A comprehensive and complete list of MIME types supported by the Gemini API's File Search feature. This list has been verified to include all text/x-* types. Source: https://ai.google.dev/gemini-api/docs/file-search#supported-files Checks if the language is supported by the model. Language to use. Thrown when the is not supported by the API. Checks if invalid characters are part of the name of an entity. The name of the URL resource. Thrown when contains invalid characters. Sanitizes the model name by ensuring it starts with "models/" unless it is a tuned model. The model name to sanitize. The sanitized model name. Sanitizes the file name by ensuring it starts with "files/". The file name to sanitize. The sanitized file name. Sanitizes the generated file name by ensuring it starts with "generatedFiles/". The generated file name to sanitize. The sanitized generated file name. Sanitizes the cached content name by ensuring it starts with "cachedContents/". The cached content name to sanitize. The sanitized cached content name. Sanitizes the batch name by ensuring it starts with "batches/". The batch name to sanitize. The sanitized batch name. Sanitizes the tuning job name by ensuring it starts with "tuningJobs/". The tuning job name to sanitize. The sanitized tuning job name. Sanitizes the endpoint name. The endpoint name to sanitize. The sanitized endpoint name. Sanitizes the file search store name by ensuring it starts with "fileSearchStores/". The file search store name to sanitize. The sanitized file search store name. Sanitizes the corpora name by ensuring it starts with "corpora/". The corpora name to sanitize. 
The sanitized corpora name. Sanitizes the document name by ensuring it starts with "documents/". The document name to sanitize. The sanitized document name. Populates a Part from a byte array string and mime type. The part to populate. The string representation of the byte array. The mime type of the data. The populated part. Populates a Part from a code execution result. The part to populate. The outcome of the code execution. The output of the code execution. The populated part. Populates a Part from executable code. The part to populate. The executable code. The language of the code. The populated part. Populates a Part from a function call. The part to populate. The name of the function. The arguments for the function call. The populated part. Populates a Part from a function response. The part to populate. The name of the function. The response from the function. The populated part. Populates a Part from a text string. The part to populate. The text value. The populated part. Populates a Part from a URI. The part to populate. The URI of the file data. The mime type of the file. If null, it will be inferred from the URI. The populated part. Creates an Image object from a file URI. The URI of the image file. The mime type of the image. If null, it will be inferred from the URI. An Image object. Checks if a string is a valid Base64 string. The string to check. True if the string is a valid Base64 string, otherwise false. Checks if a string is valid JSON. The string to check. True if the string is valid JSON, otherwise false. Throws an exception if the IsSuccessStatusCode property for the HTTP response is false. The HTTP response message to check. Custom error message to prepend to the message. Include the response content in the error message. A cancellation token that can be used by other objects or threads to receive notice of cancellation. The HTTP response message if the call is successful. Throws an exception if the property for the HTTP response is false. 
The HTTP response. A cancellation token that can be used by other objects or threads to receive notice of cancellation. The HTTP response was not successful. Reads an image file from a URL as a byte array. The URL of the image file. A byte array of the image file. Reads an image file from a URL as a byte array. The URL of the image file. A cancellation token. A byte array of the image file. Reads an image file from a URL and returns it as a Base64 encoded string. The URL of the image file. A cancellation token. A Base64 encoded string of the image file. Gets the MIME type from a file extension in a URI. The URI of the file. The inferred MIME type, or "application/octet-stream" if not found. Thrown when the is null. Truncates/abbreviates a string and places a user-facing indicator at the end. The string to truncate. Maximum length of the resulting string. Optional. Indicator to use, by default the ellipsis … The truncated string Thrown when the parameter is null or empty. Thrown when the length of the is larger than the . Gets the normalized (snake_case) name of a delegate's method. The delegate to get the name from. The snake_case method name. Converts HTTP headers to a formatted string, redacting sensitive information. The HttpHeaders to format. A formatted string representation of the headers. Configures the request to expect a JSON response. Gets the tools for the request, initializing the collection if it's null. The list of tools for the request. Adds the Google Search tool if it's not already present. The updated list of tools. Adds the Google Maps tool if it's not already present. Whether to enable the widget for Google Maps. The updated list of tools. Adds the Google Search Retrieval tool for grounding if it's not already present. The updated list of tools. Adds the Code Execution tool if it's not already present. The updated list of tools. Adds the URL Context tool if it's not already present. The updated list of tools. 
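The tool helpers above (Google Search, Google Maps, Code Execution, URL Context, etc.) all follow the same get-or-initialize pattern: fetch the request's tool collection, creating it if null, append the tool only when it is not already present, and return the updated list. A minimal sketch of that pattern, using hypothetical, simplified `Tool` and request types rather than the library's actual classes:

```csharp
using System.Collections.Generic;
using System.Linq;

// Hypothetical, simplified types for illustration only.
public class Tool
{
    public object? GoogleSearch { get; set; }
    public object? CodeExecution { get; set; }
}

public class GenerateContentRequest
{
    public List<Tool>? Tools { get; set; }

    // Gets the tools for the request, initializing the collection if it's null.
    public List<Tool> GetTools() => Tools ??= new List<Tool>();

    // Adds the Google Search tool if it's not already present,
    // then returns the updated list so calls can be chained.
    public List<Tool> AddGoogleSearch()
    {
        var tools = GetTools();
        if (!tools.Any(t => t.GoogleSearch is not null))
            tools.Add(new Tool { GoogleSearch = new object() });
        return tools;
    }
}
```

Returning the updated list from each helper is what allows the "Adds the ... tool if it's not already present. The updated list of tools." contract described above; calling a helper twice leaves the collection unchanged.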
Represents a generative model that can be used to generate content, count tokens, and perform other generative AI tasks. Optional. A list of unique SafetySetting instances for blocking unsafe content. Optional. Configuration options for model generation and outputs. Optional. A list of Tools the model may use to generate the next response. Optional. Configuration of tools used by the model. Optional. Instructions for the model to steer it toward better performance. Gets the appropriate API URL based on the configuration (Google AI vs. Vertex AI, custom endpoint, etc.). Gets or sets the API version. For Vertex AI, this is fixed to v1. Determines the API method name based on the model and configuration. Gets a value indicating whether the model is configured to use Vertex AI. You can enable Server-Sent Events (SSE) for gemini-1.0-pro. See Server-Sent Events. Gets or sets a value indicating whether to activate JSON Mode, which instructs the model to generate a JSON object. Gets or sets a value indicating whether to activate grounding with Google Search, which connects the model to real-time information. Gets or sets a value indicating whether to activate the Google Search tool, allowing the model to query Google Search. Gets or sets a value indicating whether to activate automatic code execution, allowing the model to run code. Gets or sets a value indicating whether to enable a realtime stream using the Multimodal Live API. Initializes a new instance of the class. The default constructor attempts to read `.env` file and environment variables to set default values. Initializes a new instance of the class with specified HTTP client factory and logger. The default constructor attempts to read .env file and environment variables. Sets default values, if available. Optional. The to use for creating HttpClient instances. Optional. Logger instance used for logging Initializes a new instance of the class for use with Google AI, with optional express Vertex AI configuration. 
API key provided by Google AI Studio Model to use Optional. Configuration options for model generation and outputs. Optional. A list of unique SafetySetting instances for blocking unsafe content. Optional. A list of Tools the model may use to generate the next response. Optional. Configuration of tools. Optional. Flag to indicate use of Vertex AI in express mode. Optional. The IHttpClientFactory to use for creating HttpClient instances. Optional. Logger instance used for logging Initializes a new instance of the class for use with Vertex AI. Identifier of the Google Cloud project Region to use Model to use Access token for the Google Cloud project. Optional. Endpoint ID of the tuned model to use. Optional. Configuration options for model generation and outputs. Optional. A list of unique SafetySetting instances for blocking unsafe content. Optional. A list of Tools the model may use to generate the next response. Optional. Configuration of tools. Optional. The IHttpClientFactory to use for creating HttpClient instances. Optional. Logger instance used for logging Initializes a new instance of the class from a object. Content that has been preprocessed. Optional. Configuration options for model generation and outputs. Optional. A list of unique SafetySetting instances for blocking unsafe content. Optional. The IHttpClientFactory to use for creating HttpClient instances. Optional. Logger instance used for logging Thrown when is null. Initializes a new instance of the class from a object. Tuning Job to use with the model. Optional. Configuration options for model generation and outputs. Optional. A list of unique SafetySetting instances for blocking unsafe content. Optional. The IHttpClientFactory to use for creating HttpClient instances. Optional. Logger instance used for logging Initializes a new instance of the class for testing purposes, allowing injection of a custom . 
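As a usage illustration of the two main constructors described above (parameter names and the model identifier are indicative, not authoritative; check the Mscc.GenerativeAI package for the exact signatures):

```csharp
using Mscc.GenerativeAI;

// Google AI: configure with an API key from Google AI Studio.
// (Values may also be picked up from a .env file or environment variables.)
var googleAiModel = new GenerativeModel(
    apiKey: "YOUR_API_KEY",        // placeholder, not a real key
    model: "gemini-1.5-flash");

// Vertex AI: configure with project ID, region, and an access token.
var vertexAiModel = new GenerativeModel(
    projectId: "my-gcp-project",   // hypothetical project ID
    region: "us-central1",         // the documented default region
    model: "gemini-1.5-flash",
    accessToken: "ya29....");      // e.g. from `gcloud auth print-access-token`
```

Per the remarks above, omitted values fall back to the `.env` file and environment variables where available.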
Gets a list of available tuned models and their descriptions. List of available tuned models. The maximum number of Models to return (per page). A page token, received from a previous ListModels call. Provide the pageToken returned by one request as an argument to the next request to retrieve the next page. Optional. A filter is a full text search over the tuned model's description and display name. By default, results will not include tuned models shared with everyone. Additional operators: - owner:me - writers:me - readers:me - readers:everyone When set to `true`, operations that are reachable are returned as normal, and those that are unreachable are returned in the [ListOperationsResponse.unreachable] field. This can only be `true` when reading across collections e.g. when `parent` is set to `"projects/example/locations/-"`. This field is not by default supported and will result in an `UNIMPLEMENTED` error if set unless explicitly documented otherwise in service or product specific documentation. Options for the request. A cancellation token that can be used by other objects or threads to receive notice of cancellation. Lists the [`Model`s](https://ai.google.dev/gemini-api/docs/models/gemini) available through the Gemini API. List of available models. Flag, whether models or tuned models shall be returned. The maximum number of `Models` to return (per page). If unspecified, 50 models will be returned per page. This method returns at most 1000 models per page, even if you pass a larger page_size. A page token, received from a previous ListModels call. Provide the pageToken returned by one request as an argument to the next request to retrieve the next page. Optional. A filter is a full text search over the tuned model's description and display name. By default, results will not include tuned models shared with everyone. Additional operators: - owner:me - writers:me - readers:me - readers:everyone Options for the request. 
A cancellation token that can be used by other objects or threads to receive notice of cancellation. Thrown when the functionality is not supported by the model. Thrown when the request fails to execute. Gets information about a specific `Model` such as its version number, token limits, [parameters](https://ai.google.dev/gemini-api/docs/models/generative-models#model-parameters) and other metadata. Refer to the [Gemini models guide](https://ai.google.dev/gemini-api/docs/models/gemini) for detailed model information. Required. The resource name of the model. This name should match a model name returned by the ListModels method. Format: models/model-id or tunedModels/my-model-id Options for the request. A cancellation token that can be used by other objects or threads to receive notice of cancellation. Thrown when the functionality is not supported by the model. Thrown when the request fails to execute. Copies a model in Vertex AI Model Registry. Options for the request. A cancellation token that can be used by other objects or threads to receive notice of cancellation. Thrown when the functionality is not supported by the model. Creates a tuned model. Options for the request. A cancellation token that can be used by other objects or threads to receive notice of cancellation. Thrown when the functionality is not supported by the model. Deletes a tuned model. Required. The resource name of the model. Format: tunedModels/my-model-id Options for the request. A cancellation token that can be used by other objects or threads to receive notice of cancellation. If successful, the response body is empty. Thrown when the is null or empty. Thrown when the functionality is not supported by the model. Updates a tuned model. Required. The resource name of the model. Format: tunedModels/my-model-id The tuned model to update. Optional. The list of fields to update. This is a comma-separated list of fully qualified names of fields. Example: "user.displayName,photo". 
Options for the request. A cancellation token that can be used by other objects or threads to receive notice of cancellation. Thrown when the is null or empty. Thrown when the functionality is not supported by the model. Transfers ownership of the tuned model. This is the only way to change ownership of the tuned model. The current owner will be downgraded to writer role. Required. The resource name of the tuned model whose ownership will be transferred. Format: tunedModels/my-model-id Required. The email address of the user to whom the tuned model is being transferred. Options for the request. A cancellation token that can be used by other objects or threads to receive notice of cancellation. If successful, the response body is empty. Thrown when the or is null or empty. Thrown when the functionality is not supported by the model. Uploads a file to the File API backend. URI or path to the file to upload. A name displayed for the uploaded file. Flag indicating whether to use resumable upload. Options for the request. A cancellation token to cancel the upload. A URI of the uploaded file. Thrown when the is null or empty. Thrown when the file is not found. Thrown when the file size exceeds the maximum allowed size. Thrown when the file upload fails. Thrown when the request fails to execute. Uploads a stream to the File API backend. Stream to upload. A name displayed for the uploaded file. The MIME type of the stream content. Flag indicating whether to use resumable upload. Options for the request. A cancellation token to cancel the upload. A URI of the uploaded file. Thrown when the is null or empty. Thrown when the size exceeds the maximum allowed size. Thrown when the upload fails. Thrown when the request fails to execute. Lists the metadata for Files owned by the requesting project. The maximum number of Files to return (per page). A page token, received from a previous ListFiles call. 
Provide the pageToken returned by one request as an argument to the next request to retrieve the next page. Options for the request. A cancellation token that can be used by other objects or threads to receive notice of cancellation. List of files in File API. Thrown when the functionality is not supported by the model. Thrown when the request fails to execute. Gets the metadata for the given File. Required. The resource name of the file to get. This name should match a file name returned by the ListFiles method. Format: files/file-id. Options for the request. A cancellation token that can be used by other objects or threads to receive notice of cancellation. Metadata for the given file. Thrown when the is null or empty. Thrown when the functionality is not supported by the model. Thrown when the request fails to execute. Deletes a file. Required. The resource name of the file to delete. This name should match a file name returned by the ListFiles method. Format: files/file-id. Options for the request. A cancellation token that can be used by other objects or threads to receive notice of cancellation. If successful, the response body is empty. Thrown when the is null or empty. Thrown when the functionality is not supported by the model. Thrown when the request fails to execute. Generates a model response given an input . Refer to the [text generation guide](https://ai.google.dev/gemini-api/docs/text-generation) for detailed usage information. Input capabilities differ between models, including tuned models. Refer to the [model guide](https://ai.google.dev/gemini-api/docs/models/gemini) and [tuning guide](https://ai.google.dev/gemini-api/docs/model-tuning) for details. Required. The request to send to the API. Options for the request. A cancellation token that can be used by other objects or threads to receive notice of cancellation. Response from the model for generated content. Thrown when the is . Thrown when the request fails to execute. 
Thrown when the functionality is not supported by the model or combination of features. Generates a response from the model given an input prompt and other parameters. Required. String to process. Optional. Configuration options for model generation and outputs. Optional. A list of unique SafetySetting instances for blocking unsafe content. Optional. A list of Tools the model may use to generate the next response. Optional. Configuration of tools. Options for the request. A cancellation token that can be used by other objects or threads to receive notice of cancellation. Response from the model for generated content. Thrown when the is . Thrown when the request fails to execute. Generates a streamed response from the model given an input GenerateContentRequest. This method uses a MemoryStream and StreamContent to send a streaming request to the API. It runs asynchronously sending and receiving chunks to and from the API endpoint, which allows non-blocking code execution. The request to send to the API. Options for the request. Stream of GenerateContentResponse with chunks asynchronously. Thrown when the is . Thrown when the request fails to execute. Thrown when the functionality is not supported by the model or combination of features. Generates a response from the model given an input GenerateContentRequest. Required. The request to send to the API. Options for the request. Response from the model for generated content. Thrown when the is . Thrown when the request fails to execute. Enqueues a batch of requests for batch processing. We have a `BatchEmbedContents` handler in `GenerativeService`, but it is synchronous, so this one is named `Async` to avoid confusion. Required. The request to send to the API. Options for the request. A cancellation token that can be used by other objects or threads to receive notice of cancellation. Response from the model for generated content. Thrown when the is . Thrown when the request fails to execute. 
Thrown when the functionality is not supported by the model or combination of features. Enqueues a batch of requests for batch processing. Refer to the [text generation guide](https://ai.google.dev/gemini-api/docs/text-generation) for detailed usage information. Input capabilities differ between models, including tuned models. Refer to the [model guide](https://ai.google.dev/gemini-api/docs/models/gemini) and [tuning guide](https://ai.google.dev/gemini-api/docs/model-tuning) for details. Required. The request to send to the API. Options for the request. A cancellation token that can be used by other objects or threads to receive notice of cancellation. Response from the model for generated content. Thrown when the is . Thrown when the request fails to execute. Thrown when the functionality is not supported by the model or combination of features. Required. The request to send to the API. Options for the request. A cancellation token that can be used by other objects or threads to receive notice of cancellation. Thrown when the is . Thrown when the request fails to execute. Generates images from text prompt. Required. Model to use. Required. String to process. Configuration of image generation. Options for the request. A cancellation token that can be used by other objects or threads to receive notice of cancellation. Response from the model for generated content. Thrown when the is . Thrown when the is . Thrown when the request fails to execute. Generates images from text prompt. Required. String to process. Number of images to generate. Range: 1..8. A description of what you want to omit in the generated images. Aspect ratio for the image. Controls the strength of the prompt. Suggested values are - * 0-9 (low strength) * 10-20 (medium strength) * 21+ (high strength) Language of the text prompt for the image. Adds a filter level to Safety filtering. Allow generation of people by the model. Option to enhance your provided prompt. 
Explicitly set the watermark. Options for the request. A cancellation token that can be used by other objects or threads to receive notice of cancellation. Response from the model for generated content. Thrown when the is . Thrown when the request fails to execute. Edits a set of images specified in the request. Required. The request to send to the API. Options for the request. A cancellation token that can be used by other objects or threads to receive notice of cancellation. Thrown when the is . Thrown when the functionality is not supported by the model or combination of features. Thrown when the request fails to execute. Edits a set of images based on a text description and configuration. Required. Model to use. Required. A text description of the edit to apply to the image. List of reference images for editing. Number of images to generate. Range: 1..8. Configuration for image editing. Options for the request. A cancellation token that can be used by other objects or threads to receive notice of cancellation. Thrown when the is . Thrown when the is . Thrown when the request fails to execute. Makes an API request to upscale a provided image. Required. The request to send to the API. Options for the request. A cancellation token that can be used by other objects or threads to receive notice of cancellation. Thrown when the is . Thrown when the functionality is not supported by the model or combination of features. Thrown when the request fails to execute. Makes an API request to upscale a provided image. The model to use. The input image for upscaling. The factor to upscale the image (x2 or x4). Configuration for upscaling. Options for the request. A cancellation token that can be used by other objects or threads to receive notice of cancellation. Thrown when the is . Thrown when the is . Thrown when the request fails to execute. Generates a video response from the model given an input . Required. The request to send to the API. Options for the request. 
A cancellation token that can be used by other objects or threads to receive notice of cancellation. Options for the request. A cancellation token that can be used by other objects or threads to receive notice of cancellation. Generates images from a text prompt. Required. String to process. Number of images to generate. Range: 1..8. A description of what you want to omit in the generated images. Aspect ratio for the image. Controls the strength of the prompt. Suggested values: 0-9 (low strength), 10-20 (medium strength), 21+ (high strength). Language of the text prompt for the image. Adds a filter level to Safety filtering. Allow generation of people by the model. Option to enhance your provided prompt. Explicitly set the watermark. Options for the request. A cancellation token that can be used by other objects or threads to receive notice of cancellation. Response from the model for generated content. Thrown when the is . Thrown when the request fails to execute. Generates a grounded answer from the model given an input GenerateAnswerRequest. Options for the request. A cancellation token that can be used by other objects or threads to receive notice of cancellation. Response from the model for a grounded answer. Thrown when the is . Generates a text embedding vector from the input `Content` using the specified [Gemini Embedding model](https://ai.google.dev/gemini-api/docs/models/gemini#text-embedding). Required. EmbedContentRequest to process. The content to embed. Only the parts.text fields will be counted. Optional. The model used to generate embeddings. Defaults to models/embedding-001. Optional. Optional task type for which the embeddings will be used. Can only be set for models/embedding-001. Optional. An optional title for the text. Only applicable when TaskType is RETRIEVAL_DOCUMENT. Note: Specifying a title for RETRIEVAL_DOCUMENT provides better quality embeddings for retrieval. Options for the request. 
A cancellation token that can be used by other objects or threads to receive notice of cancellation. List containing the embedding (list of float values) for the input content. Thrown when the is . Thrown when the functionality is not supported by the model. Generates multiple embedding vectors from the input `Content`, which consists of a batch of strings represented as `EmbedContentRequest` objects. Required. Embed requests for the batch. The model in each of these requests must match the model specified in BatchEmbedContentsRequest.model. Optional. The model used to generate embeddings. Defaults to models/embedding-001. Optional. Optional task type for which the embeddings will be used. Can only be set for models/embedding-001. Optional. An optional title for the text. Only applicable when TaskType is RETRIEVAL_DOCUMENT. Note: Specifying a title for RETRIEVAL_DOCUMENT provides better quality embeddings for retrieval. Options for the request. A cancellation token that can be used by other objects or threads to receive notice of cancellation. List containing the embedding (list of float values) for the input content. Thrown when the is . Generates an embedding from the model given an input Content. Required. String to process. The content to embed. Only the parts.text fields will be counted. Optional. The model used to generate embeddings. Defaults to models/embedding-001. Optional. Optional task type for which the embeddings will be used. Can only be set for models/embedding-001. Optional. An optional title for the text. Only applicable when TaskType is RETRIEVAL_DOCUMENT. Note: Specifying a title for RETRIEVAL_DOCUMENT provides better quality embeddings for retrieval. Options for the request. A cancellation token that can be used by other objects or threads to receive notice of cancellation. List containing the embedding (list of float values) for the input content. Thrown when the is . Thrown when the functionality is not supported by the model. 
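A hedged sketch of the embedding calls described above, assuming the documented defaults (the overload shapes are assumptions and may vary by package version):

```csharp
using System;
using System.Collections.Generic;
using Mscc.GenerativeAI;

var googleAi = new GoogleAI(apiKey: Environment.GetEnvironmentVariable("GOOGLE_API_KEY"));
// "embedding-001" is the documented default embedding model.
var model = googleAi.GenerativeModel(model: "embedding-001");

// Single input: only the parts.text fields are counted.
var single = await model.EmbedContent("Write a story about a magic backpack.");

// Batch input: each request in the batch must use the same model as the
// enclosing BatchEmbedContentsRequest.model.
var batch = await model.EmbedContent(new List<string> { "First text", "Second text" });
```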
Generates an embedding from the model given an input Content. Required. List of strings to process. The content to embed. Only the parts.text fields will be counted. Optional. The model used to generate embeddings. Defaults to models/embedding-001. Optional. Optional task type for which the embeddings will be used. Can only be set for models/embedding-001. Optional. An optional title for the text. Only applicable when TaskType is RETRIEVAL_DOCUMENT. Note: Specifying a title for RETRIEVAL_DOCUMENT provides better quality embeddings for retrieval. Options for the request. A cancellation token that can be used by other objects or threads to receive notice of cancellation. List containing the embedding (list of float values) for the input content. Thrown when the is . Thrown when the functionality is not supported by the model. Generates multiple embeddings from the model given input text in a synchronous call. Content to embed. Optional. The model used to generate embeddings. Defaults to models/embedding-001. Optional. Optional task type for which the embeddings will be used. Can only be set for models/embedding-001. Optional. An optional title for the text. Only applicable when TaskType is RETRIEVAL_DOCUMENT. Note: Specifying a title for RETRIEVAL_DOCUMENT provides better quality embeddings for retrieval. Options for the request. A cancellation token that can be used by other objects or threads to receive notice of cancellation. List containing the embedding (list of float values) for the input content. Thrown when the is . Thrown when the functionality is not supported by the model. Counts the number of tokens for the given content. The name of the GenAI model to use for token counting. A to compute tokens for. A instance that specifies the optional configurations. Options for the request. A cancellation token that can be used by other objects or threads to receive notice of cancellation. A that represents the asynchronous operation. 
The task result contains a instance with token information. Counts the number of tokens for the given content. Refer to the [tokens guide](https://ai.google.dev/gemini-api/docs/tokens) to learn more about tokens. Options for the request. A cancellation token that can be used by other objects or threads to receive notice of cancellation. A that represents the asynchronous operation. The task result contains a instance with token information. Thrown when the is . Counts the number of tokens for the given file resource. Computes the number of tokens for the given content. The name of the GenAI model to use for token computation. A to compute tokens for. A instance that specifies the optional configurations. Options for the request. A cancellation token that can be used by other objects or threads to receive notice of cancellation. A that represents the asynchronous operation. The task result contains a instance with token information. Thrown when called with a non-Vertex AI client. Computes the number of tokens for the given content. Options for the request. A cancellation token that can be used by other objects or threads to receive notice of cancellation. A that represents the asynchronous operation. The task result contains a instance with token information. Thrown when called with a non-Vertex AI client. Thrown when request is null. Starts a chat session. Optional. A collection of objects, or equivalents to initialize the session. Optional. Configuration options for model generation and outputs. Optional. A list of unique SafetySetting instances for blocking unsafe content. Optional. A list of Tools the model may use to generate the next response. Returns a attached to this model. Performs a prediction request. Required. The request to send to the API. Options for the request. A cancellation token that can be used by other objects or threads to receive notice of cancellation. Prediction response. Thrown when the is . Thrown when the request fails to execute. 
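Token counting and chat sessions as described above might be used together like this (a sketch; the `TotalTokens` property and `SendMessage` shape are assumptions based on the documented members):

```csharp
using System;
using Mscc.GenerativeAI;

var googleAi = new GoogleAI(apiKey: Environment.GetEnvironmentVariable("GOOGLE_API_KEY"));
var model = googleAi.GenerativeModel(model: "gemini-2.5-pro");

// Counting tokens first helps keep a prompt within the model's context window.
var count = await model.CountTokens("The quick brown fox jumps over the lazy dog.");
Console.WriteLine($"Tokens: {count.TotalTokens}");

// StartChat returns a session attached to this model; history carries across turns.
var chat = model.StartChat();
var first = await chat.SendMessage("I am planning a trip to Mauritius.");
var followUp = await chat.SendMessage("What did I just say I was planning?");
Console.WriteLine(followUp.Text);
```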
Same as Predict but returns a long-running operation (LRO). Required. The request to send to the API. Options for the request. A cancellation token that can be used by other objects or threads to receive notice of cancellation. Prediction response. Thrown when the is . Thrown when the request fails to execute. Generates a response from the model given an input message. The request to send to the API. Options for the request. A cancellation token that can be used by other objects or threads to receive notice of cancellation. Thrown when the is . Counts the number of tokens in the content. Options for the request. A cancellation token that can be used by other objects or threads to receive notice of cancellation. Number of tokens. Thrown when the is . Generates a response from the model given an input prompt. Options for the request. A cancellation token that can be used by other objects or threads to receive notice of cancellation. Thrown when the is . Runs a model's tokenizer on a string and returns the token count. Options for the request. A cancellation token that can be used by other objects or threads to receive notice of cancellation. Number of tokens. Thrown when the is . A cancellation token that can be used by other objects or threads to receive notice of cancellation. Thrown when the is . Counts the number of tokens in the content. Options for the request. A cancellation token that can be used by other objects or threads to receive notice of cancellation. Number of tokens. Thrown when the is . Generates multiple embeddings from the model given input text in a synchronous call. Required. Embed requests for the batch. The model in each of these requests must match the model specified in BatchEmbedContentsRequest.model. A cancellation token that can be used by other objects or threads to receive notice of cancellation. List of embeddings of the content as lists of floating-point numbers. Thrown when the is . Entry point to access the Gemini API running in Google AI. See Model reference. 
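The two backends can be initialized as follows, a sketch based on the constructors described in this reference (property and parameter names are assumptions where not spelled out above):

```csharp
using Mscc.GenerativeAI;

// Google AI: an API key from Google AI Studio, or GOOGLE_API_KEY via .env/environment.
var googleAi = new GoogleAI(apiKey: "your_api_key");
var geminiModel = googleAi.GenerativeModel(model: "gemini-2.5-pro");

// Vertex AI: project ID and region instead of an API key; "us-central1" is the
// documented default region. The access token must be set before the first request.
var vertexAi = new VertexAI(projectId: "your-project-id", region: "us-central1");
var vertexModel = vertexAi.GenerativeModel(model: "gemini-2.5-pro");
vertexModel.AccessToken = "your_access_token";
```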
Initializes a new instance of the class with access to Google AI Gemini API. The default constructor attempts to read .env file and environment variables. Sets default values, if available. Optional. The IHttpClientFactory to use for creating HttpClient instances. Optional. Options for the request. Optional. Logger instance used for logging The following environment variables are used: GOOGLE_API_KEY API key provided by Google AI Studio. GOOGLE_ACCESS_TOKEN Optional. Access token provided by OAuth 2.0 or Application Default Credentials (ADC). Initializes a new instance of the class with access to Google AI Gemini API. Either API key or access token is required. API key for Google AI Studio. Access token for the Google Cloud project. Version of the API. Optional. The IHttpClientFactory to use for creating HttpClient instances. Optional. Options for the request. Optional. Logger instance used for logging Create a generative model on Google AI to use. Model to use (default: "gemini-2.5-pro") Optional. Configuration options for model generation and outputs. Optional. A list of unique SafetySetting instances for blocking unsafe content. Optional. A list of Tools the model may use to generate the next response. Optional. Optional. Logger instance used for logging Generative model instance. Create a generative model on Google AI to use. Content that has been preprocessed. Optional. Configuration options for model generation and outputs. Optional. A list of unique SafetySetting instances for blocking unsafe content. Optional. Logger instance used for logging Generative model instance. Thrown when is null. Returns an instance of to use with a model. Optional. Logger instance used for logging Cached content instance. Returns an instance of to use with a model. Optional. Logger instance used for logging Batches instance. Returns an instance of to use with a model. Model to use (default: "imagegeneration") Optional. 
Logger instance used for logging Imagen model Returns an instance of . Returns an instance of . Uploads a file to the File API backend. URI or path to the file to upload. A name displayed for the uploaded file. Flag indicating whether to use resumable upload. Options for the request. A cancellation token to cancel the upload. A URI of the uploaded file. Thrown when the is null or empty. Thrown when the file is not found. Thrown when the file size exceeds the maximum allowed size. Thrown when the file upload fails. Thrown when the request fails to execute. Uploads a stream to the File API backend. Stream to upload. A name displayed for the uploaded file. The MIME type of the stream content. Flag indicating whether to use resumable upload. Options for the request. A cancellation token to cancel the upload. A URI of the uploaded file. Thrown when the is null or empty. Thrown when the size exceeds the maximum allowed size. Thrown when the upload fails. Thrown when the request fails to execute. Gets a generated file. When calling this method via REST, only the metadata of the generated file is returned. To retrieve the file content via REST, add alt=media as a query parameter. Required. The name of the generated file to retrieve. Example: `generatedFiles/abc-123` Options for the request. A cancellation token that can be used by other objects or threads to receive notice of cancellation. Metadata for the given file. Thrown when the is null or empty. Thrown when the request fails to execute. Lists the metadata for Files owned by the requesting project. The maximum number of Files to return (per page). A page token, received from a previous files.list call. Provide the pageToken returned by one request as an argument to the next request to retrieve the next page. Options for the request. A cancellation token that can be used by other objects or threads to receive notice of cancellation. List of files in File API. 
Thrown when the functionality is not supported by the model. Thrown when the request fails to execute. Gets the metadata for the given File. Required. The resource name of the file to get. This name should match a file name returned by the files.list method. Format: files/file-id. Options for the request. A cancellation token that can be used by other objects or threads to receive notice of cancellation. Metadata for the given file. Thrown when the is null or empty. Thrown when the functionality is not supported by the model. Thrown when the request fails to execute. Deletes a file. Required. The resource name of the file to delete. This name should match a file name returned by the files.list method. Format: files/file-id. Options for the request. A cancellation token that can be used by other objects or threads to receive notice of cancellation. If successful, the response body is empty. Thrown when the is null or empty. Thrown when the functionality is not supported by the model. Thrown when the request fails to execute. Registers Google Cloud Storage files with the FileService. The user is expected to provide Google Cloud Storage URIs and will receive a File resource for each URI in return. Note that the files are not copied, just registered with the File API. If one file fails to register, the whole request fails. List of Google Cloud Storage URIs to register. Options for the request. A cancellation token that can be used by other objects or threads to receive notice of cancellation. Lists the metadata for Files owned by the requesting project. The maximum number of Files to return (per page). A page token, received from a previous files.list call. Provide the pageToken returned by one request as an argument to the next request to retrieve the next page. Options for the request. A cancellation token that can be used by other objects or threads to receive notice of cancellation. List of files in File API. Thrown when the functionality is not supported by the model. 
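A sketch of the File API round trip described above (method names mirror the operations documented here; return shapes such as `uploaded.File.Name` are assumptions):

```csharp
using System;
using Mscc.GenerativeAI;

var googleAi = new GoogleAI(apiKey: Environment.GetEnvironmentVariable("GOOGLE_API_KEY"));

// Upload a local file; resumable upload is advisable for large media.
var uploaded = await googleAi.UploadFile("path/to/report.pdf",
    displayName: "Quarterly report", resumable: true);

// List files owned by the project, one page at a time.
var page = await googleAi.ListFiles(pageSize: 20);

// Fetch metadata for a single file, then delete it. Format: files/file-id.
var metadata = await googleAi.GetFile(uploaded.File.Name);
await googleAi.DeleteFile(uploaded.File.Name);
```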
Thrown when the request fails to execute. Thrown when both "apiKey" and "accessToken" are . Thrown when the is . Thrown when the is . Specify API key in HTTP header Using an API key with REST to send to the API. The API key to use for the request. A delegating handler that provides a timeout for HTTP requests. Default constructor. Base constructor to set the instance. Optional. Logger instance used for logging The interface shall be used to write generic implementations using either Google AI Gemini API or Vertex AI Gemini API as backends. Create an instance of a generative model to use. Model to use (default: "gemini-1.5-pro") Optional. Configuration options for model generation and outputs. Optional. A list of unique SafetySetting instances for blocking unsafe content. Optional. A list of Tools the model may use to generate the next response. Optional. Optional. Logger instance used for logging Thrown when required parameters are null. Generative model instance. Create an instance of a generative model to use. Content that has been preprocessed. Optional. Configuration options for model generation and outputs. Optional. A list of unique SafetySetting instances for blocking unsafe content. Optional. Logger instance used for logging Generative model instance. Gets information about a specific Model. Required. The resource name of the model. This name should match a model name returned by the models.list method. Format: models/model-id or tunedModels/my-model-id Options for the request. A cancellation token that can be used by other objects or threads to receive notice of cancellation. Thrown when model parameter is null. Thrown when the backend does not support this method or the model. Returns an instance of an image generation model. Model to use (default: "imagegeneration") Optional. Logger instance used for logging Returns an instance of . Optional. Logger instance used for logging Returns an instance of . Optional. 
Logger instance used for logging Name of the model that supports image generation. The can create high-quality visual assets in seconds and bring Google's state-of-the-art vision and multimodal generative AI capabilities to application developers. Initializes a new instance of the class. Initializes a new instance of the class. The default constructor attempts to read the .env file and environment variables. Sets default values, if available. Optional. The to use for creating HttpClient instances. Optional. Logger instance used for logging Initializes a new instance of the class with access to Google AI Gemini API. API key provided by Google AI Studio Model to use Optional. The to use for creating HttpClient instances. Optional. Logger instance used for logging Initializes a new instance of the class with access to Vertex AI Gemini API. Identifier of the Google Cloud project Region to use Access token for the Google Cloud project. Model to use Optional. Logger instance used for logging Generates images from the specified . Required. The request to send to the API. Options for the request. A cancellation token that can be used by other objects or threads to receive notice of cancellation. Response from the model for generated images. Thrown when the is . Generates images from a text prompt. Required. String to process. Number of images to generate. Range: 1..8. A description of what you want to omit in the generated images. Aspect ratio for the image. Controls the strength of the prompt. Suggested values: 0-9 (low strength), 10-20 (medium strength), 21+ (high strength). Language of the text prompt for the image. Adds a filter level to Safety filtering. Allow generation of people by the model. Option to enhance your provided prompt. Explicitly set the watermark. Options for the request. A cancellation token that can be used by other objects or threads to receive notice of cancellation. Response from the model for generated content. Thrown when the is . 
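The image generation parameters listed above might be exercised like this (a sketch; the factory method and parameter names follow the documented descriptions but are assumptions):

```csharp
using System;
using Mscc.GenerativeAI;

var googleAi = new GoogleAI(apiKey: Environment.GetEnvironmentVariable("GOOGLE_API_KEY"));
// "imagegeneration" is the documented default model name.
var imagen = googleAi.ImageGenerationModel(model: "imagegeneration");

// numberOfImages is limited to 1..8; guidanceScale controls prompt strength
// (0-9 low, 10-20 medium, 21+ high).
var result = await imagen.GenerateImages(
    prompt: "A watercolor painting of a lighthouse at dawn",
    numberOfImages: 2,
    negativePrompt: "people, text",
    aspectRatio: "16:9",
    guidanceScale: 12);
```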
Thrown when the request fails to execute. Generates a response from the model given an input prompt and other parameters. Required. String to process. Options for the request. A cancellation token that can be used by other objects or threads to receive notice of cancellation. Response from the model for generated content. Thrown when the is . Thrown when the request fails to execute. Generates an image from the model given an input. Initializes a new instance of the class. Initializes a new instance of the class. Optional. The to use for creating HttpClient instances. Optional. Logger instance used for logging Options for the request. A cancellation token that can be used by other objects or threads to receive notice of cancellation. Name of the model that supports image captioning. generates a caption from an image you provide based on the language that you specify. The model supports the following languages: English (en), German (de), French (fr), Spanish (es) and Italian (it). Initializes a new instance of the class. Initializes a new instance of the class. The default constructor attempts to read .env file and environment variables. Sets default values, if available. Optional. The to use for creating HttpClient instances. Optional. Logger instance used for logging Initializes a new instance of the class with access to Vertex AI Gemini API. Identifier of the Google Cloud project Region to use Access token for the Google Cloud project. Model to use Optional. The to use for creating HttpClient instances. Optional. Logger instance used for logging Generates images from the specified . Required. The request to send to the API. Options for the request. A cancellation token that can be used by other objects or threads to receive notice of cancellation. Response from the model for generated images. Generates a response from the model given an input prompt and other parameters. Required. The base64 encoded image to process. Optional. Number of results to return. 
Default is 1. Optional. Language to use. Default is en. Optional. Cloud Storage uri where to store the generated predictions. Options for the request. A cancellation token that can be used by other objects or threads to receive notice of cancellation. Response from the model for generated content. Thrown when the is . Thrown when the request fails to execute. Thrown when the is not supported by the API. Generates a response from the model given an input prompt and other parameters. Required. The base64 encoded image to process. Required. The question to ask about the image. Optional. Number of results to return. Default is 1. Optional. Language to use. Default is en. Options for the request. A cancellation token that can be used by other objects or threads to receive notice of cancellation. Response from the model for generated content. Thrown when the is . Thrown when the request fails to execute. Thrown when the is not supported by the API. The Gemini Interactions API is an experimental API that allows developers to build generative AI applications using Gemini models. Gemini is a highly capable multimodal model that can understand and process various types of information, including language, images, audio, video, and code. The API supports use cases like reasoning across text and images, content generation, dialogue agents, summarization, and classification. Initializes a new instance of the class. Initializes a new instance of the class with a specified . The to use for making API requests. Initializes a new instance of the class with optional and . Optional. The to use for creating HttpClient instances. Optional. The instance for logging. Creates a new interaction based on the provided request. The request object containing all parameters for the interaction. Optional. Options for configuring the request, such as timeout and retry settings. A cancellation token that can be used by other objects or threads to receive notice of cancellation. 
An representing the created interaction. Thrown if the request is null. Creates a new interaction with the specified parameters. The name of the `Model` to use for the interaction. The name of the `Agent` to use for the interaction. The input string for the interaction. A system instruction to guide the model's behavior. A list of tool declarations the model may call. Enforces the response to be a JSON object matching a specified schema. The MIME type of the response, required if `responseFormat` is set. If true, the interaction will be streamed. If true, the request and response will be stored for later retrieval. If true, the interaction will run in the background. Configuration for the model's generation process. Configuration for the agent. The ID of the previous interaction in a conversation. The requested modalities for the response (e.g., TEXT, IMAGE). Optional. Options for configuring the request. A cancellation token that can be used by other objects or threads to receive notice of cancellation. An representing the created interaction. Creates a new interaction with the specified parameters, using structured content as input. The name of the `Model` to use for the interaction. The name of the `Agent` to use for the interaction. The structured content for the interaction. A system instruction to guide the model's behavior. A list of tool declarations the model may call. Enforces the response to be a JSON object matching a specified schema. The MIME type of the response, required if `responseFormat` is set. If true, the interaction will be streamed. If true, the request and response will be stored for later retrieval. If true, the interaction will run in the background. Configuration for the model's generation process. Configuration for the agent. The ID of the previous interaction in a conversation. The requested modalities for the response (e.g., TEXT, IMAGE). Optional. Options for configuring the request. 
A cancellation token that can be used by other objects or threads to receive notice of cancellation. An representing the created interaction. Creates a new interaction with a list of content parts as input. The name of the `Model` to use for the interaction. The name of the `Agent` to use for the interaction. A list of content parts for the interaction. A system instruction to guide the model's behavior. A list of tool declarations the model may call. Enforces the response to be a JSON object matching a specified schema. The MIME type of the response, required if `responseFormat` is set. If true, the interaction will be streamed. If true, the request and response will be stored for later retrieval. If true, the interaction will run in the background. Configuration for the model's generation process. Configuration for the agent. The ID of the previous interaction in a conversation. The requested modalities for the response (e.g., TEXT, IMAGE). Optional. Options for configuring the request. A cancellation token that can be used by other objects or threads to receive notice of cancellation. An representing the created interaction. Creates a new interaction with a history of conversation turns as input. The name of the `Model` to use for the interaction. The name of the `Agent` to use for the interaction. A list of conversation turns representing the history. A system instruction to guide the model's behavior. A list of tool declarations the model may call. Enforces the response to be a JSON object matching a specified schema. The MIME type of the response, required if `responseFormat` is set. If true, the interaction will be streamed. If true, the request and response will be stored for later retrieval. If true, the interaction will run in the background. Configuration for the model's generation process. Configuration for the agent. The ID of the previous interaction in a conversation. The requested modalities for the response (e.g., TEXT, IMAGE). Optional. 
Options for configuring the request. A cancellation token that can be used by other objects or threads to receive notice of cancellation. An representing the created interaction. Creates a new interaction and streams the response as it is generated. The request object containing all parameters for the interaction. Optional. Options for configuring the request. A cancellation token that can be used by other objects or threads to receive notice of cancellation. An asynchronous stream of chunks. Thrown if the request is null. Creates a new interaction and streams the response. The name of the `Model` to use for the interaction. The name of the `Agent` to use for the interaction. The input string for the interaction. A system instruction to guide the model's behavior. A list of tool declarations the model may call. Enforces the response to be a JSON object matching a specified schema. The MIME type of the response, required if `responseFormat` is set. If true, the interaction will be streamed. If true, the request and response will be stored for later retrieval. If true, the interaction will run in the background. Configuration for the model's generation process. Configuration for the agent. The ID of the previous interaction in a conversation. The requested modalities for the response (e.g., TEXT, IMAGE). Optional. Options for configuring the request. A cancellation token that can be used by other objects or threads to receive notice of cancellation. An asynchronous stream of chunks. Creates a new interaction with structured content and streams the response. The name of the `Model` to use for the interaction. The name of the `Agent` to use for the interaction. The structured content for the interaction. A system instruction to guide the model's behavior. A list of tool declarations the model may call. Enforces the response to be a JSON object matching a specified schema. The MIME type of the response, required if `responseFormat` is set. 
If true, the interaction will be streamed. If true, the request and response will be stored for later retrieval. If true, the interaction will run in the background. Configuration for the model's generation process. Configuration for the agent. The ID of the previous interaction in a conversation. The requested modalities for the response (e.g., TEXT, IMAGE). Optional. Options for configuring the request. A cancellation token that can be used by other objects or threads to receive notice of cancellation. An asynchronous stream of chunks. Creates a new interaction with a list of content parts and streams the response. The name of the `Model` to use for the interaction. The name of the `Agent` to use for the interaction. A list of content parts for the interaction. A system instruction to guide the model's behavior. A list of tool declarations the model may call. Enforces the response to be a JSON object matching a specified schema. The MIME type of the response, required if `responseFormat` is set. If true, the interaction will be streamed. If true, the request and response will be stored for later retrieval. If true, the interaction will run in the background. Configuration for the model's generation process. Configuration for the agent. The ID of the previous interaction in a conversation. The requested modalities for the response (e.g., TEXT, IMAGE). Optional. Options for configuring the request. A cancellation token that can be used by other objects or threads to receive notice of cancellation. An asynchronous stream of chunks. Creates a new interaction with a history of conversation turns and streams the response. The name of the `Model` to use for the interaction. The name of the `Agent` to use for the interaction. A list of conversation turns representing the history. A system instruction to guide the model's behavior. A list of tool declarations the model may call. Enforces the response to be a JSON object matching a specified schema. 
The MIME type of the response, required if `responseFormat` is set. If true, the interaction will be streamed. If true, the request and response will be stored for later retrieval. If true, the interaction will run in the background. Configuration for the model's generation process. Configuration for the agent. The ID of the previous interaction in a conversation. The requested modalities for the response (e.g., TEXT, IMAGE). Optional. Options for configuring the request. A cancellation token that can be used by other objects or threads to receive notice of cancellation. An asynchronous stream of chunks. Retrieves the full details of a single interaction by its ID. The unique identifier of the interaction to retrieve. If true, the generated content will be streamed incrementally. Optional. If set, resumes the stream from the event after the specified ID. Requires `stream` to be true. The API version to use for the request. Optional. Options for configuring the request. A cancellation token that can be used by other objects or threads to receive notice of cancellation. An with the details of the interaction. Deletes an interaction by its ID. The unique identifier of the interaction to delete. The API version to use for the request. Optional. Options for configuring the request. A cancellation token that can be used by other objects or threads to receive notice of cancellation. An empty string if the deletion is successful. Cancels a running background interaction by its ID. The unique identifier of the interaction to cancel. The API version to use for the request. Optional. Options for configuring the request. A cancellation token that can be used by other objects or threads to receive notice of cancellation. An representing the canceled interaction. Initializes a new instance of the class. Initializes a new instance of the class. Optional. The IHttpClientFactory to use for creating HttpClient instances. Optional. 
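Taken together, the interaction members above suggest a streaming call shaped roughly like the sketch below. Every name in it (`InteractionsModel`, `CreateStream`, `chunk.Text`) is an illustrative assumption, not a confirmed signature from this reference.

```csharp
using Mscc.GenerativeAI;

// Illustrative sketch only: type and member names are assumed.
var interactions = new InteractionsModel();      // hypothetical client type
string previousId = null;                        // set to continue an earlier conversation

await foreach (var chunk in interactions.CreateStream(
                   model: "model-name",
                   input: "Summarize the conversation so far.",
                   store: true,                   // keep request/response for later retrieval
                   previousInteractionId: previousId))
{
    Console.Write(chunk.Text);                   // hypothetical property on each streamed chunk
}
```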
Logger instance used for logging. Uploads data to a ragStore, preprocesses and chunks it before storing it in a RagStore Document. Uploads data to a , preprocesses and chunks it before storing it in a Document. Name of the File Search Store. URI or path to the file to upload. A name displayed for the uploaded file. Flag indicating whether to use resumable upload. Options for the request. A cancellation token that can be used by other objects or threads to receive notice of cancellation. An operation representing the uploaded file. Thrown when the is null or empty. Thrown when the file is not found. Thrown when the file size exceeds the maximum allowed size. Thrown when the file upload fails. Thrown when the request fails to execute. Thrown when the MIME type of the URI is not supported by the API. Uploads a file to the File API backend. URI or path to the file to upload. A name displayed for the uploaded file. Flag indicating whether to use resumable upload. Options for the request. A cancellation token that can be used by other objects or threads to receive notice of cancellation. A URI of the uploaded file. Thrown when the is null or empty. Thrown when the file is not found. Thrown when the file size exceeds the maximum allowed size. Thrown when the file upload fails. Thrown when the request fails to execute. Thrown when the MIME type of the URI is not supported by the API. Uploads a stream to the File API backend. Stream to upload. A name displayed for the uploaded file. The MIME type of the stream content. Flag indicating whether to use resumable upload. Options for the request. A cancellation token that can be used by other objects or threads to receive notice of cancellation. A URI of the uploaded file. Thrown when the is null or empty. Thrown when the size exceeds the maximum allowed size. Thrown when the upload fails. Thrown when the request fails to execute. Thrown when the is not supported by the API. Gets a generated file. 
When calling this method via REST, only the metadata of the generated file is returned. To retrieve the file content via REST, add alt=media as a query parameter. Required. The name of the generated file to retrieve. Example: `generatedFiles/abc-123` Optional. Flag indicating whether to retrieve the file content. Options for the request. A cancellation token that can be used by other objects or threads to receive notice of cancellation. Metadata for the given file. Thrown when the is null or empty. Thrown when the request fails to execute. Gets a generated file. When calling this method via REST, only the metadata of the generated file is returned. To retrieve the file content via REST, add alt=media as a query parameter. Required. The name of the generated file to retrieve. Example: `generatedFiles/abc-123` Optional. Flag indicating whether to retrieve the file content. Options for the request. A cancellation token that can be used by other objects or threads to receive notice of cancellation. Metadata for the given file. Thrown when the is null or empty. Thrown when the request fails to execute. Initializes a new instance of the class. Initializes a new instance of the class. Optional. The to use for creating HttpClient instances. Optional. Logger instance used for logging Lists the currently available models. Options for the request. A cancellation token that can be used by other objects or threads to receive notice of cancellation. List of available models. Thrown when the request fails to execute. Gets a model instance. Required. The resource name of the model. This name should match a model name returned by the ListModels method. Required. The name of the model to get. Options for the request. A cancellation token that can be used by other objects or threads to receive notice of cancellation. Thrown when the request fails to execute. Generates a set of responses from the model given a chat history input. Required. The request to send to the API. 
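A minimal upload sketch, assuming the `GoogleAI`/`GenerativeModel` entry points and the `UploadFile` member documented in the project README; the returned shape (`upload.File.Name`) is an assumption:

```csharp
using Mscc.GenerativeAI;

var googleAi = new GoogleAI(apiKey: Environment.GetEnvironmentVariable("GOOGLE_API_KEY"));
var model = googleAi.GenerativeModel();

// Upload a local file; the display name matches the "name displayed for the
// uploaded file" parameter described above.
var upload = await model.UploadFile("report.pdf", displayName: "Quarterly report");
Console.WriteLine(upload.File.Name);   // resource name of the uploaded file (shape assumed)
```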
Options for the request. A cancellation token that can be used by other objects or threads to receive notice of cancellation. Thrown when the is . Generates embeddings from the model given an input. Required. The request to send to the API. Options for the request. A cancellation token that can be used by other objects or threads to receive notice of cancellation. Thrown when the is . Options for the request. A cancellation token that can be used by other objects or threads to receive notice of cancellation. Initializes a new instance of the class. Initializes a new instance of the class. Optional. The to use for creating HttpClient instances. Optional. Logger instance used for logging Gets the latest state of a long-running operation. Clients can use this method to poll the operation result at intervals as recommended by the API service. Options for the request. A cancellation token that can be used by other objects or threads to receive notice of cancellation. The `RagEngineModel` class provides methods for interacting with a RAG engine. Initializes a new instance of the class. Initializes a new instance of the class. Optional. The to use for creating HttpClient instances. Optional. Logger instance used for logging Initializes a new instance of the class. Access token for the Google Cloud project. Optional. The to use for creating HttpClient instances. Optional. Logger instance used for logging Creates an empty `RAG Corpus`. Options for the request. A cancellation token that can be used by other objects or threads to receive notice of cancellation. Updates a `RAG Corpus`. Options for the request. A cancellation token that can be used by other objects or threads to receive notice of cancellation. Lists all `RAG Corpora` owned by the user. The maximum number of Models to return (per page). A page token, received from a previous ListModels call. Provide the pageToken returned by one request as an argument to the next request to retrieve the next page. 
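The pageToken contract described above is the standard Google API paging pattern; a hedged sketch of draining all pages, where `ragEngine`, `ListCorpora`, `RagCorpora`, and `NextPageToken` are assumed names:

```csharp
// Iterate all pages using the pageToken contract described above.
string pageToken = null;
do
{
    var page = await ragEngine.ListCorpora(pageSize: 50, pageToken: pageToken);
    foreach (var corpus in page.RagCorpora)
        Console.WriteLine(corpus.Name);
    pageToken = page.NextPageToken;              // null or empty when no pages remain
} while (!string.IsNullOrEmpty(pageToken));
```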
Options for the request. A cancellation token that can be used by other objects or threads to receive notice of cancellation. Gets information about a specific `RAG Corpus`. Options for the request. A cancellation token that can be used by other objects or threads to receive notice of cancellation. Deletes a `RAG Corpus`. Options for the request. A cancellation token that can be used by other objects or threads to receive notice of cancellation. DeleteOperationMetadata Upload a local file to the corpus. URI or path to the file to upload. A name displayed for the uploaded file. Options for the request. A cancellation token that can be used by other objects or threads to receive notice of cancellation. Upload a stream to the corpus. Stream to upload. The MIME type of the stream content. A name displayed for the uploaded file. Options for the request. A cancellation token that can be used by other objects or threads to receive notice of cancellation. A cancellation token that can be used by other objects or threads to receive notice of cancellation. ImportRagFilesOperationMetadata The maximum number of Models to return (per page). A page token, received from a previous ListModels call. Provide the pageToken returned by one request as an argument to the next request to retrieve the next page. Options for the request. A cancellation token that can be used by other objects or threads to receive notice of cancellation. Options for the request. A cancellation token that can be used by other objects or threads to receive notice of cancellation. Options for the request. A cancellation token that can be used by other objects or threads to receive notice of cancellation. Options for the request. A cancellation token that can be used by other objects or threads to receive notice of cancellation. Options for the request. A cancellation token that can be used by other objects or threads to receive notice of cancellation. Required. Single RAG retrieve query. Required. 
Single RAG retrieve query. The data source for Vertex RagStore. Relevant contexts for one query. A context of the query. If the file is imported from Cloud Storage or Google Drive, sourceUri will be the original file URI in Cloud Storage or Google Drive; if the file is uploaded, sourceUri will be the file display name. The file display name. The text chunk. According to the underlying Vector DB and the selected metric type, the score can be either the distance or the similarity between the query and the context, and its range depends on the metric type. For example, if the metric type is COSINE_DISTANCE, it represents the distance between the query and the context. The larger the distance, the less relevant the context is to the query. The range is [0, 2], where 0 means most relevant and 2 means least relevant. Initializes a new instance of the class. Initializes a new instance of the class. Optional. The to use for creating HttpClient instances. Optional. Logger instance used for logging. Creates an empty `Document`. Required. The `Document`. Required. The name of the `RagStore` where this `Document` will be created. Example: `ragStores/my-rag-store-123` Options for the request. A cancellation token that can be used by other objects or threads to receive notice of cancellation. Thrown when the is or empty. Thrown when the functionality is not supported by the model. Thrown when the request fails to execute. Deletes a `Document`. Required. The name of the `RagStore` where this `Document` exists. Example: `ragStores/my-rag-store-123` Required. The name of the `Document`. Optional. If set to true, any `Chunk`s and objects related to this `Document` will also be deleted. If false (the default), a `FAILED_PRECONDITION` error will be returned if the `Document` contains any `Chunk`s. Options for the request. A cancellation token that can be used by other objects or threads to receive notice of cancellation. Thrown when the is or empty. Thrown when the is or empty. 
Thrown when the functionality is not supported by the model. Thrown when the request fails to execute. Gets information about a specific `Document`. Required. The name of the `RagStore` where this `Document` exists. Example: `ragStores/my-rag-store-123` Required. The name of the `Document`. Options for the request. A cancellation token that can be used by other objects or threads to receive notice of cancellation. Thrown when the is or empty. Thrown when the is or empty. Thrown when the functionality is not supported by the model. Thrown when the request fails to execute. Lists all `Document`s in a `RagStore`. Required. The name of the `RagStore` where the `Document`s exist. Example: `ragStores/my-rag-store-123` Options for the request. A cancellation token that can be used by other objects or threads to receive notice of cancellation. Thrown when the is or empty. Thrown when the functionality is not supported by the model. Thrown when the request fails to execute. Updates a `RagStore`. Required. The name of the `RagStore` where the `Document`s exist. Example: `ragStores/my-rag-store-123` Optional. The list of fields to update. Options for the request. A cancellation token that can be used by other objects or threads to receive notice of cancellation. Thrown when the is or empty. Performs semantic search over a `RagStore`. Required. The name of the `RagStore` where this `Document` exists. Example: `ragStores/my-rag-store-123` Required. The name of the `Document`. Options for the request. A cancellation token that can be used by other objects or threads to receive notice of cancellation. Thrown when the is or empty. Thrown when the is or empty. Thrown when the functionality is not supported by the model. Thrown when the request fails to execute. Gets the latest state of a long-running operation. Clients can use this method to poll the operation result at intervals as recommended by the API service. Required. The name of the `RagStore` where this `Operation` exists. 
Example: `ragStores/my-rag-store-123` Required. The name of the `Operation`. Options for the request. A cancellation token that can be used by other objects or threads to receive notice of cancellation. Metadata for the given file. Thrown when the is or empty. Thrown when the is or empty. Thrown when the functionality is not supported by the model. Thrown when the request fails to execute. Gets the latest state of a long-running operation. Clients can use this method to poll the operation result at intervals as recommended by the API service. Required. The name of the `RagStore` where this `Operation` exists. Example: `ragStores/my-rag-store-123` Required. The name of the `Operation`. Options for the request. A cancellation token that can be used by other objects or threads to receive notice of cancellation. Metadata for the given file. Thrown when the is or empty. Thrown when the is or empty. Thrown when the functionality is not supported by the model. Thrown when the request fails to execute. The model object. Output only. Id of the model. Output only. Always "model", required by the SDK. Output only. The Unix timestamp (in seconds) when the model was created. Output only. The organization that owns the model. Output only. Optional. An indicator whether a fine-tuned model has been deleted. Initializes a new instance of the class. Initializes a new instance of the class. Optional. The to use for creating HttpClient instances. Optional. Logger instance used for logging Initializes a new instance of the class with access to Vertex AI Gemini API. Identifier of the Google Cloud project Region to use Access token for the Google Cloud project. Model to use Optional. The to use for creating HttpClient instances. Optional. Logger instance used for logging Options for the request. A cancellation token that can be used by other objects or threads to receive notice of cancellation. Thrown when the is . Options for the request. 
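The recommended polling of a long-running operation, as described above, usually reduces to a loop like this sketch; `ragStore`, `GetOperation`, and `operation.Done` are assumed names:

```csharp
// Poll the operation until the service reports completion.
var operation = await ragStore.GetOperation("ragStores/my-rag-store-123", operationName);
while (operation.Done != true)
{
    await Task.Delay(TimeSpan.FromSeconds(5));   // back off at the interval the service recommends
    operation = await ragStore.GetOperation("ragStores/my-rag-store-123", operationName);
}
```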
A cancellation token that can be used by other objects or threads to receive notice of cancellation. Gets metadata of a tuning job. Required. The ID of the tuning job. Format: `tuningJobs/{id}` Options for the request. A cancellation token that can be used by other objects or threads to receive notice of cancellation. The details of a tuning job. Thrown when the is or empty. Cancels a tuning job. Required. The ID of the tuning job. Format: `tuningJobs/{id}` Options for the request. A cancellation token that can be used by other objects or threads to receive notice of cancellation. If successful, the response body is empty. Thrown when the is or empty. Deletes a tuning job. Required. The ID of the tuning job. Format: `tuningJobs/{id}` Options for the request. A cancellation token that can be used by other objects or threads to receive notice of cancellation. If successful, the response body is empty. Thrown when the is or empty. The search parameters to use for the ELASTIC_SEARCH spec. The Elasticsearch index to use. Optional. Number of hits (chunks) to request. When specified, it is passed to Elasticsearch as the `num_hits` param. The Elasticsearch search template to use. The search parameters to use for the SIMPLE_SEARCH spec. Tool that executes code generated by the model and automatically returns the result to the model. See also `` and `` which are only generated when using this tool. Entry point to access the Gemini API running in Vertex AI. See Model reference. See also https://cloud.google.com/nodejs/docs/reference/vertexai/latest/vertexai/vertexinit Initializes a new instance of the class with access to Vertex AI Gemini API. The default constructor attempts to read the .env file and environment variables. Sets default values, if available. Optional. The to use for creating HttpClient instances. Optional. Options for the request. Optional. 
Logger instance used for logging The following environment variables are used: GOOGLE_PROJECT_ID Identifier of the Google Cloud project. GOOGLE_REGION Identifier of the Google Cloud region to use (default: "us-central1"). Initializes a new instance of the class with access to Vertex AI Gemini API. Identifier of the Google Cloud project. Optional. Region to use (default: "us-central1"). Access token for the Google Cloud project. Optional. Endpoint ID of the deployed model to use. Version of the API. Optional. The to use for creating HttpClient instances. Optional. Options for the request. Optional. Logger instance used for logging Thrown when is . Initializes a new instance of the class with access to Vertex AI Gemini API. API key for Vertex AI in express mode. Version of the API. Optional. The to use for creating HttpClient instances. Optional. Options for the request. Optional. Logger instance used for logging. Thrown when is . Create a generative model on Vertex AI to use. Model to use (default: "gemini-1.5-pro") Optional. Configuration options for model generation and outputs. Optional. A list of unique SafetySetting instances for blocking unsafe content. Optional. A list of Tools the model may use to generate the next response. Optional. Optional. Logger instance used for logging. Generative model instance. Thrown when "projectId" or "region" is . Create a generative model on Vertex AI to use. Content that has been preprocessed. Optional. Configuration options for model generation and outputs. Optional. A list of unique SafetySetting instances for blocking unsafe content. Optional. Logger instance used for logging. Generative model instance. Thrown when is null. Thrown when "projectId" or "region" is . Create a generative model on Vertex AI to use. Tuning Job to use with the model. Optional. Configuration options for model generation and outputs. Optional. A list of unique SafetySetting instances for blocking unsafe content. Optional. 
Logger instance used for logging. Generative model instance. Thrown when is null. Thrown when "projectId" or "region" is . Model to use. Optional. Logger instance used for logging. Thrown when "projectId" or "region" is . Model to use (default: "imagegeneration") Optional. Logger instance used for logging. Thrown when "projectId" or "region" is . Model to use (default: "imagetext") Optional. Logger instance used for logging. Thrown when "projectId" or "region" is . Optional. Logger instance used for logging. Thrown when "projectId" or "region" is . Thrown when "apiKey" is Thrown when "projectId" or "region" is . Generates a video from the model given an input. Initializes a new instance of the class. Initializes a new instance of the class. Optional. The to use for creating HttpClient instances. Optional. Logger instance used for logging Initializes a new instance of the class with access to Google AI Gemini API. API key provided by Google AI Studio Model to use Optional. The to use for creating HttpClient instances. Optional. Logger instance used for logging Initializes a new instance of the class with access to Vertex AI Gemini API. Identifier of the Google Cloud project Region to use Model to use Optional. The to use for creating HttpClient instances. Optional. Logger instance used for logging Generates videos from the specified . Options for the request. A cancellation token that can be used by other objects or threads to receive notice of cancellation. Generates images from text prompt. Required. String to process. Number of images to generate. Range: 1..8. A description of what you want to omit in the generated images. Aspect ratio for the image. Adds a filter level to Safety filtering. Allow generation of people by the model. Option to enhance your provided prompt. Options for the request. A cancellation token that can be used by other objects or threads to receive notice of cancellation. Response from the model for generated content. Thrown when the is . 
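The Vertex AI entry point described above is typically wired up as in this sketch, which follows the library's documented pattern; the project ID, region, model constant, and access token are placeholders:

```csharp
using Mscc.GenerativeAI;

var vertexAi = new VertexAI(projectId: "my-project", region: "us-central1");
var model = vertexAi.GenerativeModel(model: Model.Gemini15Pro);
model.AccessToken = "<access token>";   // e.g. output of `gcloud auth print-access-token`

var response = await model.GenerateContent("Why is the sky blue?");
Console.WriteLine(response.Text);
```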
Thrown when the request fails to execute. Generates a response from the model given an input prompt and other parameters. Required. String to process. Options for the request. A cancellation token that can be used by other objects or threads to receive notice of cancellation. Response from the model for generated content. Thrown when the is . Thrown when the request fails to execute.
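For the Google AI path, the prompt-based `GenerateContent` described above is typically called as in this sketch, following the library's documented pattern; the model constant and environment variable name are placeholders:

```csharp
using Mscc.GenerativeAI;

var googleAi = new GoogleAI(apiKey: Environment.GetEnvironmentVariable("GOOGLE_API_KEY"));
var model = googleAi.GenerativeModel(model: Model.Gemini15Flash);

var response = await model.GenerateContent("Write a haiku about the sea.");
Console.WriteLine(response.Text);
```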