FAQ - Frequently Asked Questions

How to get started?

Use the getting started guide to create a tenant and start your free trial. This tenant can be your personal tenant, or shared with others. It cannot be renamed.

How to create a company tenant?

If the tenant is already created, add more users to it. Click the “account” button in the Vespa Cloud Console (top right in the tenant view), then “users”. From this view you can manage the users in the tenant and their roles, including adding or setting tenant admins. Each user’s email must have an associated Google or GitHub account.

What if my organization cannot use Google or GitHub accounts?

Vespa Cloud currently uses Auth0 with Google or GitHub login only. If your organization cannot use either, contact Vespa Cloud Support.

How to accept Terms of Service?

When starting the free trial, you are asked to accept Terms of Service. For paid plans, this is covered by the contract.

How do I switch from free trial to a paid plan?

Contact Vespa Cloud Support for a contract. Click “account”, then “billing” in the console to enter information required for billing. Use Vespa Cloud Support if you need to provide this information without console login.

Where can I find feeding and query examples?

See vector-search for examples of how to feed and query with security credentials.
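
As a rough illustration of what such requests look like, the sketch below builds a feed request and a query request with Python's standard library. It is not taken from the linked examples. The endpoint URL, namespace, document type, field names and certificate file names are placeholders, and the send helper assumes the endpoint accepts mutual TLS with a data-plane certificate and key; treat the linked examples as the authoritative setup.

```python
import json
import ssl
import urllib.request
from urllib.parse import quote


def build_feed_request(endpoint, namespace, doctype, doc_id, fields):
    """Build a POST to /document/v1 that writes one document."""
    url = (f"{endpoint}/document/v1/{quote(namespace, safe='')}/"
           f"{quote(doctype, safe='')}/docid/{quote(doc_id, safe='')}")
    body = json.dumps({"fields": fields}).encode("utf-8")
    return urllib.request.Request(
        url, data=body, method="POST",
        headers={"Content-Type": "application/json"},
    )


def build_query_request(endpoint, yql, hits=10):
    """Build a POST to /search/ that runs a YQL query."""
    body = json.dumps({"yql": yql, "hits": hits}).encode("utf-8")
    return urllib.request.Request(
        f"{endpoint}/search/", data=body, method="POST",
        headers={"Content-Type": "application/json"},
    )


def send(request, cert_file, key_file):
    """Send a request with a client certificate (assumes mutual TLS)."""
    context = ssl.create_default_context()
    context.load_cert_chain(certfile=cert_file, keyfile=key_file)
    with urllib.request.urlopen(request, context=context) as response:
        return json.load(response)
```

Building the requests separately from sending them makes it easy to inspect the URL and body before any credentials are involved, for example `build_query_request("https://my-app.example.com", "select * from doc where true").full_url`.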

Managing resources

How can I change the cost of my Vespa Cloud usage?

Use the “Resource view” per instance in the Console to assess current and auto-suggested resources. See Autoscaling for how to automate.

How can I manually modify resources used?

Managing resources is easy, as most changes are automated. Adding, removing or changing nodes starts automated data migration; see Elastic Vespa.

How to modify a schema?

Some schema changes require data reindexing, which is automated but takes some time. Other schema changes require refeeding the data (details).

How to evaluate how much memory a field is using?

Use the Memory Visualizer to evaluate how memory is allocated to the fields. Fields can be index, attribute and summary, and combinations of these, with settings like fast-search that affect memory usage. Attributes is a great read for understanding Vespa memory usage.

Archive access failed with Permission ‘serviceusage.services.use’ denied

Listing archived objects can fail. For example,

gsutil -u my_project ls gs://vespa-cloud-data-prod-gcp-us-central1-f-12345f/my_tenant

can fail with

AccessDeniedException: 403 me@mymail.com does not have serviceusage.services.use access to the Google Cloud project. Permission 'serviceusage.services.use' denied on resource (or it may not exist).

This can be due to missing rights on your Google project (my_project in the example above). From the Google documentation: “The user account accessing the Cloud Storage Bucket must be granted the Service Usage Consumer role (see https://cloud.google.com/service-usage/docs/access-control) in order to charge the specified user project for the bucket usage cost”.
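
One way to grant that role is with the gcloud CLI. The sketch below only assembles and prints the command rather than running it, so it can be reviewed first. It assumes the account is a user account (not a service account), that roles/serviceusage.serviceUsageConsumer is the role ID for Service Usage Consumer, and that whoever runs the command is allowed to change IAM policy on the project; the project and email are the placeholders from the example above.

```python
import shlex


def grant_service_usage_consumer_command(project, user_email):
    """Return the gcloud argv that grants the Service Usage Consumer role."""
    return [
        "gcloud", "projects", "add-iam-policy-binding", project,
        f"--member=user:{user_email}",
        "--role=roles/serviceusage.serviceUsageConsumer",
    ]


# Placeholder project and account from the example above.
command = grant_service_usage_consumer_command("my_project", "me@mymail.com")
print(shlex.join(command))  # review, then run it in a shell
```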

Performance

What is the best practice for scaling Vespa for day vs night?

Autoscaling is the best guide to understand how to size and autoscale the system. Container clusters are stateless and can be autoscaled more quickly than content clusters.

We can spike 8x in 5 minutes in terms of throughput requirements.

It is not possible to autoscale content clusters for an 8x load increase in 5 minutes, as this requires both provisioning and data migration. Such use cases are best discussed with the Vespa Team to understand the resource bottlenecks, tradeoffs and mitigations. Also read Graceful Degradation.
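
A back-of-the-envelope calculation shows what provisioning ahead of such a spike implies. The sketch below is illustrative only: the baseline load, per-node throughput and target utilization are made-up numbers, and it assumes throughput scales linearly with node count, which should be verified with benchmarks.

```python
import math


def nodes_needed(load_qps, qps_per_node, target_utilization):
    """Nodes required to serve load_qps at or below the target utilization."""
    return math.ceil(load_qps / (qps_per_node * target_utilization))


baseline_qps = 1_000        # hypothetical steady-state load
spike_factor = 8            # from the question: 8x within 5 minutes
qps_per_node = 500          # hypothetical; measure with your own benchmarks
target_utilization = 0.7    # hypothetical headroom target

baseline_nodes = nodes_needed(baseline_qps, qps_per_node, target_utilization)
peak_nodes = nodes_needed(baseline_qps * spike_factor, qps_per_node, target_utilization)
print(f"baseline: {baseline_nodes} nodes, peak: {peak_nodes} nodes")
```

With these numbers the cluster would need to sit at peak size before the spike arrives, which is the kind of cost tradeoff to raise in that discussion.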

How much lower-level configuration do we need to do? For example, do we need to alter the number of threads per container?

It depends. Vespa aims to adapt to resources (like auto thread config based on virtual node thread count) and actual use (when to run maintenance jobs like compaction), but there are tradeoffs that application owners can/should make. Start off by reading the Vespa Serving Scaling Guide, then run benchmarks and use the dashboards.

Vespa Cloud Operations

How do I integrate with my current monitoring infrastructure?

Vespa Cloud applications have a Prometheus endpoint. Find guides for how to integrate with Grafana and AWS CloudWatch at monitoring.
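
To inspect scraped metrics before wiring up a monitoring system, the Prometheus text exposition format is simple enough to parse by hand. The sketch below is a minimal parser run on an inline sample. The metric names, labels and values in the sample are invented for illustration, and the parser deliberately ignores timestamps and escaped characters in label values.

```python
import re

# One sample line: metric name, optional {labels}, then a value.
SAMPLE_LINE = re.compile(r'^([a-zA-Z_:][a-zA-Z0-9_:]*)(?:\{(.*)\})?\s+(\S+)')
LABEL = re.compile(r'([a-zA-Z_][a-zA-Z0-9_]*)="([^"]*)"')


def parse_prometheus_text(text):
    """Return a list of (metric_name, labels_dict, value) tuples."""
    samples = []
    for line in text.splitlines():
        line = line.strip()
        if not line or line.startswith("#"):  # skip blanks, HELP and TYPE lines
            continue
        match = SAMPLE_LINE.match(line)
        if match is None:
            continue
        name, raw_labels, value = match.groups()
        labels = dict(LABEL.findall(raw_labels or ""))
        samples.append((name, labels, float(value)))
    return samples


# Invented sample, not real Vespa metric names.
sample = '''
# TYPE example_query_latency_ms gauge
example_query_latency_ms{cluster="default",zone="prod"} 12.5
example_feed_operations_total 42
'''

for name, labels, value in parse_prometheus_text(sample):
    print(name, labels, value)
```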

What is the best way to monitor instantaneously what is happening in Vespa? CPU usage? Memory usage? htop? CloudWatch metrics?

Vespa Cloud has detailed dashboards linked from the monitoring tab in the Console, one for each zone the instance is deployed to.

How are Vespa version upgrades handled - only for new deploys?

Vespa is normally upgraded daily. There are exceptions, like holidays and weekends. During upgrades, nodes are stopped one-by-one per cluster. As all clusters have one redundant node, serving and write traffic is not impacted by upgrades. Before the upgrade, the application’s system and staging tests are run, halting the upgrade if they fail. Documents are re-migrated to the upgraded node before the next node is upgraded; see Elastic Vespa for details.

How do we get alerted to issues like Feed Block? Searchable copy going offline?

Issues like Feed Blocked, Deployment and Deprecation warnings show up in the console. There are no warnings on redundancy level / searchable copies, as redundant document buckets are activated for queries automatically, and auto data-migration kicks in for node failures / replacements.

What actions are needed when deploying schema changes?

Schema changes that require service restart are handled automatically by Vespa Cloud. A deployment job involves waiting for these to complete.
Schema changes that require reindexing of data require a validation override, and will trigger automatic reindexing. Status can be tracked in the console application view. Vespa Cloud also periodically re-indexes all data, with minimal resource usage, to account for changes in linguistics libraries.
Schema changes that require refeeding data require a validation override, and the user must refeed the data after deployment.
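
A validation override is declared in a validation-overrides.xml file in the application package. The sketch below generates such a file with Python's standard library. The override ID indexing-change is only an example (use the ID that the deployment error message reports), the default of 14 days is arbitrary, and the 30-day cap on the until date is an assumption about the allowed window; check the validation overrides reference before relying on either.

```python
import datetime
import xml.etree.ElementTree as ET


def validation_overrides_xml(override_ids, today, valid_days=14):
    """Render a validation-overrides document allowing the given IDs."""
    if not 0 < valid_days <= 30:  # assumed upper limit for the 'until' date
        raise ValueError("valid_days must be between 1 and 30")
    until = today + datetime.timedelta(days=valid_days)
    root = ET.Element("validation-overrides")
    for override_id in override_ids:
        allow = ET.SubElement(root, "allow", {"until": until.isoformat()})
        allow.text = override_id
    return ET.tostring(root, encoding="unicode")


# Example ID only; take the real one from the deployment error message.
xml_text = validation_overrides_xml(["indexing-change"], datetime.date(2024, 1, 1))
print(xml_text)
```

Keeping the until date short-lived means the override expires on its own, so a later, unrelated change of the same kind is not allowed through by accident.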

What are the Vespa Cloud data retention policies?

The management of data stored in an application running on Vespa Cloud is the responsibility of the application owner and, as such, Vespa Cloud does not have any retention policy for this data as long as it is stored by the application.

The following data retention policies apply to Vespa Cloud:

After a node previously allocated to an application has been deallocated (e.g. due to application being deleted by application owner), all application data will be deleted within four hours.
All application log data will be deleted from Vespa servers after no more than 30 days (most often sooner), depending on log volume, allocated disk resources, etc. PLEASE NOTE: This is the theoretical maximum retention time - see archive guide for how to ensure access to your application logs.

Queries

Does Vespa Cloud support a simple UI for querying like /querybuilder?

It does not, but it is a great feature request!

I have a question about Vespa.ai queries and ranking

Vespa Cloud is built on Vespa.ai. Find documentation and support resources on this site for how to feed / query / rank with Vespa.

Misc

Is Vespa Cloud certified for ISO 27001 or SOC 2 Type II?

Vespa Cloud itself is not yet certified. It is deployed on AWS, see iso-27001-faqs and soc-faqs.

Does Vespa store information from the information sources with which it is integrated?

Vespa is most often used for queries over data written from the information sources, although it can also be used without data, e.g. for model serving. The application owner writes the integration that writes data to Vespa Cloud.

What is the encryption algorithm used at rest?

Vespa Cloud uses LUKS2 with default aes-xts-plain64 encryption for all file storage. Also, Vespa Cloud uses AWS EC2 instances with local or remote storage, encrypted at rest - see encryption-of-data-at-rest.

Does the Vespa console have audit trails/logs module and can it be accessed by an Admin user?

See the security guide for roles and permissions. The Vespa Cloud Console has a log view tool, and logs / access logs can be exported to the customer’s AWS account easily. Deployment operations are tracked in the deployment view, with a history. Vespa Cloud Operators do not have node access unless specifically granted by the customer, and such access is audit logged.

Once the service purchased with Vespa is terminated, is there a secure deletion procedure for the information collected from the customer?

At termination, all application instances are removed, with data, before the tenant can be deactivated.

Why is the CPU usage for my application above 100%?

In dev zones we use shared resources, and hence have more than one node on each host/instance. To provide the best possible overall responsiveness, we do not restrict CPU resources for the individual application nodes, so a node can use more CPU than it is allocated and report usage above 100%.