A Generative AI Engineer developed an LLM application using the provisioned throughput Foundation Model API. Now that the application is ready to be deployed, they realize their volume of requests are not sufficiently high enough to create their own provisioned throughput endpoint. They want to choose a strategy that ensures the best cost-effectiveness for their application.What strategy should the Generative AI Engineer use?
A Generative AI Engineer is building an LLM to generate article summaries in the form of a type of poem, such as a haiku, given the article content. However, the initial output from the LLM does not match the desired tone or style.Which approach will NOT improve the LLM’s response to achieve the desired response?
A Generative AI Engineer is creating an LLM-powered application that will need access to up-to-date news articles and stock prices.The design requires the use of stock prices which are stored in Delta tables and finding the latest relevant news articles by searching the internet.How should the Generative AI Engineer architect their LLM system?
A Generative AI Engineer is designing a chatbot for a gaming company that aims to engage users on its platform while its users play online video games.Which metric would help them increase user engagement and retention for their platform?
A company has a typical RAG-enabled, customer-facing chatbot on its website.Select the correct sequence of components a user's questions will go through before the final output is returned. Use the diagram above for reference.
A team wants to serve a code generation model as an assistant for their software developers. It should support multiple programming languages. Quality is the primary objective.Which of the Databricks Foundation Model APIs, or models available in the Marketplace, would be the best fit?