It indicates that the cosine distance metric should be used to measure similarity between vectors
It specifies the type of vector encoding used in the database
32. You are tasked with finding the closest matching sentences across books, where each book has
multiple paragraphs and sentences. Which SQL structure should you use?
GROUP BY with vector operations
FETCH PARTITIONS BY clause
A nested query with ORDER BY
Exact similarity search with a single query vector
33. In the following Python code, what is the significance of prepending the source filename to each
text chunk before storing it in the vector database? docs = [{"text": filename + "|" + section, "path":
filename} for filename, sections in [Link]() for section in sections] # Sample the resulting data
docs[:2]
It preserves context and aids in the retrieval process by associating each vectorized chunk with its original source file
It helps differentiate between chunks from different files but has no impact on vectorization
It speeds up the vectorization process by providing a unique identifier for each chunk
It improves the accuracy of the LLM by providing additional training data
34. How does an application use vector similarity search to retrieve relevant information from a
database, and how is this information then integrated into the generation process?
Encodes the question and database chunks into vectors, finds the most similar (top-k) database entries, and includes
them in the LLM prompt
Trains a separate LLM on the database and uses it to answer, ignoring the general LLM
Converts the question to keywords, searches for matches, and inserts the text into the prompt
Clusters similar text chunks and randomly selects one from the most relevant cluster
35. When using SQL*Loader to load vector data for search applications, what is a critical consideration
regarding the formatting of the vector data within the input CSV file?
Encode vector components in curly braces {}
As FVEC is a binary format and the vector dimensions have a known width; fixed offsets can be used to make parsing
the vectors fast and efficient
Use sparse format for vector data
Rely on SQL*Loader's automatic normalization of vector data
36. Which function is used to generate vector embeddings within an Oracle database?
DBMS_VECTOR_CHAIN_UTL_TO_CHUNKS
DBMS_VECTOR_CHAIN_UTL_TO_TEXT
DBMS_VECTOR_CHAIN_UTL_TO_EMBEDDINGS
DBMS_VECTOR_CHAIN_UTL_TO_GENERATE_TEXT
37. Which statement best describes the capability of Oracle Data Pump for handling vector data in the
context of vector search applications?
Data Pump can only export and import vector data if the vector embeddings are stored as BLOB, since Large Object
types are in the database
Data Pump fully natively understands vectors as high-level data, and can be used to export/import tables of any
precision when transferring vector data for vector search applications
Data Pump provides native support for exporting and importing tables containing vector data types, facilitating the
transfer of vector data for vector search applications
Because of the complexity of vector data Data Pump requires a specialized plug-in to handle the export and import
operations involving vector data types
38. In Oracle Database 23ai, which data type is used to store vector embeddings for similarity search?
VECTOR2
BLOB
VECTOR
VARCHAR2
39. What is created to facilitate the use of OCI Generative AI with Autonomous Database?
An AI profile for OCI Generative AI
A dedicated OCI compartment
A new user account with elevated privileges
40. Why would you choose to NOT define a specific size for the VECTOR column during development?
It impacts the accuracy of similarity searches
It restricts the database to a single embedding model
It limits the length of text that can be vectorized
Different external embedding models produce vectors with varying dimensions and data types
41. What is the correct order of steps for building a RAG application using PL/SQL in Oracle Database
23ai?
Load ONNX Model, Vectorize Question, Load Document, Split Text into Chunks, Create Embeddings, Perform Vector
Search, Generate Output
Load Document, Split Text into Chunks, Load ONNX Model, Create Embeddings, Vectorize Question, Perform Vector
Search, Generate Output
Vectorize Question, Load ONNX Model, Load Document, Split Text into Chunks, Create Embeddings, Perform Vector
Search, Generate Output
Load Document, Load ONNX Model, Split Text into Chunks, Create Embeddings, Vectorize Question, Perform Vector
Search, Generate Output
42. What is the primary purpose of a similarity search in Oracle Database 23ai?
Optimize relational database operations
To compute distances between all data points in a database
To find exact matches in BLOB data
To retrieve the most semantically similar entries using distance metrics between different vectors
43. What is the advantage of using Euclidean Squared Distance rather than Euclidean Distance in
similarity search queries?
It is the default distance metric for Oracle AI Vector Search
It supports hierarchical partitioning of vectors
It is simpler and faster because it avoids square-root calculations
It guarantees higher accuracy than Euclidean Distance
44. You need to prioritize accuracy over speed in a similarity search for a dataset of images. Which
should you use?
Approximate similarity search with HNSW indexing and target accuracy of 70%
Multivector similarity search with partitioning
Exact similarity search using a full table scan
Approximate similarity search with IVF indexing and target accuracy of 70%
45. What is the significance of splitting text into chunks in the process of loading data into Oracle AI
Vector Search?
To reduce the computational burden on the embedding model
To facilitate parallel processing of the data during vectorization
To minimize token truncation as each vector embedding model has its own maximum token limit
To improve the accuracy of vector search by creating more granular vectors
46. What is the purpose of the VECTOR_DISTANCE function in Oracle Database 23ai similarity search?
To fetch rows that match exact vector embeddings
To create vector indices for efficient searches
To group vectors by their exact scores
To calculate the distance between vectors using a specified metric