Integration Details
API
There is basically a single method that needs to be called with the user query and that gives you access to the according QueryMapping
.
QueryMapper qm = qmManager.getQueryMapper(tenant);
String sessionId = null; // only required to support query-testing (see use cases)
QueryMapping mapping = qm.mapQuery(searchQuery, sessionId);
URL Scheme:
http://<host>:<port>/smartquery/v2/<tenant-name>/<tenant-channel>?userQuery=<user-query>[&sessionId=<SearchCollectorSession>]
- Parameters:
tenant-name (path): the first part of the tenant provided by searchHub
tenant-channel (path): the second part of the tenant provided by searchHub
userQuery (query): the text the user entered in the search box
sessionId (query): optional parameter that MUST contain the value of the SearchCollectorSession (see details below)
QueryMapping Response
The object returned by the QueryMapper contains all information to change the search result. The main focus is to change the natural query that is processed by the search engine. That’s the so-called ‘masterQuery’. In case there is no masterQuery, the original user query should be used for search. To avoid null-checks, there is the ‘searchQuery’ property that will always contain the correct query.
userQuery: the entered user query
masterQuery: if the query could be mapped, the master query is set, otherwise it’s null.
searchQuery: the final search query. This is the master or the user query.
successful: true if the query could be handled by smartQuery
Additional the QueryMapping contains more properties that can be used to further improve the result for the user.
redirect: URL to a landing page. If given, the search process can be stopped and the user can be redirected to the according URL.
potentialCorrections: An optional array of 1 or 2 queries that could be a correction to the given query. They are given in case no reliable masterQuery could be found and should be shown to the user as possible alternative queries.
relatedQueries: An optional list of queries that are related to the user input. They can be used as inspiring queries next to the search result.
resultModifications: An optional list of up to four different list types, each describing how the initial search result should be modified. Each modification type is paired with a list of corresponding IDs (docIDs).
Redirect
At searchHub search managers have the ability to add redirect URLs for certain search intends. That can be redirects to category pages, filtered or sorted search results or other arbitrary sites in the shop. If a redirect
is given, it should take precedence over the normal search. In fact, you can spare the request to the search engine, saving cost and time.
Normally the provided redirect URL comes as a full qualified URL. That way you can also validate if this is a redirect inside the shop. However, we can also provide relative URLs. That’s helpful if you have the same URL structures in different stages and want them to work in all of them. To achieve that, please let us know about your URL structures, we then adjust the exported URLs accordingly.
Potential Corrections
To implement the user story about potential corrections you should use the queries given with the potentialCorrections
property.
It’s a string array of one or more elements in case no certain mapping is available.
{ "userQuery": "batroom", "potentialCorrections": [ "bathroom", "bedroom" ] }
Along with a potential search result it should be presented to the user as alternative queries, maybe embedded into some feedback text like “Did you mean ‘${potentialCorrections[0]}’”.
A click on one of those queries should cause a new search request and can lead to an improved search result. In combination with our Search Collector tracking, we gather accepted corrections and may automatically improve future requests of those corrected queries.
Result Modifications
With our Neural-Infusion approach we are able to provide additional products to the ones delivered by your search engine. Also due to detailed tracking we know about low performing products or products that are not relevant at all for a given query. This information is given to you trough the resultModifications
property: A list of product sets that should be added or removed to the generated result.
Example
{ "userQuery": "jeans", "resultModifications": [ {"modificationType": "Pin", "ids": ["100012", "100049", "100139"]}, {"modificationType": "Add", "ids": ["100472", "100202", "100387"]}, {"modificationType": "Penalize", "ids": ["100355"]} ] }
In this example 3 products should be “pinned” to the top of the result, 3 more products should be added and submerged into the result and 1 product should get a penalty to not show up in the top results.
These are the available resultModification types:
Add: The specified IDs should be inserted into the result at any position during result generation, allowing them to be discovered through scrolling, filtering, or sorting..
Remove: Products with the specified IDs should be excluded from the result, if present. This should occur during result generation to ensure accurate facet filtering.
Pin: The specified product IDs should be included in the result and placed at the top.
Penalize: The specified products should receive a score reduction, pushing them toward the end of the result.
In the best case, your search engine is be able to perform those modifications during result generation, so those changes are already reflected in the filter facets. In the rather suboptimal case, you have to filter or insert those products artificially in the returned result. Please reach out to us for integration help with your search engine. In the following example queries we show how it could be done with common open source engines.
Adding Products
This request type allows you to search for documents that match either your query or a list of document IDs.
q=your_field:search term OR id:(id1 id2 id3)
The should clause allows you to search for documents that match either your query or a list of document IDs. The terms query filters documents by the specified list of document IDs. In case your document IDs don’t match the product IDs, use the according field.
{
"query": {
"bool": {
"should": [
{
"match": {
"your_field": "search term"
}
},
{
"terms": {
"_id": ["id1", "id2", "id3"]
}
}
]
}
}
}
Pinning Products
Pinning goes beyond adding products. Those products should be shown at the very beginning of your result.
To achieve prioritization of specific document IDs in Solr use the Query Elevation Component <https://solr.apache.org/guide/solr/latest/query-guide/query-elevation-component.html>. It provides the ‘elevateIds’ parameter that allows you to specify document IDs that should be prioritized in the search results. In this case, id1, id2, and id3 will receive a boost and appear higher in the results.
q={!edismax qf=your_field}search term&elevateIds=id1,id2,id3
If you want the documents found by _id to be prioritized over those that match the query, you can use a pinned query <https://www.elastic.co/guide/en/elasticsearch/reference/current/query-dsl-pinned-query.html>:
{
"query": {
"pinned": {
"ids": [ "id1", "id2", "id3" ],
"organic": {
"match": {
"your_field": {
"query": "search term"
}
}
}
}
}
}
As of now, Opensearch does not support pinned query. You can achieve similar results by boosting the according products with a high boosting value for example by using a query-string query.
{
"query": {
"bool": {
"should": [
{
"match": {
"your_field": "search term"
}
},
{
"query_string": {
"query": "id1^10000 id2^9999 id3^9998",
"default_field": "_id",
"default_operator": "OR"
}
}
]
}
}
}
Removing Products
Products with the specified IDs should be excluded from the result, if present. This should occur during result generation to ensure accurate facet filtering.
To achieve exclusion of specific document IDs in Solr using the excludeIds request parameters, you can do the following:
q={!edismax qf=your_field}search term&elevateIds=id1,id2,id3&excludeIds=id1,id2,id3
This parameter specifies document IDs that should be excluded from the search results entirely. Here, id1, id2, and id3 will be removed from the results even if they match the search term.
If you want to exclude the documents matched by _id from the final search results, you can achieve this by using a combination of must_not clause within your query. This ensures that documents found by _id are removed from the final set of results, while still running your main query.
{
"query": {
"bool": {
"must": [
{
"match": {
"your_field": "search term"
}
}
],
"must_not": [
{
"terms": {
"_id": ["id1", "id2", "id3"]
}
}
]
}
}
}
Penalizing Products
Products with the specified IDs should get a score penalty to went down in the result order.
To achieve a penalty of specific document IDs in Solr, the boost query can be used with a negated query. Basically all other documents get boosted but the ones listed.
defType=edismax&q=search term&fl=id,score&boost={!func}if(not(query({!lucene v='id:(id1 OR id2 OR id3)'})),10,1)
You can use the boosting query (ES docs / OS docs) to wrap your standard query into the “positive” clause while adding a terms query with the penalized products into the “negative” clause.
{
"query": {
"boosting": {
"positive": {
"match": {
"your_field": "search term"
}
},
"negative": {
"terms": {
"_id": ["id1", "id2", "id3"]
}
},
"negative_boost": 0.5
}
}
}
The ‘negative_boost’ value is multiplied with the score of the positive query for the products matching the negative query.
Integration with sessionID
If our search collector is integrated into the frontend of your system, it is recommended to pass the corresponding sessionId to smartQuery. This sessionId is used for clusters with queries being tested to distribute the search traffic evenly between both queries. Without the sessionId, the informative value and success rate of these tests are lower.
For implementation, the value of the SearchCollectorSession
cookie MUST be used. Using a different sessionId will lead to unexpected results.
If the SearchCollectorSession
cookie does not exist or is not provided for a request, pass ‘null’ instead.
The session ID is passed as additional parameter to the QueryMapper::mapQuery
method.
The session ID is passed to the service endpoint with the query parameter sessionId
.
Instrumenting
smartQuery optionally exposes internal metrics using the Micrometer framework. If you’d like to receive these metrics, add the desired Micrometer connector to your dependencies, as well as the MeterRegistry implementation.
Since that micrometer dependency is optional and we don’t want to cause class-loading errors, the according MeterRegistry has to be passed with a “MeterRegistryAdapter” when available. That’s as simple as MeterRegistryAdapter.of(meterRegistry)
.
// ...
MeterRegistry meterRegistry = getYourMeterRegistryInstance();
// Example: To expose metrics over JMX, create a JmxMeterRegistry
meterRegistry = new JmxMeterRegistry(JmxConfig.DEFAULT, Clock.SYSTEM);
// and add it to the QueryMapperManager.builder afterwards
queryMapperManagerBuilder.addMetricsRegistryAdapter(MeterRegistryAdapter.of(meterRegistry));
The HTTP service exposes those metrics per default. Only at the Java integration they have to be enabled explicitly.
Subsequently, you will be able to track the metrics described in the Operations > Monitoring section.