How can I query Solr to get high scores for documents with more similarity?
I have a Solr core populated from the products table of an e-commerce database. "Title" is a field in the core. What is the best approach to get proper scoring/ranking? For example, if someone searches for "iPhone", the product with the title "iPhone 6s" should get a higher score than one titled "Beautiful case for iPhone 6, iPhone 7 and similar iPhones". Does Solr have some variation of Levenshtein distance that may help? The Solr version is 7.3.

Clarification: both values I mentioned, "iPhone 6s" and "Beautiful case for iPhone 6, iPhone 7 and similar iPhones", are in the same field, Title. By Levenshtein distance, "iPhone 6s" is closer to "iPhone" than "Beautiful case for iPhone 6, iPhone 7 and similar iPhones" is. Can this fact be put to use so that "iPhone 6s" comes before the other one?
solr levenshtein-distance
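To make the distance claim in the question concrete, here is a minimal sketch in plain Python that computes the Levenshtein distance between the query and each of the two titles. It runs outside Solr and is only an illustration of the measure; the function name `levenshtein` is made up for this example. Note that Solr's fuzzy operator (`~`) works on individual terms rather than on whole field values, so this is not what a fuzzy query would compute.

```python
def levenshtein(a: str, b: str) -> int:
    """Classic dynamic-programming edit distance (insert, delete, substitute)."""
    previous = list(range(len(b) + 1))
    for i, char_a in enumerate(a, start=1):
        current = [i]
        for j, char_b in enumerate(b, start=1):
            cost = 0 if char_a == char_b else 1
            current.append(min(
                previous[j] + 1,         # delete char_a
                current[j - 1] + 1,      # insert char_b
                previous[j - 1] + cost,  # substitute or match
            ))
        previous = current
    return previous[-1]


query = "iPhone"
short_title = "iPhone 6s"
long_title = "Beautiful case for iPhone 6, iPhone 7 and similar iPhones"

for title in (short_title, long_title):
    print(levenshtein(query, title), title)
```

Because "iPhone" occurs verbatim in both titles, each distance is simply the number of extra characters in the title, so the measure here rewards short titles more than it rewards similarity as such.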
Have a look at the answers to stackoverflow.com/questions/1752301/…
– kellyfj
Mar 26 at 14:12
You can give different "boost" values to each field, for example:
q=iphone&qf=title^10+description&defType=edismax
would give a bigger boost to matches on title vs matches on the description.
– Hector Correa
Mar 26 at 15:45
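The boosted query from this comment could be assembled as follows. This is only a sketch: the host, port, and core name `products` are assumptions, and the `fl` parameter is added here just to make the scores visible in the response. In the raw URL the `+` in `qf` stands for a space, so the unencoded value is `title^10 description`.

```python
from urllib.parse import urlencode

# Hypothetical Solr location and core name; adjust to your installation.
SOLR_SELECT_URL = "http://localhost:8983/solr/products/select"

params = {
    "q": "iphone",
    "defType": "edismax",           # use the Extended DisMax query parser
    "qf": "title^10 description",   # title matches weigh 10x description matches
    "fl": "title,score",            # return the title and the computed score
}

url = f"{SOLR_SELECT_URL}?{urlencode(params)}"
print(url)
```

As the asker points out further down, both example values live in the same Title field, so a per-field boost alone does not separate them.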
You can start by setting omitNorms and, if you're not doing phrase searches, omitTermFreqAndPositions on the field. This will disable any scoring based on field length and the number of occurrences.
– MatsLindh
Mar 26 at 20:03
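One way to apply these two field properties is through Solr's Schema API, which accepts a `replace-field` command as JSON. The sketch below only builds that payload; the field name `title` and the type `text_general` are assumptions, and it presumes the core uses a managed schema. Documents have to be reindexed after such a change.

```python
import json

# Hypothetical field definition: the name "title" and the type "text_general"
# are assumptions; keep whatever type your Title field already uses.
replace_title_field = {
    "replace-field": {
        "name": "title",
        "type": "text_general",
        "indexed": True,
        "stored": True,
        "omitNorms": True,                 # ignore field length when scoring
        "omitTermFreqAndPositions": True,  # ignore repeats; disables phrase queries
    }
}

payload = json.dumps(replace_title_field, indent=2)
print(payload)  # POST this to the core's /schema endpoint, then reindex
```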
@HectorCorrea - both values appear in Title column. Vendors keep repeating a term like "iPhone" many times in Title hoping that it will give them an edge in Search Engine Optimization.
– Jayadevan
Mar 27 at 7:32
@MatsLindh - why omit norms? In such cases norms will actually help, since they normalize the field score by the length of the field.
– Tomer Arazy
Mar 27 at 15:19
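The effect described in this comment can be sketched with the term-frequency part of BM25, assuming the field is scored with BM25 at its usual defaults (k1 = 1.2, b = 0.75). The token counts and the average title length below are assumptions for illustration, and setting b = 0 is used as a rough stand-in for a field without norms; Lucene's stored norms are also lossy, so real scores will differ.

```python
def bm25_tf_factor(term_freq, field_len, avg_field_len, k1=1.2, b=0.75):
    """Term-frequency part of BM25, including length normalization."""
    length_norm = 1 - b + b * field_len / avg_field_len
    return term_freq * (k1 + 1) / (term_freq + k1 * length_norm)


AVG_LEN = 5  # assumed average title length in tokens

# "iPhone 6s": 2 tokens, "iphone" occurs once.
short_with_norms = bm25_tf_factor(1, 2, AVG_LEN)
# The long title: assumed 10 tokens, "iphone" occurs twice
# (assuming "iPhones" is not stemmed to "iphone").
long_with_norms = bm25_tf_factor(2, 10, AVG_LEN)

# b=0 removes the length term, roughly what omitting norms does.
short_without_norms = bm25_tf_factor(1, 2, AVG_LEN, b=0)
long_without_norms = bm25_tf_factor(2, 10, AVG_LEN, b=0)

print("with norms:   ", short_with_norms, long_with_norms)
print("without norms:", short_without_norms, long_without_norms)
```

Under these assumptions the short title wins only while length normalization is active; without it, the repeated term pushes the long title ahead unless term frequencies are omitted as well.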
asked Mar 26 at 13:19 by Jayadevan
edited Mar 27 at 7:30 by Jayadevan