How can I query Solr to get high scores for documents with more similarity?How to configure Solr to use Levenshtein approximate string matching?is Lucene boolean OR queries scoring biased towards the term having more alternativesWhat does “documents” and “cores” mean in SOLR? And how can i use them?Single-word searches in Solr are not accurateSolr query: prefer phrase over occurrence of single words, but accept bothSolr: Scoring exact matches higher than partial matchesHow to enforce stable result scores in SOLR? By defining fixed docFreq?Solr configuration for scored searchSOLR (4.3) - reducing score of “poor” quality (very short) documentsSolr querying nested documents with ChildDocTransformerFactory, get “Parent query yields document which is not matched by parents filter”How to add a new query listener via the SolR Config API?

How to use grep to search through the --help output?

In Pokémon Go, why does one of my Pikachu have an option to evolve, but another one doesn't?

Infeasibility in mathematical optimization models

What are the uses and limitations of Persuasion, Insight, and Deception against other PCs?

Double blind peer review when paper cites author's GitHub repo for code

Accidentals - some in brackets, some not

Why does Intel's Haswell chip allow FP multiplication to be twice as fast as addition?

Author changing name

Non-OR journals which regularly publish OR research

Can I call myself an assistant professor without a PhD?

Which I-94 date do I believe?

Acceptable to cut steak before searing?

What does Apple mean by "This may decrease battery life"?

Who are these characters/superheroes in the posters from Chris's room in Family Guy?

Strangeness with gears

Are any jet engines used in combat aircraft water cooled?

During the Space Shuttle Columbia Disaster of 2003, Why Did The Flight Director Say, "Lock the doors."?

What word can be used to describe a bug in a movie?

Y2K... in 2019?

What are these two characters marked red? い_める and いじめ_いよォ?

TColorBox and Tikz, arrows are not on the diagram

Why couldn't soldiers sight their own weapons without officers' orders?

'sudo apt-get update' get a warning

show stdout containing n with line breaks



How can I query Solr to get high scores for documents with more similarity?


How to configure Solr to use Levenshtein approximate string matching?is Lucene boolean OR queries scoring biased towards the term having more alternativesWhat does “documents” and “cores” mean in SOLR? And how can i use them?Single-word searches in Solr are not accurateSolr query: prefer phrase over occurrence of single words, but accept bothSolr: Scoring exact matches higher than partial matchesHow to enforce stable result scores in SOLR? By defining fixed docFreq?Solr configuration for scored searchSOLR (4.3) - reducing score of “poor” quality (very short) documentsSolr querying nested documents with ChildDocTransformerFactory, get “Parent query yields document which is not matched by parents filter”How to add a new query listener via the SolR Config API?






.everyoneloves__top-leaderboard:empty,.everyoneloves__mid-leaderboard:empty,.everyoneloves__bot-mid-leaderboard:empty margin-bottom:0;








1















I have a Solr core populated from products table in an ecom database. "Title" is a field in the core. Which is the best approach to get proper score/ranking? For example, if someone searches for "iPhone", the product with title "iPhone 6s" should get a score higher than one with "Beautiful case for iPhone 6, iPhone 7 and similar iPhones". Does Solr have some variation of levenstein distance which may help? Solr Version is 7.3. Clarification - both the values I mentioned, "iPhone 6s" and "Beautiful case for iPhone 6, iPhone 7 and similar iPhones" are in the same field - Title. If we take levenstein distance, "iPhone 6s" is closer to "iPhone" than "Beautiful case for iPhone 6, iPhone 7 and similar iPhones". Can this fact be put to use so that iPhoe 6s comes before the other one?










share|improve this question


























  • Have a look at the answers to stackoverflow.com/questions/1752301/…

    – kellyfj
    Mar 26 at 14:12











  • You can give different "boost" values to each field, for example: q=iphone&qf=title^10+description&defType=edismax would give a bigger boost to matches on title vs matches on the description.

    – Hector Correa
    Mar 26 at 15:45











  • You can start by setting omitNorms and if you're not doing phrase searches, omitTermFreqAndPositions on the field - this will disable any scoring based on field length and the number of occurences.

    – MatsLindh
    Mar 26 at 20:03






  • 1





    @HectorCorrea - both values appear in Title column. Vendors keep repeating a term like "iPhone" many times in Title hoping that it will give them an edge in Search Engine Optimization.

    – Jayadevan
    Mar 27 at 7:32











  • @MatsLindh - why to omit norms? in such cases norms will actually help since they will normalize the field score by the length of the field.

    – Tomer Arazy
    Mar 27 at 15:19

















1















I have a Solr core populated from products table in an ecom database. "Title" is a field in the core. Which is the best approach to get proper score/ranking? For example, if someone searches for "iPhone", the product with title "iPhone 6s" should get a score higher than one with "Beautiful case for iPhone 6, iPhone 7 and similar iPhones". Does Solr have some variation of levenstein distance which may help? Solr Version is 7.3. Clarification - both the values I mentioned, "iPhone 6s" and "Beautiful case for iPhone 6, iPhone 7 and similar iPhones" are in the same field - Title. If we take levenstein distance, "iPhone 6s" is closer to "iPhone" than "Beautiful case for iPhone 6, iPhone 7 and similar iPhones". Can this fact be put to use so that iPhoe 6s comes before the other one?










share|improve this question


























  • Have a look at the answers to stackoverflow.com/questions/1752301/…

    – kellyfj
    Mar 26 at 14:12











  • You can give different "boost" values to each field, for example: q=iphone&qf=title^10+description&defType=edismax would give a bigger boost to matches on title vs matches on the description.

    – Hector Correa
    Mar 26 at 15:45











  • You can start by setting omitNorms and if you're not doing phrase searches, omitTermFreqAndPositions on the field - this will disable any scoring based on field length and the number of occurences.

    – MatsLindh
    Mar 26 at 20:03






  • 1





    @HectorCorrea - both values appear in Title column. Vendors keep repeating a term like "iPhone" many times in Title hoping that it will give them an edge in Search Engine Optimization.

    – Jayadevan
    Mar 27 at 7:32











  • @MatsLindh - why to omit norms? in such cases norms will actually help since they will normalize the field score by the length of the field.

    – Tomer Arazy
    Mar 27 at 15:19













1












1








1








I have a Solr core populated from products table in an ecom database. "Title" is a field in the core. Which is the best approach to get proper score/ranking? For example, if someone searches for "iPhone", the product with title "iPhone 6s" should get a score higher than one with "Beautiful case for iPhone 6, iPhone 7 and similar iPhones". Does Solr have some variation of levenstein distance which may help? Solr Version is 7.3. Clarification - both the values I mentioned, "iPhone 6s" and "Beautiful case for iPhone 6, iPhone 7 and similar iPhones" are in the same field - Title. If we take levenstein distance, "iPhone 6s" is closer to "iPhone" than "Beautiful case for iPhone 6, iPhone 7 and similar iPhones". Can this fact be put to use so that iPhoe 6s comes before the other one?










share|improve this question
















I have a Solr core populated from products table in an ecom database. "Title" is a field in the core. Which is the best approach to get proper score/ranking? For example, if someone searches for "iPhone", the product with title "iPhone 6s" should get a score higher than one with "Beautiful case for iPhone 6, iPhone 7 and similar iPhones". Does Solr have some variation of levenstein distance which may help? Solr Version is 7.3. Clarification - both the values I mentioned, "iPhone 6s" and "Beautiful case for iPhone 6, iPhone 7 and similar iPhones" are in the same field - Title. If we take levenstein distance, "iPhone 6s" is closer to "iPhone" than "Beautiful case for iPhone 6, iPhone 7 and similar iPhones". Can this fact be put to use so that iPhoe 6s comes before the other one?







solr levenshtein-distance






share|improve this question















share|improve this question













share|improve this question




share|improve this question








edited Mar 27 at 7:30







Jayadevan

















asked Mar 26 at 13:19









JayadevanJayadevan

7807 silver badges20 bronze badges




7807 silver badges20 bronze badges















  • Have a look at the answers to stackoverflow.com/questions/1752301/…

    – kellyfj
    Mar 26 at 14:12











  • You can give different "boost" values to each field, for example: q=iphone&qf=title^10+description&defType=edismax would give a bigger boost to matches on title vs matches on the description.

    – Hector Correa
    Mar 26 at 15:45











  • You can start by setting omitNorms and if you're not doing phrase searches, omitTermFreqAndPositions on the field - this will disable any scoring based on field length and the number of occurences.

    – MatsLindh
    Mar 26 at 20:03






  • 1





    @HectorCorrea - both values appear in Title column. Vendors keep repeating a term like "iPhone" many times in Title hoping that it will give them an edge in Search Engine Optimization.

    – Jayadevan
    Mar 27 at 7:32











  • @MatsLindh - why to omit norms? in such cases norms will actually help since they will normalize the field score by the length of the field.

    – Tomer Arazy
    Mar 27 at 15:19

















  • Have a look at the answers to stackoverflow.com/questions/1752301/…

    – kellyfj
    Mar 26 at 14:12











  • You can give different "boost" values to each field, for example: q=iphone&qf=title^10+description&defType=edismax would give a bigger boost to matches on title vs matches on the description.

    – Hector Correa
    Mar 26 at 15:45











  • You can start by setting omitNorms and if you're not doing phrase searches, omitTermFreqAndPositions on the field - this will disable any scoring based on field length and the number of occurences.

    – MatsLindh
    Mar 26 at 20:03






  • 1





    @HectorCorrea - both values appear in Title column. Vendors keep repeating a term like "iPhone" many times in Title hoping that it will give them an edge in Search Engine Optimization.

    – Jayadevan
    Mar 27 at 7:32











  • @MatsLindh - why to omit norms? in such cases norms will actually help since they will normalize the field score by the length of the field.

    – Tomer Arazy
    Mar 27 at 15:19
















Have a look at the answers to stackoverflow.com/questions/1752301/…

– kellyfj
Mar 26 at 14:12





Have a look at the answers to stackoverflow.com/questions/1752301/…

– kellyfj
Mar 26 at 14:12













You can give different "boost" values to each field, for example: q=iphone&qf=title^10+description&defType=edismax would give a bigger boost to matches on title vs matches on the description.

– Hector Correa
Mar 26 at 15:45





You can give different "boost" values to each field, for example: q=iphone&qf=title^10+description&defType=edismax would give a bigger boost to matches on title vs matches on the description.

– Hector Correa
Mar 26 at 15:45













You can start by setting omitNorms and if you're not doing phrase searches, omitTermFreqAndPositions on the field - this will disable any scoring based on field length and the number of occurences.

– MatsLindh
Mar 26 at 20:03





You can start by setting omitNorms and if you're not doing phrase searches, omitTermFreqAndPositions on the field - this will disable any scoring based on field length and the number of occurences.

– MatsLindh
Mar 26 at 20:03




1




1





@HectorCorrea - both values appear in Title column. Vendors keep repeating a term like "iPhone" many times in Title hoping that it will give them an edge in Search Engine Optimization.

– Jayadevan
Mar 27 at 7:32





@HectorCorrea - both values appear in Title column. Vendors keep repeating a term like "iPhone" many times in Title hoping that it will give them an edge in Search Engine Optimization.

– Jayadevan
Mar 27 at 7:32













@MatsLindh - why to omit norms? in such cases norms will actually help since they will normalize the field score by the length of the field.

– Tomer Arazy
Mar 27 at 15:19





@MatsLindh - why to omit norms? in such cases norms will actually help since they will normalize the field score by the length of the field.

– Tomer Arazy
Mar 27 at 15:19












0






active

oldest

votes










Your Answer






StackExchange.ifUsing("editor", function ()
StackExchange.using("externalEditor", function ()
StackExchange.using("snippets", function ()
StackExchange.snippets.init();
);
);
, "code-snippets");

StackExchange.ready(function()
var channelOptions =
tags: "".split(" "),
id: "1"
;
initTagRenderer("".split(" "), "".split(" "), channelOptions);

StackExchange.using("externalEditor", function()
// Have to fire editor after snippets, if snippets enabled
if (StackExchange.settings.snippets.snippetsEnabled)
StackExchange.using("snippets", function()
createEditor();
);

else
createEditor();

);

function createEditor()
StackExchange.prepareEditor(
heartbeatType: 'answer',
autoActivateHeartbeat: false,
convertImagesToLinks: true,
noModals: true,
showLowRepImageUploadWarning: true,
reputationToPostImages: 10,
bindNavPrevention: true,
postfix: "",
imageUploader:
brandingHtml: "Powered by u003ca class="icon-imgur-white" href="https://imgur.com/"u003eu003c/au003e",
contentPolicyHtml: "User contributions licensed under u003ca href="https://creativecommons.org/licenses/by-sa/3.0/"u003ecc by-sa 3.0 with attribution requiredu003c/au003e u003ca href="https://stackoverflow.com/legal/content-policy"u003e(content policy)u003c/au003e",
allowUrls: true
,
onDemand: true,
discardSelector: ".discard-answer"
,immediatelyShowMarkdownHelp:true
);



);













draft saved

draft discarded


















StackExchange.ready(
function ()
StackExchange.openid.initPostLogin('.new-post-login', 'https%3a%2f%2fstackoverflow.com%2fquestions%2f55358183%2fhow-can-i-query-solr-to-get-high-scores-for-documents-with-more-similarity%23new-answer', 'question_page');

);

Post as a guest















Required, but never shown

























0






active

oldest

votes








0






active

oldest

votes









active

oldest

votes






active

oldest

votes




Is this question similar to what you get asked at work? Learn more about asking and sharing private information with your coworkers using Stack Overflow for Teams.







Is this question similar to what you get asked at work? Learn more about asking and sharing private information with your coworkers using Stack Overflow for Teams.



















draft saved

draft discarded
















































Thanks for contributing an answer to Stack Overflow!


  • Please be sure to answer the question. Provide details and share your research!

But avoid


  • Asking for help, clarification, or responding to other answers.

  • Making statements based on opinion; back them up with references or personal experience.

To learn more, see our tips on writing great answers.




draft saved


draft discarded














StackExchange.ready(
function ()
StackExchange.openid.initPostLogin('.new-post-login', 'https%3a%2f%2fstackoverflow.com%2fquestions%2f55358183%2fhow-can-i-query-solr-to-get-high-scores-for-documents-with-more-similarity%23new-answer', 'question_page');

);

Post as a guest















Required, but never shown





















































Required, but never shown














Required, but never shown












Required, but never shown







Required, but never shown

































Required, but never shown














Required, but never shown












Required, but never shown







Required, but never shown







Popular posts from this blog

Kamusi Yaliyomo Aina za kamusi | Muundo wa kamusi | Faida za kamusi | Dhima ya picha katika kamusi | Marejeo | Tazama pia | Viungo vya nje | UrambazajiKuhusu kamusiGo-SwahiliWiki-KamusiKamusi ya Kiswahili na Kiingerezakuihariri na kuongeza habari

Swift 4 - func physicsWorld not invoked on collision? The Next CEO of Stack OverflowHow to call Objective-C code from Swift#ifdef replacement in the Swift language@selector() in Swift?#pragma mark in Swift?Swift for loop: for index, element in array?dispatch_after - GCD in Swift?Swift Beta performance: sorting arraysSplit a String into an array in Swift?The use of Swift 3 @objc inference in Swift 4 mode is deprecated?How to optimize UITableViewCell, because my UITableView lags

Access current req object everywhere in Node.js ExpressWhy are global variables considered bad practice? (node.js)Using req & res across functionsHow do I get the path to the current script with Node.js?What is Node.js' Connect, Express and “middleware”?Node.js w/ express error handling in callbackHow to access the GET parameters after “?” in Express?Modify Node.js req object parametersAccess “app” variable inside of ExpressJS/ConnectJS middleware?Node.js Express app - request objectAngular Http Module considered middleware?Session variables in ExpressJSAdd properties to the req object in expressjs with Typescript