How can I find nearest neighbors of points in a data frame from another data frame

















I want to find the k nearest neighbors of every point in dataframe A among the points in dataframe B. How can I do that?
It seems that sklearn.neighbors.NearestNeighbors takes only one set of data and just one query point, as in this example from the scikit-learn documentation:

from sklearn.neighbors import NearestNeighbors

samples = [[0., 0., 0.], [0., .5, 0.], [1., 1., .5]]
neigh = NearestNeighbors(n_neighbors=1)
neigh.fit(samples)  # the set the neighbors are drawn from

print(neigh.kneighbors([[1., 1., 1.]]))  # a single query point

Instead of the single query point [[1., 1., 1.]], I want to pass a data frame with more than one query point.

P.S. I want the metric to be Mahalanobis, which sklearn.neighbors.KDTree does not accept, and scipy.spatial.KDTree does not have a metric option at all.










Tags: python, pandas, dataframe, knn

asked Mar 27 at 0:31 by No Lie (edited Mar 27 at 1:41)

























          1 Answer














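          import numpy as np
          import pandas as pd
          from scipy.spatial import KDTree

          # Two example frames: 100 points each, 100 features per point
          # (pd.np is no longer available in current pandas; use numpy directly)
          dataA = pd.DataFrame(np.random.rand(100, 100))
          dataB = pd.DataFrame(np.random.rand(100, 100))

          # Build the tree on B, then query it with every row of A at once
          kdB = KDTree(dataB.values)
          distances, indices = kdB.query(dataA.values, k=3)  # k = desired number of neighbors
          print(indices)  # row i holds the positions in B of the 3 nearest neighbors of row i of A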
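
The same multi-point query also seems to work with the scikit-learn class from the question: as far as I can tell, kneighbors accepts a 2-D array with one query point per row, so the whole of A can be passed in one call after fitting on B. A minimal sketch, assuming small random frames (the column names and sizes are made up for illustration):

```python
import numpy as np
import pandas as pd
from sklearn.neighbors import NearestNeighbors

rng = np.random.default_rng(0)
# Hypothetical example data: B holds the candidate points, A holds the query points
dataB = pd.DataFrame(rng.random((50, 3)), columns=["x", "y", "z"])
dataA = pd.DataFrame(rng.random((5, 3)), columns=["x", "y", "z"])

# Fit on B only; B is the set the neighbors are drawn from
neigh = NearestNeighbors(n_neighbors=3).fit(dataB.values)

# kneighbors takes a 2-D array, one query point per row, so all of A goes in at once
distances, positions = neigh.kneighbors(dataA.values)

# positions are row positions in B; map them to B's index labels and keep A's index
neighbors = pd.DataFrame(dataB.index.values[positions], index=dataA.index)
print(neighbors.shape)  # (5, 3): one row per point in A, one column per neighbor
```

The default metric here is Euclidean, so this only covers the "more than one query point" part of the question, not the Mahalanobis requirement.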





          answered Mar 27 at 0:44 by bubble















          • Thank you @bubble, but I need the metric to be Mahalanobis.
            – No Lie, Mar 27 at 1:41

          • Maybe BallTree would be useful; it supports custom distance metrics.
            – bubble, Mar 27 at 2:06
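
To make the BallTree suggestion concrete, here is a rough sketch with sklearn.neighbors.BallTree and its built-in "mahalanobis" metric. It is not the answerer's code. It assumes the covariance matrix is estimated from B (any other estimate could be passed instead), and the frames are random data with many more rows than columns so that the covariance is invertible:

```python
import numpy as np
import pandas as pd
from sklearn.neighbors import BallTree

rng = np.random.default_rng(0)
# Hypothetical example data: 200 candidate points in B, 10 query points in A
dataB = pd.DataFrame(rng.random((200, 4)))
dataA = pd.DataFrame(rng.random((10, 4)))

# Assumption: the covariance is estimated from B; rows are observations
cov = np.cov(dataB.values, rowvar=False)

# Extra keyword arguments are forwarded to the metric; Mahalanobis takes V (the covariance)
tree = BallTree(dataB.values, metric="mahalanobis", V=cov)

# Query with every row of A at once; results are sorted from nearest to farthest
distances, positions = tree.query(dataA.values, k=3)
print(positions.shape)  # (10, 3)
```

As with KDTree, positions are row positions in B, so dataB.index[positions[i]] gives the index labels of the neighbors of row i of A.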
















