How to use string features in random forest for real-time big dataRandom forest on a big datasetRandom Forest by R package party overfits on random dataHow to tackle classification with string features?Using KDDCup 99 Data with Spark MLLib RandomForestHow to handle categorical features for Decision Tree, Random Forest in spark ml?Feature selection using Random forest with importance / varImp functions with factor variablesReal time Openscoring Bad request for Linear Regression modelWeka Random Forest model file size is too bigIs Isolation Forest (iForest) a method that could be directly applied to Big Data?How to implement random forest algorithm from scratch in C++

How to search for Android apps without ads?

What do I need to do, tax-wise, for a sudden windfall?

What did the 8086 (and 8088) do upon encountering an illegal instruction?

French citizen, did I need a visa in 2004 and 2006 when I visited the US as a child?

Why do the “Shtei HaLechem” not play a prominent part in the davenning for Shavuos?

Is it ethical to cite a reviewer's papers even if they are rather irrelevant?

My parents claim they cannot pay for my college education; what are my options?

Why is C++ template use not recommended in space/radiated environment?

Jam with honey & without pectin has a saucy consistency always

Interview was just a one hour panel. Got an offer the next day; do I accept or is this a red flag?

What does the "titan" monster tag mean?

What publication claimed that Michael Jackson died in a nuclear holocaust?

Can I attach a DC blower to intake manifold of my 150CC Yamaha FZS FI engine?

Print "N NE E SE S SW W NW"

What is the color associated with lukewarm?

Why is it bad to use your whole foot in rock climbing

Is it possible to install Firefox on Ubuntu with no desktop enviroment?

Optimising matrix generation time

Why can't we feel the Earth's revolution?

Fastest way from 10 to 1 with everyone in between

Opposite of "Concerto Grosso"?

Commencez à vous connecter -- I don't understand the phrasing of this

Is this equation correct? And if so, is this famous?

Why is Skinner so awkward in Hot Fuzz?



How to use string features in random forest for real-time big data


Random forest on a big datasetRandom Forest by R package party overfits on random dataHow to tackle classification with string features?Using KDDCup 99 Data with Spark MLLib RandomForestHow to handle categorical features for Decision Tree, Random Forest in spark ml?Feature selection using Random forest with importance / varImp functions with factor variablesReal time Openscoring Bad request for Linear Regression modelWeka Random Forest model file size is too bigIs Isolation Forest (iForest) a method that could be directly applied to Big Data?How to implement random forest algorithm from scratch in C++






.everyoneloves__top-leaderboard:empty,.everyoneloves__mid-leaderboard:empty,.everyoneloves__bot-mid-leaderboard:empty height:90px;width:728px;box-sizing:border-box;








0















I have a huge dataset that needs to run a binary classification on it. Some features in the dataset are string, so cannot use them without converting to numeric values. I tried fit_transform and applied the RandomForest after and worked properly.
However, we are implementing a real-time system that time is a big issue! fit_transform is time-consuming. Any idea of how I can use string values or other libraries to convert string to digit as quickly as possible?
I also have access to Spark so if MLlib has something that can help please let me know!










share|improve this question




























    0















    I have a huge dataset that needs to run a binary classification on it. Some features in the dataset are string, so cannot use them without converting to numeric values. I tried fit_transform and applied the RandomForest after and worked properly.
    However, we are implementing a real-time system that time is a big issue! fit_transform is time-consuming. Any idea of how I can use string values or other libraries to convert string to digit as quickly as possible?
    I also have access to Spark so if MLlib has something that can help please let me know!










    share|improve this question
























      0












      0








      0








      I have a huge dataset that needs to run a binary classification on it. Some features in the dataset are string, so cannot use them without converting to numeric values. I tried fit_transform and applied the RandomForest after and worked properly.
      However, we are implementing a real-time system that time is a big issue! fit_transform is time-consuming. Any idea of how I can use string values or other libraries to convert string to digit as quickly as possible?
      I also have access to Spark so if MLlib has something that can help please let me know!










      share|improve this question














      I have a huge dataset that needs to run a binary classification on it. Some features in the dataset are string, so cannot use them without converting to numeric values. I tried fit_transform and applied the RandomForest after and worked properly.
      However, we are implementing a real-time system that time is a big issue! fit_transform is time-consuming. Any idea of how I can use string values or other libraries to convert string to digit as quickly as possible?
      I also have access to Spark so if MLlib has something that can help please let me know!







      bigdata real-time random-forest






      share|improve this question













      share|improve this question











      share|improve this question




      share|improve this question










      asked Mar 25 at 1:09









      Neda EbrahimiNeda Ebrahimi

      104




      104






















          0






          active

          oldest

          votes












          Your Answer






          StackExchange.ifUsing("editor", function ()
          StackExchange.using("externalEditor", function ()
          StackExchange.using("snippets", function ()
          StackExchange.snippets.init();
          );
          );
          , "code-snippets");

          StackExchange.ready(function()
          var channelOptions =
          tags: "".split(" "),
          id: "1"
          ;
          initTagRenderer("".split(" "), "".split(" "), channelOptions);

          StackExchange.using("externalEditor", function()
          // Have to fire editor after snippets, if snippets enabled
          if (StackExchange.settings.snippets.snippetsEnabled)
          StackExchange.using("snippets", function()
          createEditor();
          );

          else
          createEditor();

          );

          function createEditor()
          StackExchange.prepareEditor(
          heartbeatType: 'answer',
          autoActivateHeartbeat: false,
          convertImagesToLinks: true,
          noModals: true,
          showLowRepImageUploadWarning: true,
          reputationToPostImages: 10,
          bindNavPrevention: true,
          postfix: "",
          imageUploader:
          brandingHtml: "Powered by u003ca class="icon-imgur-white" href="https://imgur.com/"u003eu003c/au003e",
          contentPolicyHtml: "User contributions licensed under u003ca href="https://creativecommons.org/licenses/by-sa/3.0/"u003ecc by-sa 3.0 with attribution requiredu003c/au003e u003ca href="https://stackoverflow.com/legal/content-policy"u003e(content policy)u003c/au003e",
          allowUrls: true
          ,
          onDemand: true,
          discardSelector: ".discard-answer"
          ,immediatelyShowMarkdownHelp:true
          );



          );













          draft saved

          draft discarded


















          StackExchange.ready(
          function ()
          StackExchange.openid.initPostLogin('.new-post-login', 'https%3a%2f%2fstackoverflow.com%2fquestions%2f55330128%2fhow-to-use-string-features-in-random-forest-for-real-time-big-data%23new-answer', 'question_page');

          );

          Post as a guest















          Required, but never shown

























          0






          active

          oldest

          votes








          0






          active

          oldest

          votes









          active

          oldest

          votes






          active

          oldest

          votes















          draft saved

          draft discarded
















































          Thanks for contributing an answer to Stack Overflow!


          • Please be sure to answer the question. Provide details and share your research!

          But avoid


          • Asking for help, clarification, or responding to other answers.

          • Making statements based on opinion; back them up with references or personal experience.

          To learn more, see our tips on writing great answers.




          draft saved


          draft discarded














          StackExchange.ready(
          function ()
          StackExchange.openid.initPostLogin('.new-post-login', 'https%3a%2f%2fstackoverflow.com%2fquestions%2f55330128%2fhow-to-use-string-features-in-random-forest-for-real-time-big-data%23new-answer', 'question_page');

          );

          Post as a guest















          Required, but never shown





















































          Required, but never shown














          Required, but never shown












          Required, but never shown







          Required, but never shown

































          Required, but never shown














          Required, but never shown












          Required, but never shown







          Required, but never shown







          Popular posts from this blog

          SQL error code 1064 with creating Laravel foreign keysForeign key constraints: When to use ON UPDATE and ON DELETEDropping column with foreign key Laravel error: General error: 1025 Error on renameLaravel SQL Can't create tableLaravel Migration foreign key errorLaravel php artisan migrate:refresh giving a syntax errorSQLSTATE[42S01]: Base table or view already exists or Base table or view already exists: 1050 Tableerror in migrating laravel file to xampp serverSyntax error or access violation: 1064:syntax to use near 'unsigned not null, modelName varchar(191) not null, title varchar(191) not nLaravel cannot create new table field in mysqlLaravel 5.7:Last migration creates table but is not registered in the migration table

          용인 삼성생명 블루밍스 목차 통계 역대 감독 선수단 응원단 경기장 같이 보기 외부 링크 둘러보기 메뉴samsungblueminx.comeh선수 명단용인 삼성생명 블루밍스용인 삼성생명 블루밍스ehsamsungblueminx.comeheheheh

          155 수학 과학 기타 둘러보기 메뉴eh추가해eh문서를 완성해