Visualizing High Dimensional Data at onceHow to generate legible plots in pandas when looping over columns?Why can't Python parse this JSON data?matplotlib Legend Markers Only OnceHow to define a two-dimensional array in PythonHow do I write JSON data to a file?“Large data” work flows using pandasShowing data and model predictions in one plot using Seaborn and Statsmodelsadjust colorbar to the same height with image in pyplotpython multiple plots for numpy arrayIssue with xticklabels when saving a figure with matplotlibBased on the visualised plots, which features data matrix is better for PCA, “X_scaled” or “X”, Why?

Word for Food that's Gone 'Bad', but is Still Edible?

How can I roleplay a follower-type character when I as a player have a leader-type personality?

I'm in your subnets, golfing your code

My advisor talks about me to his colleague

Are the Night's Watch still required?

Refinish or replace an old staircase

Why does sound not move through a wall?

What does this wavy downward arrow preceding a piano chord mean?

How long would it take for people to notice a mass disappearance?

What does 'made on' mean here?

Why did the Apollo 13 crew extend the LM landing gear?

Why aren't nationalizations in Russia described as socialist?

Does it make sense for a function to return an rvalue reference?

Identifying characters

How can I support myself financially as a 17 year old with a loan?

Is “snitty” a popular American English term? What is its origin?

Can I use a fetch land to shuffle my deck while the opponent has Ashiok, Dream Render in play?

US born but as a child of foreign diplomat

Where are the "shires" in the UK?

What are the differences between credential stuffing and password spraying?

Something that can be activated/enabled

Is there an idiom that support the idea that "inflation is bad"?

Can my company stop me from working overtime?

What exactly are the `size issues' preventing formation of presheaves being a left adjoint to some forgetful functor?



Visualizing High Dimensional Data at once


How to generate legible plots in pandas when looping over columns?Why can't Python parse this JSON data?matplotlib Legend Markers Only OnceHow to define a two-dimensional array in PythonHow do I write JSON data to a file?“Large data” work flows using pandasShowing data and model predictions in one plot using Seaborn and Statsmodelsadjust colorbar to the same height with image in pyplotpython multiple plots for numpy arrayIssue with xticklabels when saving a figure with matplotlibBased on the visualised plots, which features data matrix is better for PCA, “X_scaled” or “X”, Why?






.everyoneloves__top-leaderboard:empty,.everyoneloves__mid-leaderboard:empty,.everyoneloves__bot-mid-leaderboard:empty height:90px;width:728px;box-sizing:border-box;








-1















I am trying to visualize my data, so that I can view all 200 columns one by one. I would like to have a more intuitive way to understand my data.



I have tried google but didn't get any help in visualizing high dimensional data. People say to use PCA but I want to visualize my raw data in columns.



Data set Link



My code



x0=df[df["target"]==0]
x1=df[df["target"]==1]

x0_100=x0[1:300]
x1_100=x1[1:300]
x=x1_100.append(x0_100)
y=x["target"]
x=x.drop("target",axis=1)

import matplotlib.pyplot as plt
fig = plt.figure(figsize = (60, 60))
j = 0
for i in x:
plt.subplot(51,4, j+1)
j += 1
sns.boxplot(x=y,y=x[i])


I am getting this type of small figures really difficult to understand










share|improve this question
























  • First, please provide a data test case to help us inderstand your particular case. Second, you absolutly need to tell us what do you want to see ? Your problem looks similar to the initial problem here (stackoverflow.com/a/55266943/7237062). In either way, when there is too much data, it seems to me to be a batter idea to use math/stats tools to peform some analysis for you first (see provided link; one can think of statistical tests, clustering, or whatever)

    – LoneWanderer
    Mar 23 at 11:19











  • here i am trying to visualize boxplot of every coloum.

    – Sohaib Anwaar
    Mar 23 at 11:54

















-1















I am trying to visualize my data, so that I can view all 200 columns one by one. I would like to have a more intuitive way to understand my data.



I have tried google but didn't get any help in visualizing high dimensional data. People say to use PCA but I want to visualize my raw data in columns.



Data set Link



My code



x0=df[df["target"]==0]
x1=df[df["target"]==1]

x0_100=x0[1:300]
x1_100=x1[1:300]
x=x1_100.append(x0_100)
y=x["target"]
x=x.drop("target",axis=1)

import matplotlib.pyplot as plt
fig = plt.figure(figsize = (60, 60))
j = 0
for i in x:
plt.subplot(51,4, j+1)
j += 1
sns.boxplot(x=y,y=x[i])


I am getting this type of small figures really difficult to understand










share|improve this question
























  • First, please provide a data test case to help us inderstand your particular case. Second, you absolutly need to tell us what do you want to see ? Your problem looks similar to the initial problem here (stackoverflow.com/a/55266943/7237062). In either way, when there is too much data, it seems to me to be a batter idea to use math/stats tools to peform some analysis for you first (see provided link; one can think of statistical tests, clustering, or whatever)

    – LoneWanderer
    Mar 23 at 11:19











  • here i am trying to visualize boxplot of every coloum.

    – Sohaib Anwaar
    Mar 23 at 11:54













-1












-1








-1








I am trying to visualize my data, so that I can view all 200 columns one by one. I would like to have a more intuitive way to understand my data.



I have tried google but didn't get any help in visualizing high dimensional data. People say to use PCA but I want to visualize my raw data in columns.



Data set Link



My code



x0=df[df["target"]==0]
x1=df[df["target"]==1]

x0_100=x0[1:300]
x1_100=x1[1:300]
x=x1_100.append(x0_100)
y=x["target"]
x=x.drop("target",axis=1)

import matplotlib.pyplot as plt
fig = plt.figure(figsize = (60, 60))
j = 0
for i in x:
plt.subplot(51,4, j+1)
j += 1
sns.boxplot(x=y,y=x[i])


I am getting this type of small figures really difficult to understand










share|improve this question
















I am trying to visualize my data, so that I can view all 200 columns one by one. I would like to have a more intuitive way to understand my data.



I have tried google but didn't get any help in visualizing high dimensional data. People say to use PCA but I want to visualize my raw data in columns.



Data set Link



My code



x0=df[df["target"]==0]
x1=df[df["target"]==1]

x0_100=x0[1:300]
x1_100=x1[1:300]
x=x1_100.append(x0_100)
y=x["target"]
x=x.drop("target",axis=1)

import matplotlib.pyplot as plt
fig = plt.figure(figsize = (60, 60))
j = 0
for i in x:
plt.subplot(51,4, j+1)
j += 1
sns.boxplot(x=y,y=x[i])


I am getting this type of small figures really difficult to understand







python matplotlib visualization seaborn






share|improve this question















share|improve this question













share|improve this question




share|improve this question








edited Apr 19 at 8:10









marc_s

588k13011281275




588k13011281275










asked Mar 22 at 23:36









Sohaib AnwaarSohaib Anwaar

559




559












  • First, please provide a data test case to help us inderstand your particular case. Second, you absolutly need to tell us what do you want to see ? Your problem looks similar to the initial problem here (stackoverflow.com/a/55266943/7237062). In either way, when there is too much data, it seems to me to be a batter idea to use math/stats tools to peform some analysis for you first (see provided link; one can think of statistical tests, clustering, or whatever)

    – LoneWanderer
    Mar 23 at 11:19











  • here i am trying to visualize boxplot of every coloum.

    – Sohaib Anwaar
    Mar 23 at 11:54

















  • First, please provide a data test case to help us inderstand your particular case. Second, you absolutly need to tell us what do you want to see ? Your problem looks similar to the initial problem here (stackoverflow.com/a/55266943/7237062). In either way, when there is too much data, it seems to me to be a batter idea to use math/stats tools to peform some analysis for you first (see provided link; one can think of statistical tests, clustering, or whatever)

    – LoneWanderer
    Mar 23 at 11:19











  • here i am trying to visualize boxplot of every coloum.

    – Sohaib Anwaar
    Mar 23 at 11:54
















First, please provide a data test case to help us inderstand your particular case. Second, you absolutly need to tell us what do you want to see ? Your problem looks similar to the initial problem here (stackoverflow.com/a/55266943/7237062). In either way, when there is too much data, it seems to me to be a batter idea to use math/stats tools to peform some analysis for you first (see provided link; one can think of statistical tests, clustering, or whatever)

– LoneWanderer
Mar 23 at 11:19





First, please provide a data test case to help us inderstand your particular case. Second, you absolutly need to tell us what do you want to see ? Your problem looks similar to the initial problem here (stackoverflow.com/a/55266943/7237062). In either way, when there is too much data, it seems to me to be a batter idea to use math/stats tools to peform some analysis for you first (see provided link; one can think of statistical tests, clustering, or whatever)

– LoneWanderer
Mar 23 at 11:19













here i am trying to visualize boxplot of every coloum.

– Sohaib Anwaar
Mar 23 at 11:54





here i am trying to visualize boxplot of every coloum.

– Sohaib Anwaar
Mar 23 at 11:54












1 Answer
1






active

oldest

votes


















1














I would recommend plotting two different box plots with a hundred columns on each:



import numpy as np; np.random.seed(0)
import pandas as pd
import cufflinks as cf

df = cf.datagen.box(200)

df.iloc[:,0:100].plot(kind='box', rot=90, figsize=(14, 8))
plt.tight_layout()

df.iloc[:,100:].plot(kind='box', rot=90, figsize=(14, 8))
plt.tight_layout()


Box plot 1Box plot 2






share|improve this answer























  • But I want to see according to the label values. Boxplot having Target label 1 and Target label 0. By the way that good idea 100 100 coloums but not full filling my problem. Because I have Imbalance data

    – Sohaib Anwaar
    Mar 23 at 0:51











  • Can you include some sample data to help me better understand what you mean? I cannot run your code without sample data, and I can’t tell what your data structure is.

    – Nathaniel
    Mar 23 at 1:14











Your Answer






StackExchange.ifUsing("editor", function ()
StackExchange.using("externalEditor", function ()
StackExchange.using("snippets", function ()
StackExchange.snippets.init();
);
);
, "code-snippets");

StackExchange.ready(function()
var channelOptions =
tags: "".split(" "),
id: "1"
;
initTagRenderer("".split(" "), "".split(" "), channelOptions);

StackExchange.using("externalEditor", function()
// Have to fire editor after snippets, if snippets enabled
if (StackExchange.settings.snippets.snippetsEnabled)
StackExchange.using("snippets", function()
createEditor();
);

else
createEditor();

);

function createEditor()
StackExchange.prepareEditor(
heartbeatType: 'answer',
autoActivateHeartbeat: false,
convertImagesToLinks: true,
noModals: true,
showLowRepImageUploadWarning: true,
reputationToPostImages: 10,
bindNavPrevention: true,
postfix: "",
imageUploader:
brandingHtml: "Powered by u003ca class="icon-imgur-white" href="https://imgur.com/"u003eu003c/au003e",
contentPolicyHtml: "User contributions licensed under u003ca href="https://creativecommons.org/licenses/by-sa/3.0/"u003ecc by-sa 3.0 with attribution requiredu003c/au003e u003ca href="https://stackoverflow.com/legal/content-policy"u003e(content policy)u003c/au003e",
allowUrls: true
,
onDemand: true,
discardSelector: ".discard-answer"
,immediatelyShowMarkdownHelp:true
);



);













draft saved

draft discarded


















StackExchange.ready(
function ()
StackExchange.openid.initPostLogin('.new-post-login', 'https%3a%2f%2fstackoverflow.com%2fquestions%2f55309096%2fvisualizing-high-dimensional-data-at-once%23new-answer', 'question_page');

);

Post as a guest















Required, but never shown

























1 Answer
1






active

oldest

votes








1 Answer
1






active

oldest

votes









active

oldest

votes






active

oldest

votes









1














I would recommend plotting two different box plots with a hundred columns on each:



import numpy as np; np.random.seed(0)
import pandas as pd
import cufflinks as cf

df = cf.datagen.box(200)

df.iloc[:,0:100].plot(kind='box', rot=90, figsize=(14, 8))
plt.tight_layout()

df.iloc[:,100:].plot(kind='box', rot=90, figsize=(14, 8))
plt.tight_layout()


Box plot 1Box plot 2






share|improve this answer























  • But I want to see according to the label values. Boxplot having Target label 1 and Target label 0. By the way that good idea 100 100 coloums but not full filling my problem. Because I have Imbalance data

    – Sohaib Anwaar
    Mar 23 at 0:51











  • Can you include some sample data to help me better understand what you mean? I cannot run your code without sample data, and I can’t tell what your data structure is.

    – Nathaniel
    Mar 23 at 1:14















1














I would recommend plotting two different box plots with a hundred columns on each:



import numpy as np; np.random.seed(0)
import pandas as pd
import cufflinks as cf

df = cf.datagen.box(200)

df.iloc[:,0:100].plot(kind='box', rot=90, figsize=(14, 8))
plt.tight_layout()

df.iloc[:,100:].plot(kind='box', rot=90, figsize=(14, 8))
plt.tight_layout()


Box plot 1Box plot 2






share|improve this answer























  • But I want to see according to the label values. Boxplot having Target label 1 and Target label 0. By the way that good idea 100 100 coloums but not full filling my problem. Because I have Imbalance data

    – Sohaib Anwaar
    Mar 23 at 0:51











  • Can you include some sample data to help me better understand what you mean? I cannot run your code without sample data, and I can’t tell what your data structure is.

    – Nathaniel
    Mar 23 at 1:14













1












1








1







I would recommend plotting two different box plots with a hundred columns on each:



import numpy as np; np.random.seed(0)
import pandas as pd
import cufflinks as cf

df = cf.datagen.box(200)

df.iloc[:,0:100].plot(kind='box', rot=90, figsize=(14, 8))
plt.tight_layout()

df.iloc[:,100:].plot(kind='box', rot=90, figsize=(14, 8))
plt.tight_layout()


Box plot 1Box plot 2






share|improve this answer













I would recommend plotting two different box plots with a hundred columns on each:



import numpy as np; np.random.seed(0)
import pandas as pd
import cufflinks as cf

df = cf.datagen.box(200)

df.iloc[:,0:100].plot(kind='box', rot=90, figsize=(14, 8))
plt.tight_layout()

df.iloc[:,100:].plot(kind='box', rot=90, figsize=(14, 8))
plt.tight_layout()


Box plot 1Box plot 2







share|improve this answer












share|improve this answer



share|improve this answer










answered Mar 23 at 0:26









NathanielNathaniel

2,220214




2,220214












  • But I want to see according to the label values. Boxplot having Target label 1 and Target label 0. By the way that good idea 100 100 coloums but not full filling my problem. Because I have Imbalance data

    – Sohaib Anwaar
    Mar 23 at 0:51











  • Can you include some sample data to help me better understand what you mean? I cannot run your code without sample data, and I can’t tell what your data structure is.

    – Nathaniel
    Mar 23 at 1:14

















  • But I want to see according to the label values. Boxplot having Target label 1 and Target label 0. By the way that good idea 100 100 coloums but not full filling my problem. Because I have Imbalance data

    – Sohaib Anwaar
    Mar 23 at 0:51











  • Can you include some sample data to help me better understand what you mean? I cannot run your code without sample data, and I can’t tell what your data structure is.

    – Nathaniel
    Mar 23 at 1:14
















But I want to see according to the label values. Boxplot having Target label 1 and Target label 0. By the way that good idea 100 100 coloums but not full filling my problem. Because I have Imbalance data

– Sohaib Anwaar
Mar 23 at 0:51





But I want to see according to the label values. Boxplot having Target label 1 and Target label 0. By the way that good idea 100 100 coloums but not full filling my problem. Because I have Imbalance data

– Sohaib Anwaar
Mar 23 at 0:51













Can you include some sample data to help me better understand what you mean? I cannot run your code without sample data, and I can’t tell what your data structure is.

– Nathaniel
Mar 23 at 1:14





Can you include some sample data to help me better understand what you mean? I cannot run your code without sample data, and I can’t tell what your data structure is.

– Nathaniel
Mar 23 at 1:14



















draft saved

draft discarded
















































Thanks for contributing an answer to Stack Overflow!


  • Please be sure to answer the question. Provide details and share your research!

But avoid


  • Asking for help, clarification, or responding to other answers.

  • Making statements based on opinion; back them up with references or personal experience.

To learn more, see our tips on writing great answers.




draft saved


draft discarded














StackExchange.ready(
function ()
StackExchange.openid.initPostLogin('.new-post-login', 'https%3a%2f%2fstackoverflow.com%2fquestions%2f55309096%2fvisualizing-high-dimensional-data-at-once%23new-answer', 'question_page');

);

Post as a guest















Required, but never shown





















































Required, but never shown














Required, but never shown












Required, but never shown







Required, but never shown

































Required, but never shown














Required, but never shown












Required, but never shown







Required, but never shown







Popular posts from this blog

Kamusi Yaliyomo Aina za kamusi | Muundo wa kamusi | Faida za kamusi | Dhima ya picha katika kamusi | Marejeo | Tazama pia | Viungo vya nje | UrambazajiKuhusu kamusiGo-SwahiliWiki-KamusiKamusi ya Kiswahili na Kiingerezakuihariri na kuongeza habari

Swift 4 - func physicsWorld not invoked on collision? The Next CEO of Stack OverflowHow to call Objective-C code from Swift#ifdef replacement in the Swift language@selector() in Swift?#pragma mark in Swift?Swift for loop: for index, element in array?dispatch_after - GCD in Swift?Swift Beta performance: sorting arraysSplit a String into an array in Swift?The use of Swift 3 @objc inference in Swift 4 mode is deprecated?How to optimize UITableViewCell, because my UITableView lags

Access current req object everywhere in Node.js ExpressWhy are global variables considered bad practice? (node.js)Using req & res across functionsHow do I get the path to the current script with Node.js?What is Node.js' Connect, Express and “middleware”?Node.js w/ express error handling in callbackHow to access the GET parameters after “?” in Express?Modify Node.js req object parametersAccess “app” variable inside of ExpressJS/ConnectJS middleware?Node.js Express app - request objectAngular Http Module considered middleware?Session variables in ExpressJSAdd properties to the req object in expressjs with Typescript