How can I plot using histogram2d in Python?

I have some data and want to read the 5th and 6th columns to plot a 2D histogram of them. I have written the following code, but it fails. I would appreciate it if someone could help me with this.

There is a dimension error message that I don't understand, something like "The dimension of bins must be equal to the dimension of the sample x."
I have included the code that I tried. Any suggestions are much appreciated.

Here is a link to a sample of my data:
https://gofile.io/d/U4NT0e

What I have tried:

import numpy as np
import pandas as pd
import matplotlib.pyplot as plt

path = r'D:\test_file\data\final.txt'
df = pd.read_csv( path , header = None, sep = '   ', dtype = float, engine = 'python')
df1 = df[4].to_string()
df2 = df[5].to_string()

lmin = -1
lmax = 1
nbins = 100    
xedge = np.linspace(lmin, lmax, nbins + 1)
yedge = np.linspace(lmin, lmax, nbins + 1)
fxyz = np.zeros((nbins,nbins))

weight = None
hist,xedges,yedges=np.histogram2d(df1, df2, bins=(xedge,yedge), normed=True, weights=weight)
fxyz += hist.T
extent = (xedges[0],xedges[-1],yedges[0],yedges[-1])  
with np.errstate(divide='ignore',invalid='ignore'):
    plt.figure(figsize=(10,6))
    images = plt.imshow(np.log10(fxyz),  origin='lower', extent=extent,cmap='jet')
    plt.ylim(0,0.7); plt.xlim(-0.7,0.7)
    plt.show()

Posted 16-Sep-20 6:27am

Mahdi Hasanzadeh

Updated 16-Sep-20 21:29pm


Comments

Richard MacCutchan 16-Sep-20 12:36pm

When you receive an error message, please add the complete text to your question, and indicate which line of code caused it to occur.

Mahdi Hasanzadeh 16-Sep-20 13:16pm

hist,xedges,yedges=np.histogram2d(df_pe, df_pa, bins=(xedge,yedge), normed=True, weights=weight)
File "D:\Anaconda\lib\site-packages\numpy\lib\twodim_base.py", line 655, in histogram2d
hist, edges = histogramdd([x, y], bins, range, normed, weights)
File "D:\Anaconda\lib\site-packages\numpy\lib\function_base.py", line 917, in histogramdd
'The dimension of bins must be equal to the dimension of the '
ValueError: The dimension of bins must be equal to the dimension of the sample x.

Mahdi Hasanzadeh 18-Sep-20 15:49pm

Thank you for your kind response and for answering my question.

[no name] 16-Sep-20 12:50pm

Sounds like your CSV "column count / indexing" might be off.

Mahdi Hasanzadeh 16-Sep-20 13:17pm

Thanks for the response. What do you mean, though? I printed out df1 and df2, and they look right.

[no name] 16-Sep-20 13:23pm

What does this do? (You say you're plotting "2" columns)

nbins = 100

Mahdi Hasanzadeh 16-Sep-20 13:34pm

This is the number of bins in each direction. I need to divide the domain into bins to make the histogram plot.

MehreenTahir 16-Sep-20 12:53pm

Did you try hist2d from matplotlib.pyplot? Also, it would be helpful if you indicated the dataset you're using, and yes, please add the complete error message and the line on which you're getting the error.

Mahdi Hasanzadeh 16-Sep-20 13:32pm

Thanks for the response. I have provided a link to a sampled data.
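For reference, the hist2d suggestion above could look roughly like the sketch below. It is only an illustration under stated assumptions: the x and y arrays are synthetic stand-ins generated with NumPy (the real data file is not reproduced here), and the bin edges, axis limits, and colormap are copied from the question. LogNorm is used in place of taking log10 of the counts by hand.

```python
import numpy as np
import matplotlib.pyplot as plt
from matplotlib.colors import LogNorm

# Synthetic stand-in data (assumption: the real columns are 1-D float arrays)
rng = np.random.default_rng(0)
x = rng.normal(0.0, 0.3, size=5000)
y = rng.normal(0.3, 0.2, size=5000)

# 100 bins per axis between -1 and 1, as in the question
edges = np.linspace(-1, 1, 101)

fig, ax = plt.subplots(figsize=(10, 6))
# hist2d bins and draws in one call; LogNorm gives a logarithmic colour scale
counts, xedges, yedges, image = ax.hist2d(
    x, y, bins=(edges, edges), density=True, norm=LogNorm(), cmap='jet'
)
fig.colorbar(image, ax=ax)
ax.set_xlim(-0.7, 0.7)
ax.set_ylim(0, 0.7)
```

Call plt.show() afterwards to display the figure interactively.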

1 solution


This content, along with any associated source code and files, is licensed under The Code Project Open License (CPOL)

Richard MacCutchan · Accepted Answer · 2020-09-16T21:29:00

Python

df1 = df[4].to_string()
df2 = df[5].to_string()

# ...

hist,xedges,yedges=np.histogram2d(df1, df2, bins=(xedge,yedge), normed=True, weights=weight)
fxyz += hist.T

You are calling to_string() on the columns, which converts each column into a single string. Because read_csv was called with dtype = float, df[4] and df[5] are already numeric. You then pass these strings to np.histogram2d, which expects two arrays as its first two parameters.

See the pandas.read_csv documentation (pandas 1.1.2) and the numpy.histogram2d documentation (NumPy v1.19 Manual).
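A minimal sketch of the fix, assuming the columns are numeric: drop the to_string() calls and pass the columns as arrays. The helper name columns_to_histogram and the synthetic DataFrame are hypothetical, used here only because the original final.txt is not available. The sketch also uses density=True, since recent NumPy releases no longer accept the normed keyword.

```python
import numpy as np
import pandas as pd

def columns_to_histogram(df, xcol=4, ycol=5, lmin=-1.0, lmax=1.0, nbins=100):
    """Return (hist, xedges, yedges) for two numeric DataFrame columns.

    Hypothetical helper for illustration; not part of pandas or NumPy.
    """
    # Keep the columns numeric: to_numpy() yields 1-D float arrays,
    # whereas to_string() would yield one long str per column.
    x = df[xcol].to_numpy(dtype=float)
    y = df[ycol].to_numpy(dtype=float)
    # nbins bins need nbins + 1 edges
    edges = np.linspace(lmin, lmax, nbins + 1)
    # density=True is used here instead of the older normed=True
    return np.histogram2d(x, y, bins=(edges, edges), density=True)

# Synthetic stand-in for final.txt (assumption: at least six numeric columns)
rng = np.random.default_rng(0)
df = pd.DataFrame(rng.uniform(-1, 1, size=(1000, 6)))

hist, xedges, yedges = columns_to_histogram(df)
```

The returned hist can then be transposed and passed to plt.imshow as in the question's plotting code.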