Skip to content Skip to sidebar Skip to footer

How To Properly Fetch Single Cells In Pandas: Loc[index,column] Vs Get_value(index,column)

Which method is better (in terms of performance and reliability) to use in order to fetch single cells from a pandas DataFrame: get_value() or loc[]?

Solution 1:

You can find info in docs in the end:

For getting a value explicitly (equiv to deprecated df.get_value('a','A'))

# thisisalsoequivalentto ``df1.at['a','A']``
In[55]: df1.loc['a', 'A']Out[55]: 0.13200317033032932

but if use it there is no warning.

But if check Index.get_value:

Fast lookup of value from 1-dimensional ndarray. Only use this if you know what you’re doing

So I think better is use iat, at, loc, ix.

Timings:

df = pd.DataFrame({'A':[1,2,3],
                   'B':[4,5,6],
                   'C':[7,8,9],
                   'D':[1,3,5],
                   'E':[5,3,6],
                   'F':[7,4,3]})

print (df)

In [93]: %timeit (df.loc[0, 'A'])
The slowest run took 6.40times longer than the fastest. This could mean that an intermediate result is being cached.
10000 loops, best of 3: 177 µs per loop

In [96]: %timeit (df.at[0, 'A'])
The slowest run took 17.01times longer than the fastest. This could mean that an intermediate result is being cached.
100000 loops, best of 3: 7.61 µs per loop

In [94]: %timeit (df.get_value(0, 'A'))
The slowest run took 23.49times longer than the fastest. This could mean that an intermediate result is being cached.
100000 loops, best of 3: 3.36 µs per loop

Post a Comment for "How To Properly Fetch Single Cells In Pandas: Loc[index,column] Vs Get_value(index,column)"