Where communities thrive

  • Join over 1.5M+ people
  • Join over 100K+ communities
  • Free without limits
  • Create your own community
Repo info
  • 19:43
    jbrockmendel commented #45597
  • 19:36
    wl2522 labeled #45599
  • 19:36
    wl2522 opened #45599
  • 19:36
    wl2522 labeled #45599
  • 19:25
    adi611 commented #45592
  • 19:03
    f4kt labeled #45598
  • 19:03
    f4kt labeled #45598
  • 19:03
    f4kt opened #45598
  • 18:50
    mroeschke labeled #45597
  • 18:50
    mroeschke labeled #45597
  • 18:49
    mroeschke opened #45597
  • 18:45
    Dr-Irv commented #45252
  • 18:33
    NumberPiOso synchronize #45578
  • 18:31
    mroeschke synchronize #45478
  • 18:20
    shubham11941140 review_requested #45017
  • 18:16
    shubham11941140 commented #45017
  • 18:05
    jbrockmendel opened #45596
  • 18:02
    mroeschke milestoned #45581
  • 18:01
    mroeschke synchronize #45437
  • 17:56
    mroeschke commented #45592
Can anyone help me
My gmail account is hacked
Professor Crypto / Eater
Marc Garcia
@robbuckley thanks for the heads up, can you please open an issue in GitHub?
Marc Garcia
@ncernek personally I think that GitHub should let anyone assign issues to themselves, so you could filter by the unassigned ones (that's how it works in all other platforms afaik), but they don't give us this option. I contacted GitHub support few times about this, feel free to do the same
hello! could someone take a look at this PR on formatting isort output? I'm learning lots about awk / bash in general but I'm having trouble figuring out what's needed in terms of following the current logging structure within ci pandas-dev/pandas#27334
thanks in advance :)
@datapythonista there’s already an issue to track the missing contributors list pandas-dev/pandas#27687. whoops, i should have checked the issues list first. I can see a fix has been merged to both master and the 0.25.x branch. But I can’t seem to find the rendered docs with the fix. For example https://pandas.pydata.org/pandas-docs/stable/whatsnew/v0.25.0.html#contributors still has the empty list
Jeff Reback
when 0.25.1 is released this will be updated
Can we request for additional feature/enhancement here?
Nico Cernek
@datapythonista The “unassigned” filter feature works, unfortunately seems like most contributors do not use the “assigned” feature, so its not helpful in finding new issues. I would love to contribute more, but there’s no easy way to find available issues. I’ve lost the patience to randomly click on issues and see if someone isn't already working on them. I’m open to any ideas as to how to move forwards.
Tom Augspurger

Contributors can’t self assing. That would have to be done by maintainers. But I don’t think it’d be possible for us to keep them up to date. So I’d also be happy to hear how that can be improved.

It’d be nice if GitHub had a way to filter based on whether a PR says it’ll close an issue.

Nico Cernek

It’d be nice if GitHub had a way to filter based on whether a PR says it’ll close an issue

Yeah thats really what it comes down to.

Marc Garcia
until recently, GitHub didn't even allow maintainers to assign issues to non-maintainers, now we can for non-maintainers that commented on the issue, but still useless for a project like pandas; I'll be contacting GitHub once more to see if they can fix it, otherwise we can move to Gitlab :)
does anybody know a way to run a python script on schedule? I am importing data from a sql table, manipulating/cleaning /merging it using Pandas. I want to export the resulting dataframe as csv twice a week on specific days and times. what's the best way to do it? Thanks.
Benjamin Root
on linux or mac, you can use a crontab. on windows, there is a feature called "scheduler", I think
Roy Mor

Hey, I'm trying to assign a new categorial column with pandas like so:

encoded_df = encoded_df.assign(Floor=lambda x: x.Cabin[0])

Cabin is something like C186
when I view the new column all I get is NaN any idea why ?

Carlos P Ceballos
hi, when dealing with large DFs downcasting Float64 to Float32 can free up some hundreds MB, but when doing a group_by or a sum, I loose precision. Recasting before works perfectly and takes no time. But it looks bad, I wonder if adding the arg recast_float would be a welcome addition to aggregation functions.
ali Fazeli
you can check my blog posts about my experience as a data analyst at Huawei Technologies in my portfolio: https://faze.li/
hi i need some help with numpy and pandas.. been stuck for a few days
i have a table with items that flow through a job from level 1 to level x
Item Job # Level Qty
A 111 1 1
B 112 1 200
C 113 1 300
D 114 1 300
E 115 1 400
E 115 2 350
B 112 2 100
E 115 3 350
E 115 4 200
B 112 2 100
B 112 3 100
A 111 2 200
A 111 3 200
A 111 4 200
need to calculate a z value based on each job # to see if there are any fluctuates in each level with the quantity and remove outliers
Carlos P Ceballos
@carlosjpc that was what i needed. thank you!
I have a dataframe called "po" that imports a csv file called Pasta1.csv, whose code is here https://repl.it/@RogerRegor/BubblyWobblyListener-1 . It has some empty and null values in a column called "DATA FINAL". But, the commands isna() or isnull() are not returning correctly the datas(in this case, it returns 0. But in this dataframe there are several null values. I do not know what is happening.
Carlos P Ceballos
@r063r3 you must have invisible characters which are not null. Sometimes visualizing the file in Excel just increases frustration, you can open the file in your editor to find the true value of them, or use iloc and target those rows and columns to get the value in that cell and then you can filter that out
Thank you...it was this....there are empty espaces in the start of each row...
Now, I am having other problem in this file...
One of the column has a data with 12 numbers. i.e, 12345678910...when I import in Pandas, the columns shows something like that: 123456+98
Someone knows what is message erro means? OutOfBoundsDatetime: Out of bounds nanosecond timestamp: 3000-01-01 00:00:00
Younggun Kim
FYI. We are going to sprint on pandas at PyCon KR from 15th Aug 10 AM (UTC+9).
Cera Kamikazi
Kolmar Kafran
@r063r3 pd.Timestamp.max and pd.Timestamp.min goes from 1677 to 2262. Pandas can't handle the year 3000. You should use pd.Period to handle this date.
Hey, not sure what's happening while trying to build pandas from source. I followed the instructions from here(https://pandas.pydata.org/pandas-docs/stable/development/contributing.html#where-to-start). Would be awesome if someone could help me out. This is the error I get after running the installation.
(pandas-dev) Mathews-MacBook-Pro:cpython mathewgeorge$ python3
Python 3.7.2 (v3.7.2:9a3ffc0492, Dec 24 2018, 02:44:43) 
[Clang 6.0 (clang-600.0.57)] on darwin
Type "help", "copyright", "credits" or "license" for more information.
>>> import pandas
Traceback (most recent call last):
  File "<stdin>", line 1, in <module>
  File "/Users/mathewgeorge/projects/my_projects/pandas/pandas/__init__.py", line 55, in <module>
    from pandas.core.api import (
  File "/Users/mathewgeorge/projects/my_projects/pandas/pandas/core/api.py", line 5, in <module>
    from pandas.core.arrays.integer import (
  File "/Users/mathewgeorge/projects/my_projects/pandas/pandas/core/arrays/__init__.py", line 1, in <module>
    from .base import (  # noqa: F401
  File "/Users/mathewgeorge/projects/my_projects/pandas/pandas/core/arrays/base.py", line 14, in <module>
    from pandas.compat.numpy import function as nv
  File "/Users/mathewgeorge/projects/my_projects/pandas/pandas/compat/numpy/function.py", line 28, in <module>
    from pandas.util._validators import (
  File "/Users/mathewgeorge/projects/my_projects/pandas/pandas/util/__init__.py", line 3, in <module>
    from pandas.core.util.hashing import hash_array, hash_pandas_object  # noqa
  File "/Users/mathewgeorge/projects/my_projects/pandas/pandas/core/util/hashing.py", line 11, in <module>
    from pandas.core.dtypes.cast import infer_dtype_from_scalar
  File "/Users/mathewgeorge/projects/my_projects/pandas/pandas/core/dtypes/cast.py", line 9, in <module>
    from pandas.util._validators import validate_bool_kwarg
  File "/Users/mathewgeorge/projects/my_projects/pandas/pandas/util/_validators.py", line 7, in <module>
    from pandas.core.dtypes.common import is_bool
  File "/Users/mathewgeorge/projects/my_projects/pandas/pandas/core/dtypes/common.py", line 11, in <module>
    from pandas.core.dtypes.dtypes import (
  File "/Users/mathewgeorge/projects/my_projects/pandas/pandas/core/dtypes/dtypes.py", line 17, in <module>
    from .inference import is_bool, is_list_like
  File "/Users/mathewgeorge/projects/my_projects/pandas/pandas/core/dtypes/inference.py", line 26, in <module>
    is_list_like = lib.is_list_like
AttributeError: module 'pandas._libs.lib' has no attribute 'is_list_like'
Bit more info: I had already contributed a bit to fixing docstrings earlier. I just updated my fork and my master. Tried the above and encountered the same error. Tried rebuilding, and also made sure my c compiler was installed.
Tom Augspurger
It looks like the C extensions weren’t (re)built. What’s the output of python setup.py build_ext -i?
Andrei Berceanu
Is this a suitable place to ask user questions about pandas?
Or is there something like a separate slack channel for that?
Tom Augspurger
Use StackOverflow with the pandas tag.
Andrei Berceanu
Thanks @TomAugspurger
@TomAugspurger : It gives me this
(pandas-dev) Mathews-MacBook-Pro:pandas mathewgeorge$ python3 setup.py build_ext -i
running build_ext
(pandas-dev) Mathews-MacBook-Pro:pandas mathewgeorge$
I tried rebuilding the extension with the --force flag and that seemed to at least start the process. It built till the end, after which I proceeded to install pandas using python3 -m pip install -e .. Still the same error :(.
Tom Augspurger
Strange. Is your Cython version up to date @killerontherun1?
I believe so. I have updated the master of my cython repository locally. Should I be doing something to build cython? Since the "contributing to pandas" doc mentioned I shouldn't have to do the configure and make steps, I skipped that and just installed the compiler via xcode (which already existed).
I tried reinstalling via xcode post my update of the cython repository.
Marc Garcia
@killerontherun1 can you make sure your copy of the pandas source is updated (i.e. git fetch upstream && git merge upstream/master), and in the root of the project run conda env update if you already have a conda environment, or conda env create if you don't. Then a python setup.py build_ext --inplace --force should work, otherwise there is something weird going on
Hasan Shaukat
I have a dataframe df = pd.DataFrame({'col1':['a', 'b', 'a', 'b', 'c']})
I want to make an 'is_first' column which would have [1,1,0,0,1] (1 if the col1 value appeared for the first time, 0 otherwise)
Any pandorable, vectorized approach for this?
Gareth Elston
Where should I go to report problems with the pandas.pydata.org website?