<tfoot id='3pNPa'></tfoot>
        <i id='3pNPa'><tr id='3pNPa'><dt id='3pNPa'><q id='3pNPa'><span id='3pNPa'><b id='3pNPa'><form id='3pNPa'><ins id='3pNPa'></ins><ul id='3pNPa'></ul><sub id='3pNPa'></sub></form><legend id='3pNPa'></legend><bdo id='3pNPa'><pre id='3pNPa'><center id='3pNPa'></center></pre></bdo></b><th id='3pNPa'></th></span></q></dt></tr></i><div id='3pNPa'><tfoot id='3pNPa'></tfoot><dl id='3pNPa'><fieldset id='3pNPa'></fieldset></dl></div>
      1. <legend id='3pNPa'><style id='3pNPa'><dir id='3pNPa'><q id='3pNPa'></q></dir></style></legend>
        • <bdo id='3pNPa'></bdo><ul id='3pNPa'></ul>

        <small id='3pNPa'></small><noframes id='3pNPa'>

        如何使用带有 Pandas 的时间戳按小时对数据帧进行

        时间:2023-09-14
        <tfoot id='YcGnA'></tfoot>

                <bdo id='YcGnA'></bdo><ul id='YcGnA'></ul>

              • <legend id='YcGnA'><style id='YcGnA'><dir id='YcGnA'><q id='YcGnA'></q></dir></style></legend>
                <i id='YcGnA'><tr id='YcGnA'><dt id='YcGnA'><q id='YcGnA'><span id='YcGnA'><b id='YcGnA'><form id='YcGnA'><ins id='YcGnA'></ins><ul id='YcGnA'></ul><sub id='YcGnA'></sub></form><legend id='YcGnA'></legend><bdo id='YcGnA'><pre id='YcGnA'><center id='YcGnA'></center></pre></bdo></b><th id='YcGnA'></th></span></q></dt></tr></i><div id='YcGnA'><tfoot id='YcGnA'></tfoot><dl id='YcGnA'><fieldset id='YcGnA'></fieldset></dl></div>
                1. <small id='YcGnA'></small><noframes id='YcGnA'>

                    <tbody id='YcGnA'></tbody>
                2. 本文介绍了如何使用带有 Pandas 的时间戳按小时对数据帧进行分组的处理方法,对大家解决问题具有一定的参考价值,需要的朋友们下面随着小编来一起学习吧!

                  问题描述

                  我有以下使用时间戳索引的数据帧结构:

                  I have the following dataframe structure that is indexed with a timestamp:

                      neg neu norm    pol pos date
                  time                        
                  1520353341  0.000   1.000   0.0000  0.000000    0.000   
                  1520353342  0.121   0.879   -0.2960 0.347851    0.000   
                  1520353342  0.217   0.783   -0.6124 0.465833    0.000   
                  

                  我根据时间戳创建一个日期:

                  I create a date from the timestamp:

                  data_frame['date'] = [datetime.datetime.fromtimestamp(d) for d in data_frame.time]
                  

                  结果:

                      neg neu norm    pol pos date
                  time                        
                  1520353341  0.000   1.000   0.0000  0.000000    0.000   2018-03-06 10:22:21
                  1520353342  0.121   0.879   -0.2960 0.347851    0.000   2018-03-06 10:22:22
                  1520353342  0.217   0.783   -0.6124 0.465833    0.000   2018-03-06 10:22:22
                  

                  我想按小时分组,同时获得除时间戳以外的所有值的平均值,应该是小时小组开始的地方.所以这是我要归档的结果:

                  I want to group by hour, while getting the mean for all the values, except the timestamp, that should be the hour from where the group started. So this is the result I want to archive:

                      neg neu norm    pol pos
                  time                    
                  1520352000  0.027989    0.893233    0.122535    0.221079    0.078779
                  1520355600  0.028861    0.899321    0.103698    0.209353    0.071811
                  

                  到目前为止,我得到的最接近的是这个 回答:

                  The closest I have gotten so far has been with this answer:

                  data = data.groupby(data.date.dt.hour).mean()
                  

                  结果:

                      neg neu norm    pol pos
                  date                    
                  0   0.027989    0.893233    0.122535    0.221079    0.078779
                  1   0.028861    0.899321    0.103698    0.209353    0.071811
                  

                  但我不知道如何保留考虑到 grouby 开始时间的时间戳.

                  But I cant figure out how to keep the timestamp that takes in account he hour where the grouby started.

                  推荐答案

                  我遇到了这个 gem,pd.DataFrame.resample,在我发布了按小时计算的解决方案之后.

                  I came across this gem, pd.DataFrame.resample, after I posted my round-to-hour solution.

                  # Construct example dataframe
                  times = pd.date_range('1/1/2018', periods=5, freq='25min')
                  values = [4,8,3,4,1]
                  df = pd.DataFrame({'val':values}, index=times)
                  
                  # Resample by hour and calculate medians
                  df.resample('H').median()
                  

                  或者你可以使用 groupbyGrouper 如果您不想将时间作为索引:

                  Or you can use groupby with Grouper if you don't want times as index:

                  df = pd.DataFrame({'val':values, 'times':times})
                  df.groupby(pd.Grouper(level='times', freq='H')).median()
                  

                  这篇关于如何使用带有 Pandas 的时间戳按小时对数据帧进行分组的文章就介绍到这了,希望我们推荐的答案对大家有所帮助,也希望大家多多支持html5模板网!

                  上一篇:对于 1970-01-01 之前的日期,Windows 上 datetime time 下一篇:使用python中的pandas解析年月日和小时在不同列中

                  相关文章

                  最新文章

                  <legend id='i4kqv'><style id='i4kqv'><dir id='i4kqv'><q id='i4kqv'></q></dir></style></legend>
                  <i id='i4kqv'><tr id='i4kqv'><dt id='i4kqv'><q id='i4kqv'><span id='i4kqv'><b id='i4kqv'><form id='i4kqv'><ins id='i4kqv'></ins><ul id='i4kqv'></ul><sub id='i4kqv'></sub></form><legend id='i4kqv'></legend><bdo id='i4kqv'><pre id='i4kqv'><center id='i4kqv'></center></pre></bdo></b><th id='i4kqv'></th></span></q></dt></tr></i><div id='i4kqv'><tfoot id='i4kqv'></tfoot><dl id='i4kqv'><fieldset id='i4kqv'></fieldset></dl></div>
                  <tfoot id='i4kqv'></tfoot>

                    • <bdo id='i4kqv'></bdo><ul id='i4kqv'></ul>
                    1. <small id='i4kqv'></small><noframes id='i4kqv'>