Python 熊猫创建一个包含 n 个元素的系列(顺序或随机)

声明:本页面是StackOverFlow热门问题的中英对照翻译,遵循CC BY-SA 4.0协议,如果您需要使用它,必须同样遵循CC BY-SA许可,注明原文地址和作者信息,同时你必须将它归于原作者(不是我):StackOverFlow 原文地址: http://stackoverflow.com/questions/33828475/
Warning: these are provided under cc-by-sa 4.0 license. You are free to use/share it, But you must attribute it to the original authors (not me): StackOverFlow

提示:将鼠标放在中文语句上可以显示对应的英文。显示中英文
时间:2020-08-19 14:03:58  来源:igfitidea点击:

pandas create a series with n elements (sequential or randbetween)

pythonpandasdataframeseriesrandom-sample

提问by becinyus

I am trying to create a pandasseries.

我正在尝试创建一个pandas系列。

One column of the series should contain n sequential numbers. [1, 2, 3, ..., n]

该系列的一列应包含 n 个序列号。 [1, 2, 3, ..., n]

One column should contain random numbers between kand k+100.

一列应包含k和之间的随机数k+100

One column should contain random selection between strings in a list. ['A', 'B', 'C', ... 'Z']

一列应包含列表中字符串之间的随机选择。 ['A', 'B', 'C', ... 'Z']

采纳答案by jezrael

There can be a lot of solutions. In the comments of the code block (#) you will find a few links for more information:

可以有很多解决方案。在代码块 ( #)的注释中,您会找到一些链接以获取更多信息:

import pandas as pd
import numpy as np
import random
import string

k = 5
N = 10

#http://docs.scipy.org/doc/numpy/reference/generated/numpy.random.randint.html
#http://stackoverflow.com/a/2257449/2901002

df = pd.DataFrame({ 'A' : range(1, N + 1 ,1),
    'B' : np.random.randint(k, k + 100 , size=N),
    'C' : pd.Series(random.choice(string.ascii_uppercase) for _ in range(N)) })

print df
#    A   B  C
#0   1  60  O
#1   2  94  L
#2   3  10  W
#3   4  94  X
#4   5  60  O
#5   6  20  K
#6   7  58  Y
#7   8  40  I
#8   9  49  X
#9  10  65  S

Numpysolution:

麻木解决方案:

import pandas as pd
import numpy as np

k = 5
N = 10

alphabet = list('ABCDEFGHIJKLMNOPQRSTUVWXYZ')

#http://docs.scipy.org/doc/numpy/reference/generated/numpy.random.choice.html

df = pd.DataFrame({ 'A' : range(1, N + 1 ,1),
    'B' : np.random.randint(k, k + 100 , size=N),
    'C' : np.random.choice(np.array(alphabet, dtype="|S1"), N) })

print df
#    A    B  C
#0   1   16  U
#1   2   76  X
#2   3  101  N
#3   4   61  F
#4   5   52  J
#5   6   62  A
#6   7   99  L
#7   8   23  N
#8   9   75  D
#9  10   16  Q

回答by Tom Ron

import pandas
n = 30
k = 40
pandas.DataFrame([(i, random.randint(k, k+100), chr(random.randint(ord('A'), ord('Z')))) for i in xrange(0, n)

If you want you specify the column names otherwise it is set to 0,1,2

如果您想指定列名,否则将其设置为 0,1,2