我能够根据这样的列表元素过滤数据帧;

import pandas as pd
W1 = ['Animal','Ball','Cat','Derry','Element','Lapse','Animate this']
W2 = ['Krota','Catch','Yankee','Global','Zeb','Rat','Try']
df = pd.DataFrame({'W1':W1,'W2':W2})

l1 = ['Animal','Zeb','Q']
print df[df['W1'].isin(l1) | df['W2'].isin(l1)]

        W1     W2
  0   Animal  Krota
  4  Element    Zeb

但是有没有一种方法可以通过应用正则表达式进行过滤呢;
对于ex;
 l1 = ['An','Cat']

 Intended result;
          W1         W2
  0   Animal        Krota
  1   Ball          Catch
  2   Cat           Yankee
  6   Animate this  Try

最佳答案

试试这个:

df[df['W1'].str.contains("|".join(l1)) | df['W2'].str.contains("|".join(l1))]


             W1      W2
0        Animal   Krota
1          Ball   Catch
2           Cat  Yankee
6  Animate this     Try

关于python - Python Pandas:通过应用正则表达式过滤数据框,我们在Stack Overflow上找到一个类似的问题:https://stackoverflow.com/questions/22093006/

10-12 14:00