In a pandas DataFrame, I need to set the values of a column based on the values of other columns

I have a DataFrame with three columns: cuid, type, and errorreason. The errorreason column is currently empty, and I have to fill it using the following logic:

1.) If cuid is unique and type is 'COL', then errorreason is 'NO ERROR' (all unique values are 'NO ERROR').

2.) If cuid is not unique and its types are 'COL' and 'ROT', then errorreason is 'AD'.

3.) If cuid is not unique and its types are 'COL' and 'TOT', then errorreason is 'RE'.

4.) In any other case, errorreason is 'Unidentified'.

I have already separated the unique and non-unique values, so the first point is done, but I am stuck on the remaining points. I was trying to group by the non-unique cuid values and then apply a function.

Jessiedoylejuliet Asked on July 16, 2020 in Python.

2 Answer(s)

This solution is fairly long, but I have added an explanation for each step so that it is clear. At the end, you obtain your desired output.

import numpy as np
import pandas as pd

# sample data
df = pd.DataFrame({
    'cuid': [100814, 100814, 100815, 100815, 100816],
    'type': ['col', 'rot', 'col', 'tot', 'col']
})

# define a function for concatenating 'type' within the same 'cuid'
def str_cat(x):
    return x.str.cat(sep=', ')

# create a lookup dataset, indexed by 'cuid', that we will merge later on;
# 'unique' holds the number of rows for each 'cuid'
df_lookup = df.groupby('cuid').agg({
    'cuid': 'count',
    'type': str_cat
}).rename(columns={'cuid': 'unique'})

# create 'error_reason' on the lookup dataset with np.select, which acts like a CASE WHEN statement
# (the count column is called 'unique' after the rename above)
df_lookup['error_reason'] = np.select(
    [
        (df_lookup['unique'] == 1) & (df_lookup['type'] == 'col'),
        (df_lookup['unique'] > 1) & (df_lookup['type'].str.contains('col')) & (df_lookup['type'].str.contains('rot')),
        (df_lookup['unique'] > 1) & (df_lookup['type'].str.contains('col')) & (df_lookup['type'].str.contains('tot'))
    ],
    [
        'NO ERROR',
        'AD',
        'RE'
    ],
    default='Unidentified'
)

# merge the two datasets on 'cuid' (the index of the lookup dataset)
df.merge(df_lookup.drop(columns=['type', 'unique']), on='cuid')

Output

     cuid type error_reason
0  100814  col           AD
1  100814  rot           AD
2  100815  col           RE
3  100815  tot           RE
4  100816  col     NO ERROR
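As a possible shorter variant of the same rules, the sketch below skips the lookup table and the merge by using groupby with transform. The helper name classify and the extra sample row for cuid 100817 are assumptions added for illustration; they are not part of the question. The checks follow the same order as the np.select conditions, so a cuid that has 'col', 'rot' and 'tot' together would get 'AD'.

```python
import pandas as pd

# sample data; the last row (a unique cuid whose type is not 'col') is a
# hypothetical addition to show the 'Unidentified' branch
df = pd.DataFrame({
    'cuid': [100814, 100814, 100815, 100815, 100816, 100817],
    'type': ['col', 'rot', 'col', 'tot', 'col', 'rot'],
})


def classify(types):
    """Return a single error reason for all the rows of one cuid."""
    kinds = set(types)
    if len(types) == 1:
        # unique cuid: only 'col' counts as 'NO ERROR'
        return 'NO ERROR' if kinds == {'col'} else 'Unidentified'
    if {'col', 'rot'} <= kinds:
        return 'AD'
    if {'col', 'tot'} <= kinds:
        return 'RE'
    return 'Unidentified'


# transform broadcasts the value returned for each group back to its rows
df['error_reason'] = df.groupby('cuid')['type'].transform(classify)
print(df)
```

Because transform returns a result aligned with the original rows, the new column can be assigned directly and no merge is needed.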

Erniekennithjosefa Answered on July 16, 2020.

Try to use this:

df.groupby('cuid', as_index=False)['type'].aggregate(list)

I have not tested this solution so let me know if it does not work.
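The line above only collects the types of each cuid into a list; it does not fill errorreason yet. A rough sketch of how the remaining steps might look is given below. It assumes the columns are named cuid and type in lowercase, as in the question, and the sample data and the helper name reason_for are made up for illustration.

```python
import pandas as pd

# hypothetical sample data using the column names from the question
df = pd.DataFrame({
    'cuid': [100814, 100814, 100815, 100815, 100816],
    'type': ['COL', 'ROT', 'COL', 'TOT', 'COL'],
})

# one row per cuid, with all of its types collected into a list
grouped = df.groupby('cuid', as_index=False)['type'].aggregate(list)


def reason_for(types):
    """Map the list of types of one cuid to an error reason."""
    kinds = set(types)
    if len(types) == 1:
        return 'NO ERROR' if kinds == {'COL'} else 'Unidentified'
    if {'COL', 'ROT'} <= kinds:
        return 'AD'
    if {'COL', 'TOT'} <= kinds:
        return 'RE'
    return 'Unidentified'


grouped['errorreason'] = grouped['type'].apply(reason_for)

# look up the reason of each original row through its cuid
df['errorreason'] = df['cuid'].map(grouped.set_index('cuid')['errorreason'])
print(df)
```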

Trumancarrieleanna Answered on July 16, 2020.
