How to traverse lists containing a dictionary?

Question

Home

How to traverse lists containing a dictionary?

0

I am trying to traverse JSON data brought into a Dataframe.

Here is the code used to bring the data in:

df = json_normalize(data['PatentBulkData'])

Each series of the Dataframe is a list. Each list contains a list of dictionaries as represented below.

For example, here is the list of dictionaries returned when I enter df['prosecutionHistoryDataBag.prosecutionHistoryData'][i]:

[{'eventCode': 'PG-ISSUE',   'eventDate': '2020-04-23',   'eventDescriptionText': 'PG-Pub Issue Notification'},  {'eventCode': 'RQPR',   'eventDate': '2020-01-02',   'eventDescriptionText': 'Request for Foreign Priority (Priority Papers May Be Included)'},  {'eventCode': 'M844',   'eventDate': '2020-01-03',   'eventDescriptionText': 'Information Disclosure Statement (IDS) Filed'},  {'eventCode': 'M844',   'eventDate': '2020-01-02',   'eventDescriptionText': 'Information Disclosure Statement (IDS) Filed'},  {'eventCode': 'COMP',   'eventDate': '2020-02-04',   'eventDescriptionText': 'Application Is Now Complete'}]

Then, df['prosecutionHistoryDataBag.prosecutionHistoryData'][i][j] would return the dictionary:

{'eventCode': 'PG-ISSUE',  'eventDate': '2020-04-23',  'eventDescriptionText': 'PG-Pub Issue Notification'}

I would like to iterate through each entry in the df['prosecutionHistoryDataBag.prosecutionHistoryData'] to identify rows containing a specific string in 'eventDescriptionText'.

In the above example df['prosecutionHistoryDataBag.prosecutionHistoryData'] is a Series, df['prosecutionHistoryDataBag.prosecutionHistoryData'][i] is a list, and ['prosecutionHistoryDataBag.prosecutionHistoryData'][i][j] is a dictionary.

I would like to initially iterate through the list – and for each list iterate through the dictionary to see if ‘eventDescriptionText’ contains a specific string.

Thanks!

Clauderobbiemarjorie Asked on July 16, 2020 in Python.

Share
Comment(0)

Add Comment

2 Answer(s)

Votes
Oldest

0

Try using the below code.

for lst in df['prosecutionHistoryDataBag.prosecutionHistoryData']:     for I in lst:         if I.get("eventDescriptionText").find(your_string) != -1:             # do something             pass

Stephenjuliesophia Answered on July 16, 2020.

Share
Comment(0)

Add Comment

0

If I understand your question correctly then

df['prosecutionHistoryDataBag.prosecutionHistoryData']

is, in fact, a list whose elements are lists of dictionaries. See also my comment above. If that is the case, then the boring way is:

lst = df['prosecutionHistoryDataBag.prosecutionHistoryData'] for dicts in lst:     for d in dicts:         if d['eventDescriptionText'] == 'SOME TEXT YOU SEARCH FOR':             code = d['eventCode']             date = d['eventDate']             # Do something with code and date.

Now, you could flatten that list of lists and use a generator:

lst = df['prosecutionHistoryDataBag.prosecutionHistoryData'] for d in (d for dicts in lst for d in dicts):     if d['eventDescriptionText'] == 'SOME TEXT YOU SEARCH FOR':         code = d['eventCode']         date = d['eventDate']         # Do something with code and date.

Next, squeeze the test into the lists-flattening-generator as well to make the code a bit less readable:

lst = df['prosecutionHistoryDataBag.prosecutionHistoryData'] for code, date in ((d['eventCode'], d['eventDate']) for dicts in lst for d in dicts if d['eventDescriptionText'] == 'SOME TEXT YOU SEARCH FOR'):     # Do something with code and date.

The filter() function doesn’t help much with readability here

for code, date in ((d['eventCode'], d['eventDate']) for d in filter(lambda d: d['eventDescriptionText'] == 'SOME TEXT YOU SEARCH FOR', (d for dicts in lst for d in dicts))):     # Do something with code and date.

but other itertools or more-itertools may be of use (e.g. the flatten() function).

Erniekennithjosefa Answered on July 16, 2020.

Share
Comment(0)

Add Comment

Your Answer

Answer 1

BuddyPress is a plugin for WordPress that enables you to create a social network or community website. It has all the...

Answer 2

I value you getting some margin to help me with this task. Without you, no part of this would have...

Answer 3

Try to define a Cohesive class, until and unless the methods are written relevant to the class and it defines...

Answer 4

Try to add exportAllData: true, as an other option, hope it helps :)

Answer 5

DataSet can read an XML, infer schema and create a tabular representation that's easy to manipulate: DataSet ip1 = new...

Answer 6

I created a class and used Xml Linq : using System; using System.Collections.Generic; using System.Linq; using System.Text; using System.Xml; using...

Answer 7

XDocument first = XDocument.Load(args[0]); XDocument second = XDocument.Load(args[1]); var result = new XElement( "ipaddresses", first.Root.Elements("ip") .Zip(second.Root.Elements("ip"), (f, s) => {...

Answer 8

Following your code for the header row, you could achieve this by an <xsl:apply-templates select="/report/order_actions/order_action[order_id = current()/order_id]" /> As well...

Answer 9

BuddyPress is a plugin for WordPress that enables you to create a social network or community website. It has all the...

Answer 10

I value you getting some margin to help me with this task. Without you, no part of this would have...

Answer 11

Try to define a Cohesive class, until and unless the methods are written relevant to the class and it defines...

Answer 12

Try to add exportAllData: true, as an other option, hope it helps :)

Answer 13

DataSet can read an XML, infer schema and create a tabular representation that's easy to manipulate: DataSet ip1 = new...

Answer 14

I created a class and used Xml Linq : using System; using System.Collections.Generic; using System.Linq; using System.Text; using System.Xml; using...

Answer 15

XDocument first = XDocument.Load(args[0]); XDocument second = XDocument.Load(args[1]); var result = new XElement( "ipaddresses", first.Root.Elements("ip") .Zip(second.Root.Elements("ip"), (f, s) => {...

Answer 16

Following your code for the header row, you could achieve this by an <xsl:apply-templates select="/report/order_actions/order_action[order_id = current()/order_id]" /> As well...

LATEST ANSWERS

How to traverse lists containing a dictionary?

Your Answer

TOP USERS

HOT QUESTIONS

LATEST ANSWERS

How to traverse lists containing a dictionary?

Your Answer

Tags Widget

TOP USERS

HOT QUESTIONS