Struggling to understand Google Sheets IMPORTXML and Xpath

Question

Home

Struggling to understand Google Sheets IMPORTXML and Xpath

0

I’m really looking for an education here on how to understand this. I’ve posted before and someone helped my specific scenario, but I’m hoping someone can help me with the "why" along with an answer.

I’m trying to import stock information off of www.barchart.com. This time, I’m trying to pull analyst ratings from this page: https://www.barchart.com/stocks/quotes/W/analyst-ratings

Inserting screenshot of what I’m trying to gather…www.barchart.com analyst ratings Within this screenshot, I’d like to build two queries: (1) that grabs the rating (hold, buy, etc.) and (2) that grabs the number of analysts that contribute to that average rating.

I’ve tried this:

=IMPORTXML("https://www.barchart.com/stocks/quotes/W/analyst-ratings","//div[@class='block__colored-header rating']")

That gives me a "#N/A Imported Content is Empty" error in Google Sheets.

I’ve tried this:

=IMPORTXML("https://www.barchart.com/stocks/quotes/W/analyst-ratings","/html/body/div[2]/div/div[2]/div[2]/div/div[2]/div/div/div/div[3]/div[2]/div[1]/div[1]/div/div/div[4]/div[1]/div[2]")

That also gives me a "#N/A Imported Content is Empty" error in Google Sheets.

Other wonderful users here on stackoverflow swoop in and magically give me the answer (other examples were other websites), but then I can’t seem to reverse engineer the knowledge to apply it to other scenarios that I want to accomplish.

So, I’d love to get help with this problem. But I would really love it if someone could help me UNDERSTAND why it is the answer and maybe point me to some resources I can use to educate myself and become self-sufficient.

Thanks in advance!!

Judsonheribertoeffie Asked on July 17, 2020 in XML.

Share
Comment(0)

Add Comment

2 Answer(s)

Votes
Oldest

0

The page that you are trying to scrap has blocked that. By using //*/text() it returns

--+---------------------------------------------------------------------------+   |                                   A                                       | --+---------------------------------------------------------------------------+ 1 | https://www.barchart.com/ondemand                                         | --+---------------------------------------------------------------------------+ 2 | Interested in API access? Barchart offers data through                    |  --+---------------------------------------------------------------------------+ 3 | Barchart OnDemand                                                         | --+---------------------------------------------------------------------------+ 4 | ". Barchart OnDemand provides both premium market data delivery through a |   | collection of web services APIs (JSON, XML and CSV formats) and a limited |   | free service."                                                            | --+---------------------------------------------------------------------------+

IMPORTXML

can only get data from the source code
can only grab nodes that are from HTML files that are well formed according to the XML rules or at least the whole path is well formed according those rules.
It uses Google servers and an user-agent that could be easily identified by web pages to block it.

Tip

Don’t use Chrome developers tools to grab the xPath from the Elements tab because it returns the xPath of the DOM after the web page was parsed (the original DOM might was modified dynamically by JavaScript, by the Chrome web page parsing engine and/or installed and enabled Chrome extensions.

Frankiejerryangelita Answered on July 17, 2020.

Share
Comment(0)

Add Comment

0

Once you’re done understanding what’s going wrong, you can fetch the data with IMPORTFROMWEB addon with JS activated.

Frankiejerryangelita Answered on July 17, 2020.

Share
Comment(0)

Add Comment

Your Answer

Answer 1

BuddyPress is a plugin for WordPress that enables you to create a social network or community website. It has all the...

Answer 2

I value you getting some margin to help me with this task. Without you, no part of this would have...

Answer 3

Try to define a Cohesive class, until and unless the methods are written relevant to the class and it defines...

Answer 4

Try to add exportAllData: true, as an other option, hope it helps :)

Answer 5

DataSet can read an XML, infer schema and create a tabular representation that's easy to manipulate: DataSet ip1 = new...

Answer 6

I created a class and used Xml Linq : using System; using System.Collections.Generic; using System.Linq; using System.Text; using System.Xml; using...

Answer 7

XDocument first = XDocument.Load(args[0]); XDocument second = XDocument.Load(args[1]); var result = new XElement( "ipaddresses", first.Root.Elements("ip") .Zip(second.Root.Elements("ip"), (f, s) => {...

Answer 8

Following your code for the header row, you could achieve this by an <xsl:apply-templates select="/report/order_actions/order_action[order_id = current()/order_id]" /> As well...

Answer 9

BuddyPress is a plugin for WordPress that enables you to create a social network or community website. It has all the...

Answer 10

I value you getting some margin to help me with this task. Without you, no part of this would have...

Answer 11

Try to define a Cohesive class, until and unless the methods are written relevant to the class and it defines...

Answer 12

Try to add exportAllData: true, as an other option, hope it helps :)

Answer 13

DataSet can read an XML, infer schema and create a tabular representation that's easy to manipulate: DataSet ip1 = new...

Answer 14

I created a class and used Xml Linq : using System; using System.Collections.Generic; using System.Linq; using System.Text; using System.Xml; using...

Answer 15

XDocument first = XDocument.Load(args[0]); XDocument second = XDocument.Load(args[1]); var result = new XElement( "ipaddresses", first.Root.Elements("ip") .Zip(second.Root.Elements("ip"), (f, s) => {...

Answer 16

Following your code for the header row, you could achieve this by an <xsl:apply-templates select="/report/order_actions/order_action[order_id = current()/order_id]" /> As well...

LATEST ANSWERS

Struggling to understand Google Sheets IMPORTXML and Xpath

Your Answer

TOP USERS

HOT QUESTIONS

LATEST ANSWERS

Struggling to understand Google Sheets IMPORTXML and Xpath

Your Answer

Tags Widget

TOP USERS

HOT QUESTIONS