Selenium and instagram: count unread messages

Question

0.00/5 (No votes)

See more:

, +

Good afternoon,
I am trying to scrape the overall volume of unread messages on my Instagram profile and I am using Selenium through Python to access it. I have managed to reach my mailbox and I have 5 unread messages, signified with the classic 'blue' dot next to them.
The issue I am facing is that BeautifulSoup is not reading the respective div and classes to count the number of unread messages.

counter = 0
#count messages
soup = BeautifulSoup(browser.page_source, features='html.parser')
new_message = soup.find_all(lambda tag: tag.name=="div" and tag.get("class") == "Igw0E   rBNOH          YBx95   ybXk5    _4EzTm                      soMvl")
for i in new_message:
    counter += 1
print('Unread messages: ', counter)

The class, as shown through the console is as follows. However, something tells me that Instagram's based on JS and this is why I cannot count the divs. Any ideas?

<pre><div class="                     Igw0E   rBNOH          YBx95   ybXk5    _4EzTm                      soMvl                                                                                        "><div class=" _41V_T   Sapc9                 Igw0E     IwRSH      eGOV_         _4EzTm                                                                                                              " style="height: 8px; width: 8px;"></div></div>

What I have tried:

I have tried numerous variations of new_message, such as:

new_message = soup.find_all("div", {"class" : "Igw0E   rBNOH          YBx95   ybXk5    _4EzTm                      soMvl"})

new_message = soup.find_all("div", {"class" : "_41V_T   Sapc9                 Igw0E     IwRSH      eGOV_         _4EzTm"})

and by its style, but to no avail.

new_message = soup.find_all("div",{"style" : "height: 8px; width: 8px;"})

Also tried checking whether it locates something to print and it does, but I am unsure as to why the counter is not working:

browser.implicitly_wait(10)
new_message = soup.find_all(lambda tag: tag.name=="div" and tag.get("class") == "_4EzTm")
for message in new_message:
    counter =+ 1
    print('Unread messages: ', counter)
if new_message is not None:
    print("Found")
else:
    print("Failed")

Posted 31-Jan-21 5:02am

Giorgio Anagio

Updated 31-Jan-21 7:56am

v3

Add a Solution

Comments

Dave Kreskowiak 1-Feb-21 0:49am

Possibly because the data is filled in using javascript. You'll have to use Seleniums JavaScriptExecutor to fill in the data.

JavaScriptExecutor in Selenium WebDriver with Example[^]

Giorgio Anagio 1-Feb-21 9:30am

Could you please elaborate on how I could incorporate that to my code?
I have tried finding the element by xpath, but my knowledge is kind of limiting me from seeing this work at the moment.

Dave Kreskowiak 1-Feb-21 10:15am

Nope. I already gave you a link that does that very thing, with examples, and I have no use for Selenium myself.

Add your solution here

Treat my content as plain text, not as HTML

Preview 0

…

Existing Members

Sign in to your account

...or Join us

Download, Vote, Comment, Publish.

Your Email
Password
Forgot your password?

Your Email
This email is in use. Do you need your password?
Optional Password

I have read and agree to the Terms of Service and Privacy Policy
Please subscribe me to the CodeProject newsletters

When answering a question please:

Read the question carefully.
Understand that English isn't everyone's first language so be lenient of bad spelling and grammar.
If a question is poorly phrased then either ask for clarification, ignore it, or edit the question and fix the problem. Insults are not welcome.
Don't tell someone to read the manual. Chances are they have and don't get it. Provide an answer or move on to the next question.

Let's work to help developers, not make them feel stupid.

This content, along with any associated source code and files, is licensed under The Code Project Open License (CPOL)