I made a Coursera Downloader using Selenium and Python.




View Reddit by Broke-Code-MonkeyView Source

Categories: PYTHON

16 Comments

Broke-Code-Monkey · August 5, 2020 at 7:29 am

So, there used to be a python module called coursera-dl that could download entire courses from Coursera. But a few weeks back, it broke. So I decided to make a downloader myself, using Selenium.

B4li5tic · August 5, 2020 at 7:57 am

Thats alot of dependencies

Mr_Lkn · August 5, 2020 at 8:22 am

is there a github? I am planning to make a Selenium project and this can be a good reference.

Legend_X_5 · August 5, 2020 at 8:22 am

Selenium is not effective solution. It uses a lot of ram and work slow. Better to use requests module. You need to log in in your account and copy cookies and put it in request. Congratulations, now you are log in. Download something by link is easy

Umroayyar · August 5, 2020 at 8:27 am

Good work. I checked coursera-dl is working.

lestrenched · August 5, 2020 at 8:27 am

Wonderful project! Do you have a repo anywhere? Also, if you’re using it for yourself, you could just put your username and password in an environment variable.

Are you rotating proxies and UserAgents? [This is the robots.txt file of Coursera.](https://www.coursera.org/robots.txt) You should rotate these, if not for anything but peace of mind.

From the video, it seems you have created a lot of files, each with a section of the code. I understand that you want to compartmentalise, but perhaps a class would be a better idea? Well, I’m just taking a guess, I haven’t actually looked at your code yet.

Cheers!

Demonix_Hermit · August 5, 2020 at 8:29 am

Nice job dude ! Though I believe that maybe you could have made it all command line which might be faster .Then again , I am not sure about that.

hellfiniter · August 5, 2020 at 8:34 am

i dont feel like this is good usecase with all those captcha images and stuff …i think people are better off using browser and just download it at the end.

Nevertheless, it looks cool and i myself feel like trying some “python automated clicks”. Thanks for inspiration

cpt_alfaromeo · August 5, 2020 at 8:41 am

Sauce?

Acurus_Cow · August 5, 2020 at 9:04 am

test4

The best module name.

kaash1mora · August 5, 2020 at 9:28 am

Can you share the github link?

HYOUG · August 5, 2020 at 9:41 am

How did u make the countdown ?

vjb_reddit_scrap · August 5, 2020 at 9:43 am

Please share the code.

sysconfig · August 5, 2020 at 10:03 am

I did something similar at work a long time ago. I supported an Oracle application that our finance/accounting department used. We had a user that would get this random session error in one part of the web app and then get kicked out. It happened at random and it was impossible to consistently reproduce. So I wrote a selenium test with the Firefox plugin, then converted the code to python. Upped the logging on the Oracle app and let it run on a second monitor and went on with my day. Worked pretty well, but holy shit trying to de-cypher all the damn HTML that app spewed out was a freaking nightmare

stha_sam2 · August 5, 2020 at 10:16 am

It’s fun watching people solve captcha.

s_arme · August 5, 2020 at 10:18 am

Github !?

Leave a Reply

Your email address will not be published. Required fields are marked *