Bablosoft

Has this website really built protection against scraping?

9 Posts 4 Posters 1607 Views
  • hungrym wrote:
    #1

    The website is: https://www.proxyrotator.com/free-proxy-list/

    After months of mastering BAS (and learning JavaScript, Node, regex, XPath, etc.), I became confident there was no website I couldn't take information from... until I tried the one above.

    If someone knows or has an idea how to take the proxies from it, please share.

    I'm really curious what solutions we have in BAS for it (ideally something other than the obvious one: taking a screenshot and running OCR over it).

  • A Former User replied to hungrym:
    #2

    What exactly is the problem? I looked at this site, and in my opinion it is simple: a little inconvenient to parse, but in general not a problem.

  • hungrym replied to Guest:
    #3

    @usertrue And how exactly? Did you check the source code? It's not possible to copy and paste the proxy (you can try it in a normal browser), so how would you write a BAS script to parse it? I mean, what XPath, CSS selector, or regex would you use to take the full proxy and add it to a list in BAS?

  • A Former User replied to hungrym:
    #4

    Yes, I looked at the page source; it has everything you need.

  • hungrym replied to Guest:
    #5

    @usertrue: OK. In that case, can you give an idea of how to parse the IP address and the port of the proxy? If you're busy, don't go into details; just give an overview of how you would do it.
    I really can't find a way, and you're saying it's simple...

  • A Former User wrote:
    #6

    @hungrym I wrote a JS snippet that runs in the browser and collects the data. But the port comes in the form of base64-encoded pictures; you will have to work that part out yourself. There are recognition modules for Node.js, but I don't have time for that.

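    {
    	// Collect one record per proxy row; rows that carry a class attribute are skipped.
    	const proxy = [];
    	const rows = Array.from(document.querySelectorAll('tbody tr:not([class])'));
    	rows.forEach(row => {
    		// Keep only the children of the IP cell that are the topmost element at
    		// their own top-left corner (presumably the visible ones), then drop the last.
    		const ip = Array.from(row.querySelectorAll('td:nth-of-type(2)>*')).filter(el => {
    			const xy = el.getBoundingClientRect();
    			return el === document.elementFromPoint(xy.x, xy.y);
    		}).map(el => el.textContent).slice(0, -1).join('');
    		// The port is an image: keep the third ';'-separated part of its src for later recognition.
    		const port = row.querySelector('td:nth-of-type(3)>img').src.split(';')[2];
    		const loc = row.querySelector('td:nth-of-type(4)').textContent.trim();
    		const type = row.querySelector('td:nth-of-type(6)').textContent;

    		proxy.push({ip, port, type, loc});
    	});
    	// Last expression of the block: its value is what the browser console reports.
    	JSON.stringify(proxy);
    }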
    

    0_1565013145122_proxyrotator.xml
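
The IP filter in the snippet above relies on one idea: a child element counts only if it is the element the browser reports at its own top-left corner. Below is a minimal sketch of that check pulled out into a standalone function. The name `visibleText` and the `topmostAt` parameter are hypothetical (they are not from the post); `topmostAt` stands in for `document.elementFromPoint` so the logic can be tried outside a browser. Unlike the original snippet, it does not drop the last matching element.

```javascript
// Hypothetical helper, not part of the original post.
// nodes:     array of objects exposing getBoundingClientRect() and textContent
// topmostAt: function (x, y) -> node; in a browser, pass
//            (x, y) => document.elementFromPoint(x, y)
function visibleText(nodes, topmostAt) {
  return nodes
    .filter(node => {
      // A node is kept only if it is the topmost element at its own corner.
      const { x, y } = node.getBoundingClientRect();
      return node === topmostAt(x, y);
    })
    .map(node => node.textContent)
    .join('');
}
```

In a browser this could be called as `visibleText(Array.from(cell.children), (x, y) => document.elementFromPoint(x, y))`, where `cell` is the IP table cell. Elements scrolled out of the viewport would also fail the check, so the rows need to be on screen.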

  • hungrym replied to Guest:
    #7

    @usertrue Thank you so much! Very clever and elegant!
    Yes, for the port, the node module ng-ocr or the ocr.space API works for me.
    I'm definitely going to read your answers in the forum and learn from them! Thanks again!
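
Before a port image can go to any OCR tool, its base64 payload has to be turned back into image bytes. The sketch below shows one way to do that in Node.js, assuming the `src` is a standard data URI of the form `data:<mime>;base64,<payload>`; the exact format the site uses is not shown in the thread. The function name `decodeBase64DataUri` is hypothetical, and the OCR call itself is deliberately left out.

```javascript
// Hypothetical helper, not part of the original thread.
// Assumes a standard base64 data URI; returns null for anything else.
function decodeBase64DataUri(src) {
  const comma = src.indexOf(',');
  if (!src.startsWith('data:') || comma === -1) return null;

  // Everything between "data:" and the first comma, e.g. ['image/png', 'base64'].
  const meta = src.slice('data:'.length, comma).split(';');
  if (meta[meta.length - 1] !== 'base64') return null;

  return {
    mime: meta[0] || 'text/plain',
    // Buffer is a Node.js global; the payload follows the first comma.
    bytes: Buffer.from(src.slice(comma + 1), 'base64'),
  };
}
```

The returned `bytes` can be written to a file with `fs.writeFileSync` or handed to whichever recognition module or API is used for the port.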

  • jattdagar wrote:
    #8

    Can anyone help me get the port number automatically, or post a working project file here?

  • Mandygust wrote:
    #9

    This is bullshit, for real. Now there's a new service, ake. I tried it a month ago and think I will stay with it. There was another one, which is also OK, but these are just for parsing. You can find a promo on their forums. I have one, but I don't know if it works: XXCEHX. You can try it.
