I would like to scrape this website https://hdyc.neis-one.org/, which has a two-page login authentication

-1















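library('XML')
library('rvest')

# OpenStreetMap login page reached through the hdyc.neis-one.org OAuth redirect
login <- "https://www.openstreetmap.org/login?cookie_test=true&referer=%2Foauth%2Fauthorize%3Foauth_callback%3Dhttps%253A%252F%252Fhdyc.neis-one.org%252Fland.html%26oauth_token%3DDfkqaWSgSDkhReTb75sOaq22QO6f6txdxFk4qJR7"

# Start a session so that cookies persist across requests
pgsession <- html_session(login)

# The first form on the page is taken to be the login form
pgform <- html_form(pgsession)[[1]]

filled_form <- set_values(pgform, username="********", password="**********")
# Keep the session returned by submit_form(); it carries the login cookies
pgsession <- submit_form(pgsession, filled_form)

# Request the target page through the logged-in session rather than a fresh read_html(url)
html_object <- read_html(jump_to(pgsession, "https://hdyc.neis-one.org/?**********"))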
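The "two page" part of the login is easier to see once the login URL above is decoded. The following is a minimal Python sketch using only the standard library; the function name `describe_login_flow` and the dictionary keys are made up for illustration. It unpacks the nested `referer` parameter, which suggests the flow is: OpenStreetMap login form, then `/oauth/authorize`, then a callback to `hdyc.neis-one.org/land.html`. The `oauth_token` embedded in the URL looks like a per-request token, so a hard-coded copy will probably stop working after a while.

```python
from urllib.parse import parse_qs, urlsplit

# The login URL from the question, split across lines for readability.
LOGIN_URL = (
    "https://www.openstreetmap.org/login?cookie_test=true"
    "&referer=%2Foauth%2Fauthorize%3Foauth_callback%3Dhttps%253A%252F%252F"
    "hdyc.neis-one.org%252Fland.html%26oauth_token%3D"
    "DfkqaWSgSDkhReTb75sOaq22QO6f6txdxFk4qJR7"
)


def describe_login_flow(url):
    """Decode the nested redirect chain hidden in the login URL."""
    # First level: query string of the login page itself.
    outer = parse_qs(urlsplit(url).query)
    referer = outer["referer"][0]  # where the site sends you after logging in

    # Second level: the referer is itself a path with a query string.
    inner = parse_qs(urlsplit(referer).query)
    return {
        "login_host": urlsplit(url).netloc,
        "next_path": urlsplit(referer).path,
        "oauth_callback": inner["oauth_callback"][0],
        "oauth_token": inner["oauth_token"][0],
    }


if __name__ == "__main__":
    for key, value in describe_login_flow(LOGIN_URL).items():
        print(f"{key}: {value}")
```

If this reading is right, submitting the login form is only the first step: the session also has to pass through the authorize page before hdyc.neis-one.org will treat it as logged in.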










  • 1





    Please include your code

    – QHarr
    Mar 25 at 11:09

















python web-scraping login






edited Mar 25 at 18:32







Saranya Balu

















asked Mar 25 at 10:21









Saranya Balu







1 Answer
0














Try to Dockerize your Selenium code so that it can run wherever you want. Here are some things that might help.

  1. Create a Dockerfile just like this one.

  2. Add these lines of code when you initialize your driver.

    from selenium import webdriver

    # Flags commonly needed to run Chrome inside a container
    chrome_options = webdriver.ChromeOptions()
    chrome_options.add_argument('--no-sandbox')
    chrome_options.add_argument('--headless')
    chrome_options.add_argument('--disable-gpu')
    # options= replaces the deprecated chrome_options= keyword
    driver = webdriver.Chrome(options=chrome_options)
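To connect the driver setup above to the login problem in the question, here is a rough sketch of how a headless Selenium session might fill in the login form and then open the target page. It is not a tested solution for this site: the helper names `make_driver` and `login_and_fetch` are hypothetical, the field names `username` and `password` are taken from the question's form code, and the OAuth authorize page may need an extra click or an explicit wait that is not shown here. Selenium is imported inside the functions so the module can be loaded without it installed.

```python
from typing import List

# The same Chrome flags as in the answer, kept in one place.
CHROME_ARGS: List[str] = ["--no-sandbox", "--headless", "--disable-gpu"]


def make_driver():
    """Create a headless Chrome driver (requires selenium and Chrome)."""
    from selenium import webdriver  # imported lazily on purpose

    options = webdriver.ChromeOptions()
    for arg in CHROME_ARGS:
        options.add_argument(arg)
    return webdriver.Chrome(options=options)


def login_and_fetch(login_url, target_url, username, password):
    """Log in through the form at login_url, then return target_url's HTML.

    Assumes the form has inputs named "username" and "password",
    as in the question's code.
    """
    from selenium.webdriver.common.by import By

    driver = make_driver()
    try:
        driver.get(login_url)
        driver.find_element(By.NAME, "username").send_keys(username)
        password_box = driver.find_element(By.NAME, "password")
        password_box.send_keys(password)
        password_box.submit()  # submits the enclosing form

        # The browser keeps its cookies, so the next page is requested
        # with the logged-in session.
        driver.get(target_url)
        return driver.page_source
    finally:
        driver.quit()
```

Because the browser holds the cookies itself, there is no separate session object to pass around; the returned HTML can then be handed to whatever parser you prefer.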






answered Mar 25 at 12:46

ashutosh singh




























