Is there a way or tool to get all the links of a website? Just the links; I'm not looking to create a local copy or download the website. Example: links to all questions posted on Super User. Platform: Windows 7, Ubuntu 14.04.

Renuka

1 Answer

Sorry for keeping you waiting. I have uploaded my program here.

The program is still at a very early stage, so most features don't work yet, but it does grab all links to other pages on the website.

It needs Java to run. You should be able to double-click the file, and a UI should load. In the SearchW box (in the GUI), type the website address, e.g. http://google.com or http://bbc.co.uk

Then you can copy and paste all the links as they are printed. (I still need to implement an export feature, but you'll be able to copy the links for the moment.)

Let me know if you have any issues! If you like it, I will (once it's in a decent state) post a link to my repo, where you'll be able to download newer versions.
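
As a rough illustration of what grabbing the links on a single page involves, here is a minimal Python sketch. It is not the program's actual code (the program is a Java application whose source isn't shown here); the names `LinkCollector` and `extract_links` are made up for this example, and it assumes you already have the page's HTML as a string.

```python
from html.parser import HTMLParser
from urllib.parse import urljoin


class LinkCollector(HTMLParser):
    """Collects the href of every <a> tag, resolved against a base URL."""

    def __init__(self, base_url):
        super().__init__()
        self.base_url = base_url
        self.links = []

    def handle_starttag(self, tag, attrs):
        if tag != "a":
            return
        for name, value in attrs:
            if name == "href" and value:
                # Turn relative links such as "/questions/1" into absolute URLs.
                self.links.append(urljoin(self.base_url, value))


def extract_links(html, base_url):
    """Return every link found in one page's HTML (hypothetical helper)."""
    collector = LinkCollector(base_url)
    collector.feed(html)
    return collector.links
```

The HTML could come from something like `urllib.request.urlopen(url).read().decode()`. Links that a page builds with JavaScript after loading won't show up with this approach.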

  • Hello. I tried it first on filehippo.com. It gave me this output, which looks OK (the site isn't that big): http://pastebin.com/CfSe4RgD. Then I tried it on a bigger site, 9gag.com. It gave an output of only 61 lines: http://pastebin.com/VE30u8DE. Filehippo was 208 lines. That's impossible; 9gag has millions of posts. – Renuka Oct 27 '14 at 13:02
  • BTW, thank you. Keep updating it if you can. I will keep testing it for you. :) – Renuka Oct 27 '14 at 13:12
  • @Renuka Sorry, I should have mentioned: the parser only scans one level, i.e. it grabs all links from the home page and then checks each page for an email address (I know that's not of interest to you). If you want it to grab literally every single link on a website, I will have to change a bit of code. The only problem is that if a link (say, on the home page) points to another website, it will also start grabbing links from that website (it doesn't know whether a link belongs to the current website or a different one). But I'll see what I can do. – benscabbia Oct 27 '14 at 13:13
  • @Renuka You're most welcome :). I won't be able to work on it for a few days, but now that I have someone waiting for updated versions, I will do my best to keep developing it ASAP! – benscabbia Oct 27 '14 at 13:15
  • No problem :). BTW, I just double-clicked the jar file and it gave me a UI. I didn't go to cmd to type the command you mentioned. – Renuka Oct 27 '14 at 13:20
  • @Renuka I was hoping it would run directly (they weren't actually instructions, but I have reworded that now). It can be a real pain to get it to run through the console (it would probably require configuring system variables, etc.). – benscabbia Oct 27 '14 at 13:25
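
The problem raised in the comments (a deeper crawl wandering off to other websites) is commonly handled by comparing host names before following a link. Below is a minimal Python sketch of that idea, not the parser's actual code. `same_site` and `crawl` are hypothetical names, `get_links` stands in for whatever function returns the absolute links on a page, and treating "same host name" as "same website" is an assumption (it would count `www.example.com` and `example.com` as different sites).

```python
from collections import deque
from urllib.parse import urlparse


def same_site(url, start_url):
    """True when both URLs share a host name (assumed meaning of 'same website')."""
    return urlparse(url).hostname == urlparse(start_url).hostname


def crawl(start_url, get_links, max_pages=100):
    """Breadth-first crawl that never follows links leaving the start host.

    get_links is a hypothetical callable: given a URL, it returns the
    absolute URLs linked from that page.
    """
    seen = {start_url}
    queue = deque([start_url])
    visited = 0
    # max_pages caps the number of pages fetched so a huge site can't run forever.
    while queue and visited < max_pages:
        page = queue.popleft()
        visited += 1
        for link in get_links(page):
            # Skip links already recorded and links pointing at other hosts.
            if link not in seen and same_site(link, start_url):
                seen.add(link)
                queue.append(link)
    return sorted(seen)
```

Because every internal link is queued and visited, this goes past the home page, which is why a one-level scan of a large site returns so few links by comparison. On a site with millions of pages, `max_pages` would need raising, and a polite crawler would also pause between requests.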