So I am trying to learn SEO and I am honestly confused and have following 8 questions.
Do I tell a bot not to visit a certain link through
X-Robots-Tagor throughrobot meta tagorrobots.txt?Is it ok to include all 3 (robots.txt, robot meta tag, and X-Robots-Tag header) or I should always only provide 1?
Do I get penalized if I show same info in
X-Robots-Tagand inrobotsmeta tag androbots.txt?Let's say for
/test1myrobots.txtsaysDisallowbut myrobots meta tagsaysindex, followand myX-Robots-Tagsaysindex, nofollow, noarchive. Do I get penalized if those values are different?Let's say for
/test1myrobots.txtsaysDisallowbut myrobotsmeta tag saysindex, followand myX-Robots-Tagsaysindex,nofollow,noarchive. Which rule will be followed by the bot? What is the importance here?Let's say my
robots.txthas a rule sayingDisallow: /andAllow: /link_one/link_twoand myX-Robots-Tagandrobot meta tagfor every link except/link_one/link_twosaysnofollow,noindex,noarchive. From what I understand bot will never get to/link_one/link_twosince I prevented it from crawling at root level. Now if I provide asitemap.xmlin therobots.txtthat has/link_one/link_twothere, will it actually end up being crawled?Will bot crawl into the directory provided by
sitemap.(xml/txt)even though it is not accessible through home page or any pages following the home page?And overall I would appreciate some clarification on what is the difference between
robots.txt,X-Robots-Tagandrobot meta tagandsitemap.(xml/txt). To me they seem like they do the exact same thing.
I already saw that there are some questions that answer a small subset of what I asked. But I want the whole big explanation.