Hacker Newsnew | past | comments | ask | show | jobs | submitlogin

you'd think it would at least come up in the internet archive if not anywhere else.



That's unfortunate. But understandable in a way.

    # robots.txt web.archive.org 2013-10-02

    User-agent: *
    Disallow: /

    User-agent: ia_archiver
    Allow: /


touche, I don't suppose the old non commercial websites mentioned in the article suffer the same problem though right? Maybe an accidental robots.txt file was mistakenly left around?




Guidelines | FAQ | Lists | API | Security | Legal | Apply to YC | Contact

Search: