CS174
Chris Pollett
May 11, 2016
Sitemap: url
Download the robots.txt file for flikr.com. What are the different user-agents that are specified in it? What are the different directives you see? For each find out what it does. Download one of the sitemap xml files and see what tags it has try to find out what each does. Write up this info and post it to the May 11 Discussion Thread.
To find out about humans on flikr, download the humans.txt file.
<?xml version="1.0"?> <rss version="2.0" xmlns:xlink="http://www.w3.org/1999/xlink"> <channel> <title>Computer Science Department, San Jose State University - Old News Items </title> <link>http://www.cs.sjsu.edu/</link> <description>Archived of selected news items from the Department of Computer Science at SJSU.</description> <language>en-us</language> <lastBuildDate>Thursday, 16 February 2006</lastBuildDate> <item> <title>Job opportunity for CS students or graduates.</title> <pubDate>Monday, 8 May 2006</pubDate> <description> JOB OPPORTUNITY! PageBites, a Palo Alto startup, is hiring. If you are interested </description> </item> <!-- could add more items --> </channel> </rss>