Due March 11, 2008 by midnight Total points: 10 ------------------------------------------------------------- Each group of two students should create two zip archives with the training data for the Bayesian classifier. -- One archive should contain 20 spam messages (please, don't include headers and footers) saved as text files. -- The second archive should contain 20 non-spam messages (also without headers/footers) saved as text files. Please, name your files as follows spamXX.txt and nonspamXX.txt, where XX are the numbers in the range specified below. I listed DH450 user names of one group member and ranges for their files. For example, a group whose member has username k2002 will name their files: spam01.txt, spam02.txt, ...., spam20.txt and nonspam01.txt, etc.. These files will be used by the whole class for training the Bayesian classifier. User name Range Status ============================================== k2002 01 .. 20 done k2004 21 .. 40 k2005 41 .. 60 done k2008 61 .. 80 done k2012 81 .. 100 done k2032 101 .. 120 k2016 121 .. 140 done k2019 141 .. 160 done k2009 161 .. 180 done k2025 181 .. 200 done k2027 201 .. 230 done ==================================