============================================================================= List of Transactions UCI Repository of Machine Learning Databases Since October 24, 1989 David W. Aha (Site Librarian) ============================================================================= Date: Inquirer: Action: ===== ================== ================================================== 10/24 Denis Howlett Responded to request concerning lost, mailed tape. Created tape for him, waiting for his mail address. 10/25 Denis Howlett I mailed his tape to him (arrived in mailbox today) 11/5 Marek Druzdzel Request for Bratko's databases: I suggested getting them from Hans Tallis at the same site 11/9 Steven Hanson I was notified about his robotics database by JHG 11/10 Les Valiant Info Request: replied immediately 11/10 Steven Hanson Sending out request/info for his database He replied saying he'll send something asap 11/10 Dave Lewis Finally, sent us the huge IR collection. This should be a popular one. 11/10 Mike de la Maza Having problems ftp'g compressed files. Has he been using the bin option of ftp? 11/14 Mike Hudak Responded to his generic info request. 11/14 Yoram Reich Sent the breast cancer database to him. 11/20 David Lewis Received a tar tape and paper -- notified him 11/30 Diana Gordon Replied "no" concerning structured dbase inquiry 12/7 Terrence Fogarty Replied with HELLO file 12/8 Keith Hacke Replied, we don't have numeric vals for the H-R db 12/8 Keith Hacke I Suggested using cpu, echocardiogram, & heart disease databases for his needs 12/11 Jim Corter I Mailed 7 databases to him at his request 12/11 Kevin Thompson Mailed the economic sanctions database 12/15 Wei-Min Shen Asked for 4 dbs, I mentioned the restriction on them 1/9 Terence Fogarty Received 1/4 inch tape and $10. Loaded tape and will mail out tomorrow morning. 1/17 Ed Wisniewski Sent overview file 1/19 Peter Clark The "|" in Quinlan's database's means "ignore the rest of this line" 1/19 Harish Ragayan Was able to ftp the databases to his location 1/22 George Drastal Sent table of contents and Pazzani's database. He requested databases with domain theories. 1/22 Me Decided to add my small EBL databases here 1/23 Terence Fogarty Hasn't yet received his tape. I told him to wait longer and that I'll send the 3 he asked for now Not sure he's in an academic institution, so I'm holding onto the breast cancer db until he confirms 1/23 Brad Allen Replied to request for info 1/25 Michel Manago Doesn't have any databases readily available, but will check on two others (with domain theories) 1/25 Chien-Chung Chan Information request 1/31 Me The original copy of cleveland.dat is corrupted 2/1 Hamid Berenji Sent outline info and 1 database (Cpu) 2/2 David Tcheng Having problems copying compressed files. I mailed the annealing database and asked which others he would like uncompressed. 2/2 David Tcheng Asked to uncompress all files -- I did as asked and have also updated the HELLO (overview) file 2/4 David Tcheng I mailed the Ljubljana files to him. Also, I corrected errors in their citation requirements (all 3 read "this lymphography db"). 2/7 Me I moved the king-rook versus king-pawn database from the undocumented to the chess-end-games sub-directory. Updated HELLO accordingly. 2/8 Me Added O'Rorke's database of theorems from Principia Mathematica to the undocumented sub-directory. 2/8 Larry Hall Mailed the outline file (HELLO) 2/8 Paul O'Rorke There are still some errors in the principia db, so we'll leave those separate and unreadable 4 now. 2/9 H. Michael Chung Mike Pazzani asked me to snailmail a reply to his request for databases. Done. 2/14 Jeff Schlimmer Jeff passed along Graham's request for info. I Graham Clarke mailed it today and cc'd Jeff. 2/14 Wendy Sarrett Passed along information on access. 2/15 Terence Fogarty Reported that he received the tape, confirmed that his _is_ an academic institute; I'll send him the Ljubjana databases now 2/19 Graham Clarke Likes what he has...asked for info on others in the UK who have the complete set. 6 do. I sent him the compiled list and cc'd myself. 2/20 Bradley Richards Accessing request. I sent simple directions also. and Ray Mooney 2/22 John Anderson Requested breast cancer database. I sent that along with the general description. 2/26 Peter Clark Suggested I format all databases identically. I'll get to work on it now...all but the audiology, labor-negotiations, spectrometer, and university databases, which have unusual data formats. I have also not done this for undocumented databases. 2/26 Peter Clark Many suggestions for standardization. I implemented them later in the day. 3/3 John Gennari I've loaded his C file for creating "animals", each represented by a set of cylinders. This is one of the few databases we have that includes structured objects (i.e., instances with higher-order relations). See undocumented/gennari. 3/5 Michael de la Maza I'm sending him the 3 restricted databases now. 3/5 Glenn Silverstein Sent overview file. 3/6 Bill Simpson Noticed 3 problems. 1: a bad value for 97th diagnosis iin the cleveland data (ca was 9.0). No longer true (not sure why): I mailed him another copy. 2: lrs.data is corrupt! I'll check a backup copy if I can find one and otherwise ask for another copy. 3: there are several attributes with only 1 value (or its missing). We're not sure how to interpret this and I've asked him to check with the donator on this problem (R. Quinlan). 3/7 John Gennari Gave me copy of uncorrupted lrs.data file. I'm sending it to Simpson and will ask for confirmation that it looks okay. (Looks good in my brief scan.) I'll delete the corrupted copy at that time. 3/11 Erach Irani Sent overview listing. 3/12 Bob Reinke Asked him why Jude's copy of the soybean and chess end game differs from ours. I CC'd lots of folks who might know. 3/13 Powell Benedict Called -- I sent the lymphography database to him. 3/13 John Gennari John thinks it's deceiving that symbolic and Boolean attributes are coded using numeric values. I agree, but am concerned that there will now be multiple formats floating around. I'm also concerned that John's interpretation will not match Detrano's. For example, isn't the degree-of-disease attribute supposed to be numeric valued? John disagrees...I'll have to contact Detrano on this again. 3/13 Ray Bareiss Mailed the 26 test examples for the audiology data. 3/13 Ray Mooney Clarified why several different soybean datasets exist. I've included his explanation in the soybean databases directory. 3/13 Don Cohen I wrote a longish reply on what I mean by a database with a domain theory. He may have some. 3/15 Bill Simpson Found many problems with the heart disease, lymphography, and spectrometer databases. I've sent out questions to Cestnik and Stutz on the latter 2. I'll have to call Detrano on the final. 3/13 Powell Benedict Sending breast cancer and primary tumor databases. 3/18 Erach Irani I've no need to use compress the databases now, but will do so if needed in the future. Although uncompressed files take longer to transfer, many people have had problems transferring compressed files. 3/26 Powell Benedict Offerred to send Rendell's data generator after documenting it. Should be here in a few months. 3/26 Bill Simpson Asked for update. There are now 13 more databases, but I don't know which ones. Also, his requests I mailed to John Stutz and Bojan Cestnik are still missing. 4/5 Nick Flann About to send 5 domain theories and databases. 4/5 Mona Matar Sent 3 Ljubljana databases and iris. Asked about an ovine parsite (sp?) database she mentioned. 4/5 Mark Gluck Asked for info on the databases -- I've started off with the overview file and expect more communication. 4/5 Harish Kriplani Sent lymphography and breast cancer databases. 4/5 Bill Simpson Sent kr-vs-kp, echocardiogram, hayes-roth, the undocumented databases, and the small EBL d-theories 4/5 Mark Gluck Sent breast cancer and primary tumor databases, which might suffice as benchmarks. 4/6 Richard Forsyth Finally mailed the DOS databases to them. Michael Chung 4/11 Doug Fisher Sent overview file (an update). Stefanos Manganaris 4/12 Ming Tan Looking for a database with cost info. I suggested getting Nunez's gynecology database. 4/13 Mark Gluck Sent lymphography doc and database. Discussed the 70/30 split method. David W. Aha Site Librarian