[Thread Prev] | [Thread Next] | [Thread Index] | [Date Prev] | [Date Next] | [Date Index] |
On the Ecology and Evolution gopher archive on sunsite.unc.edu, I have put a small "synthetic" (that is, made-up) dataset representing a sample of pollen of variable morphology. The dataset is intended for training in multivariate statistics: the "pollen" can be sorted into groups that may represent discrete species. If you're interested in using the dataset yourself, please download it and let me know if it is useful to you. Also, if you have any similar datasets or other teaching exercises, and would like to share them with others, please let me know. Thanks! Here is the gopher link information: Type=1+ Name=Multivariate analysis of pollen grains Path=1/../.pub/academic/biology/ecology+evolution/teaching/pollen Host=sunsite.unc.edu Port=70 The dataset can be retrieved via anonymous FTP as well. Below, I have included an edited version of an actual FTP session to retrieve this dataset from the archive. Indented text is produced by the computer; my comments to you are in square brackets; the rest I typed to tell the computer what to do. Una Smith una.smith@yale.edu ---------------------------------------------------------------------- doliolum{una} 29: ftp sunsite.unc.edu Connected to sunsite.unc.edu. 220 calypso-2.oit.unc.edu FTP server (Version wu-2.4(30) ... Name (sunsite.unc.edu:una): anonymous 331 Guest login ok, send your complete e-mail address as password. Password: una.smith@yale.edu [I could not see this as I typed it] 230- WELCOME to UNC and SUN's anonymous ftp server 230- University of North Carolina 230- Office FOR Information Technology 230- SunSITE.unc.edu [I've deleted bits here] 230- 230- If you email to info@sunsite.unc.edu you will be sent help [...] 230- 230 Guest login ok, access restrictions apply. ftp> pwd [this means "print the working directory", or where am I now?] 257 "/" is current directory. ftp> cd pub/academic/biology/ecology+evolution [move to this (sub)directory] 250- 250-Welcome to the Ecology and Evolution community archive! The ... 250-here is on things of interest to research ecologists and evol... 250-biologists. 250- 250-Recent changes/additions as of December 1994: 250- 250-* bioguide/ "A Biologist's Guide to Internet Resources" has ... 250- directory now, to accomodate copies in various formats. 250- Current versions: 1.7 and 1.8a (December 1994 ... [...] 250- 250 CWD command successful. ftp> cd teaching/pollen [move to this subdirectory of the current directory] 250- 250-This dataset is synthetic. It was generated by David Coleman at 250-RCA Laboratories in Princeton, N.J. For convenience, we will 250-refer to it as the POLLEN DATA. The first three variables are the 250-lengths of geometric features observed sampled pollen grains - in 250-the x, y, and z dimensions: a "ridge" along x, a "nub" in the y 250-direction, and a "crack" in along the z dimension. The fourth 250-variable is pollen grain weight, and the fifth is density. 250- 250-There are 3848 observations, in random order (for people whose 250-software packages cannot handle this much data, it is recommended 250-that the data be sampled). The dataset is broken up into eight 250-pieces, POLLEN1.DAT - POLLEN8.DAT, each with 481 observations. 250-We will call the variables: 250- 250-1. RIDGE 250-2. NUB 250-3. CRACK 250-4. WEIGHT 250-5. DENSITY 250- 250-6. OBSERVATION NUMBER (for convenience) 250- 250-The data analyst is advised that there is more than one "feature" to 250-these data. Each feature can be observed through various graphical 250-techniques, but analytic methods, as well, can help "crack" the ... 250- 250- 250 CWD command successful. ftp> dir 200 PORT command successful. 150 Opening ASCII mode data connection for /bin/ls. total 205 drwxr-xr-x 3 90 25 512 Mar 30 1994 . drwxr-xr-x 6 90 25 512 Apr 1 1994 .. drwxr-xr-x 2 90 25 512 Mar 30 1994 .cap -rw-r--r-- 1 90 25 1035 Dec 5 1993 README -rw-r--r-- 1 90 25 25012 Dec 5 1993 pollen1.dat -rw-r--r-- 1 90 25 25012 Dec 5 1993 pollen2.dat -rw-r--r-- 1 90 25 25012 Dec 5 1993 pollen3.dat -rw-r--r-- 1 90 25 25012 Dec 5 1993 pollen4.dat -rw-r--r-- 1 90 25 25012 Dec 5 1993 pollen5.dat -rw-r--r-- 1 90 25 25012 Dec 5 1993 pollen6.dat -rw-r--r-- 1 90 25 25012 Dec 5 1993 pollen7.dat -rw-r--r-- 1 90 25 25012 Dec 5 1993 pollen8.dat 226 Transfer complete. 796 bytes received in 1.8 seconds (0.42 Kbytes/s) ftp> get README 200 PORT command successful. 150 Opening ASCII mode data connection for README (1035 bytes). 226 Transfer complete. local: README remote: README 1062 bytes received in 0.37 seconds (2.8 Kbytes/s) ftp> prompt Interactive mode off. ftp> mget *.dat 200 PORT command successful. 150 Opening ASCII mode data connection for pollen1.dat (25012 bytes). 226 Transfer complete. local: pollen1.dat remote: pollen1.dat 25493 bytes received in 14 seconds (1.7 Kbytes/s) 200 PORT command successful. 150 Opening ASCII mode data connection for pollen2.dat (25012 bytes). [and so on for all 8 files] ftp> quit 221 Goodbye. doliolum{una} 30: Okay? Note you don't need the .cap file; that's dressing for the gopher users only. The README file in each directory is printed by the FTP program when you enter the directory. That's where the text prefaced by "250-" comes from.
Partial index: