[Geysers] Time code

Lynn Stephens lstephens2006 at hotmail.com
Wed Jan 16 00:52:51 PST 2013




I was going to stay out of this discussion but now believe Will's use of my name in his post requires at least some response from me. I am opposed to adding a time or new code or "?" or "A" or assigning a time to entries imported from the electronic versions of the OFVC logbook into geysertimes.org.  The observer entered into the logbook the information the observer wanted to report.  I don't believe the information should be changed. I support with comments made by Jeff Cross and issues raised by MA Bellingham. From Will's comments, it appears some aspects of the OFVC logbooks don't fit into the geysertimes.org framework.  Perhaps the OFVC logbooks, especially those prior to the late 2000's shouldn't be merged into geysertimes.org.   Forcing the OFVC data to fit the geysertimes.org framework does not make sense, especially when the OFVC data is the majority, if not the the only, data available to input into geysertimes.org for years prior to the late 2000's.  (Maybe sometime in the future researchers will discover that some gazers who never entered data into either the handwritten logbooks or geysertimes.org have contributed their logbooks to a library thereby uncovering additional geyser data.  But I'm willing to bet those personal logbooks aren't going to match the geysertimes.org framework either.)
Will wrote "Now comes the questions about me applying this code to the old logs during  the transfer project but first I think it will help if I explain the current  processes and practices that Lynn has helped me put in place."   The only contribution I have made to the "processes and practices" Will has put in place has been to answer questions about specific entries in the electronic versions of the Old Faithful Visitor Center logbook.  The only reason I became involved in Will's efforts was to try to ensure that information did not get changed as it went from data base to the other, with the exception of situations where the OFVC logboook used an obsolete name or an "UNNG" that has a more recently applied name used by geysertimes.org.  I have been contacted at times by Will or his helpers asking for clarification of entries in the electronic versions of the OFVC logbook.   Most of those questions have involved obsolete geyser names, historical activity that occurred in the 1970s with which people were not familiar, etc.   Many of the questions have provided "teachable moments" in which I can provide a context within which the geyser activity occurred, tools used to collect the data, etc., although sometimes I simply have to say, "I don't know because by the time we started transcribing the logbooks, the people who could have answered the question were no longer around, so we entered just what was there."   Will made comments about difficulties he and his helpers seem to be having importing data from the OFVC logbooks into geysertimes.org and/or using data imported into geysertimes.org in Excel spreadsheets.   I am not going to address his comments other than to say there were reasons each database was set up the way it was.  What one of us sees as a "solution", the other sees as a "problem" and vice versa. Lynn Stephens  From: wolveslax65 at comcast.net
To: geysers at lists.wallawalla.edu
Date: Tue, 15 Jan 2013 10:33:15 -0700
Subject: [Geysers] Time code





Gazers,
 
The new code is necessary in my opinion because “0000?” can mean to things 
when only looking at the top level (in this case the top level is the date, time 
and codes only). By adding the “A” it will allow us once the search function is 
revamped in Geysertimes to easily select only the exact time eruptions and so 
forth in our data pulls. Also this addition would be added to the entry form 
just as another button in the row with the other codes and everyone will still 
be encouraged to put comments about the observation as we do now.
 
So lets take an example, lets say I did a data pull for a geyser and it had 
2000 entries and 200+ of them had “0000?”. I doubt many of us would want to read 
200+ entries to figure out whether it was an exact time or an overnight time. 
Then if it was an overnight time, what was the time range or the middle of the 
time range so I can manually go through the data I just pulled and change the 
0000? to the correct time frame.
 
Also by adding the “A” Geysertimes could then be programed to add a “~” to 
the interval number based on a simple has “A” or doesn’t have “A” algorithm. 
This is because the system will never be able to read and comprehend our 
comments (unless there is some easy magical way that I do not know about).
 
So in the end the final reason I want to add this is that “0000?” can mean 
two things at the top level, where as “0000A” can only mean one.
 
 
Now comes the questions about me applying this code to the old logs during 
the transfer project but first I think it will help if I explain the current 
processes and practices that Lynn has helped me put in place.
 
So first the process.
 
First I take the text file from the GOSA site and put it into Excel. This 
is the simple part.
Next comes the long part. When the text file is imported into Excel thing 
don’t always go into the right places. For example, two eruptions end up on the 
same line, data is not in the right column, etc.. . So the big task is moving 
everything around so it is in the respective column.
During this process I frequently find some weird entries. (Take a look 
sometime they are pretty confusing or funny) Whenever I hit one of these I flag 
it and send the entry to Lynn to get conformation or clarification on the entry. 
Sometimes a entry comes up where it is clear that the time was written in 12 
hour format and not in 24 hour. So I always check these with Lynn and if she 
agrees then it is changed BUT the original text is always listed with a note 
about the change the same way Lynn did when typing up the logs by using the 
“[comment]” notation.
Then comes the easy final step, the actual transfer. Once the log is 
perfect, the Excel file is saved as a csv and then uploaded to 
Geysertimes.
 
During this process it is made the top priority not to change entries. I 
also try and make that clear to my helpers as well. Almost nothing is left out 
currently  (only thing is some VR entries that have a non VR time also as 
making sure the VR entry is placed as the secondary is not possible right now.) 
this includes comments, UNNGs, notes, new geysers, etc..
 
 
Now for adding the code.
 
The reason for adding the “A” in the logs that will be transferred is the 
same as it will be for the future entries. That reason being clarity. In 
databases it is best to keep notation the same throughout the entire data. 
Currently with the notes not being visible in the data retrieve this sort of 
forces me to change the entry from a note to an eruption that is “0000?”. Again 
that can mean two things.
 
 
Now the question about why we need this now. My reason is simple, I don’t 
want to have to go through and clean the transfers of the logs. If I can make 
this clarification with the code “A” this first time through it will be much 
easier than having to go through everything again. Relooking through about 30-40 
years of logs with a minimum of 5000-6000 entries a year doesn’t sound too fun 
to me.
 
Now about filling the gaps. This code will not fill any data gaps, it will 
only clarify the current data that we already have. The only way we fill the 
data gaps is by people submitting missing data. If you are interested in doing 
that please contact me.
 
I hope this helps,
 
Will Boekel
 
 

_______________________________________________
Geysers mailing list
Geysers at lists.wallawalla.edu
 

 		 	   		  
-------------- next part --------------
An HTML attachment was scrubbed...
URL: </geyser-list/attachments/20130116/4411facb/attachment-0001.html>


More information about the Geysers mailing list