Trove – searching for old newspaper articles

The National Library has an excellent on-line site called Trove.  Their own description is best – it enables you to “Find and get over [249 million] Australian and online resources: books, images, historic newspapers, maps, music, archives and more”

Yesterday the helpful people at my local library showed me how to search the historic newspapers section. In fact a local paper, the Williamstown Chronicle (1856 – 1954), has had a large number of issues scanned, indexed and put up on Trove.

An important note is that Trove not only has the scanned articles (as ‘images’) but the computer has tried to ‘read’ the newspaper text and convert it to plain text! It does a great job, but doesn’t always get it right. You can volunteer to manually correct the text.

Here’s some quick steps on searching for articles that mention 25 Albert (street) in the Williamstown Chronicle :

  • Access Trove. From the main screen select Digitised Newspapers and More image

 

  • On the Find an Article screen. Select Advanced Search (under the green “Search articles” button)image

 

  • This is the key screen: Advanced Search – Digitised Newspapers and more.
    • In The phrase input area type in 25 Albert
    • Lower down on the page, there is the Newspaper Title & Location section. In this, scroll through the long list to find the Victoria section heading and keep going until you find Williamstown Chronicle. Select it  via the tick box beside it.

 

image

  • Click Search

 

The system will perform the search and (hopefully) show the results:

image

  • Click on any of the blue article titles (eg “Agent Sent for Trial”) to view the article; scanned original and ‘text’. I have selected another one; what appeared to be a party invitation from 1926:

image

Scroll down to find it: image

Notes:

  • The RHS is the original scanned image, the LHS is the “Electronically Translated Text” (including an option, as mentioned, to Fix this Text)
  • If you scroll down the scanned image, your (found) search text has a feint yellow/orange line under it, highlighting it.
  • There are options – top LHS – to Print the scanned document..or save as PDF, JPG image etc. Or buy it. You should read up on how you can legally use these scanned articles.
  • If you look carefully you’ll see the search for 25 Albert returned a reference to “..£25 Albert Edward …”.  It literally did as it was asked. I didn’t put street or st in the search as that seems to limit it.

The Age – doubly ambiguous weather summary

The top left hand corner of The Age website has a little Weather section, which you can personalise.

Here’s mine for just now:

If you glance at it, two things jump out at you:

  1. The term change. That’s a weather term, usually meaning cool change. Here it means change your city
  2. The word Snow . Besides Full Details. Sunny then Snow? Unfortunate layout as it’s actually Snow Report, with the 2nd word on the next line…

I’ve actually reported the first one to them already and they thanked me. Only spotted the second one today.

Emulating a Developer: Part 3 – Linux coding (“bashing away”)

Being the ongoing adventures in application development … by someone who really doesn’t develop code.

<Continues from Part 2>

Thus far I had ‘designed’ my simple program. Well, in my head at least. Then built a working PC (vbscript) version by my tried-and-true learn then code method. But it was slow and had annoying security warning popups. 

So I decided to start again and use the same ‘design’ but write it as a BASH shell script that runs directly on the Linux Server. This took a lot less time to do, but I still had to learn – actually re-learn – how BASH does things like input strings, break them up into individual values, add things up, loop around, create random values, merge strings etc. 

But it wasn’t that hard at all. A few hours later and here it is:

#!/bin/bash         
# creates a suitable rar command to break up large file(s) into 7, 14 or 21 smaller rar files
# pass in file sizes in MB and it will do the rest.
# example: crtRar 350 200 400
# results in:  sum is: 950 MB     using 14 parts…  rar a -v67m -m0 -R ~/temp/dio.rar
# just copy and paste and add your (3) file names at end of command

let mySum=0

# pass in 1 to n values. being file sizes in MB

for i in $*
do
let mySum=$mySum+$i
done
echo "sum is: "$mySum" MB"

# Generate simple 3 char random file name for .rar file
rarName=`head -c 500 /dev/urandom | tr -dc a-z | head -c 3; echo`

# Simple "if" rules, such if mySum (file size totals) le 700 (MB) then 7 parts, etc
if [ $mySum -le 700 ]; then
    let rarSize=$mySum/7 ; echo "using 7 parts…"
elif [ $mySum -le 1400 ]; then
    let rarSize=$mySum/14 ; echo "using 14 parts…"
else
    let rarSize=$mySum/21 ; echo "using 21 parts…"
fi
# finally build and show rar command
rarCmd="rar a -v"$rarSize"m -m0 -R ~/temp/"$rarName".rar "
echo $rarCmd

 

Even I can see it could do things smarter. But it’s Version 1, it works and so I’m happy. Plus I am still not a developer…

PS: Things I want to add in Version 2:

  • Pass the file or folder names per se and have the program use them to work out the rar part size
  • Then add the provided file/folder name to the (end of the) rar command

Emulating a Developer: Part 2 – PC coding (“what, no clipboard?!”)

Being the ongoing adventures in application development … by someone who really doesn’t develop code.

<The first part is, ironically, located in Part 1>

So my first cut, simple, crtRar utility has to do this:

  1. Call the program, passing the 1 or more file sizes in MB (eg “crtRar 600 100”)
  2. Break up the input ‘string’ (600 100) into individual values, i.e two in this case
  3. Add them up to give total filesize
  4. If this total filesize was 700 or less, I’ll break up into 7 equal parts. If between 700 and 1400 then 14 parts. If greater than 1400 then 21 parts
  5. Calculate the rar file part size as per 4. above
  6. Create a random, 2 letter file name
  7. Build the rar command using the output of 5 and 6 etc
  8. Put it up on the screen, ready to be copied and pasted…and used.

I decided to develop it on my Windows PC and opted for the ‘in built’ language called vbscript.  I don’t really know this language, but what better way to learn than dive in and code. Plus I could store the finished .vbs file on my Dropbox and have it automatically available on the other PCs.

So I literally created a new Test.vbs file in Notepad++ and thought, right:  Step 1 is “pass in the 1 or more file sizes…”  I wonder how vbscript does that?  Looked it up on Google.  Worked out how to write out results to the screen (so I could test things on the fly).

I got Step 1 coded, so then it was Step 2.  How do I break up that string into individual values.  Hello again friend Google.

Continue reading

Emulating a Developer: Part 1 – Learn as you type

Being the ongoing adventures in application development … by someone who really doesn’t develop code.

I never actually learnt how to write code. I can’t recall ever even vaguely doing it for a living. But ever since I can remember, I’ve dabbled and whipped up utility applications for my own use.

I was amazed to discover, years after leaving IBM, that one of my little utilities was still being used internally by some people.  Anyway, I had cause to whip one up just today. And the more things change, the more they stay the same.

I still learn as I type. That is, I don’t sit down and learn the language I’m writing in, then work out the design, then think it through some more, then actually start coding.

Nope. I get the Big Picture in my head, then dive in.  I’m picturing, in my head, a flow chart. Don’t write things down!   Then off we go entering the code for “crtRar

What crtRar had to do – the basics

My crtRar (“Create RAR”) program actually serves a valuable purpose.  I have some big files up there on a server and I want to get them down to the PC. It’s an FTP server; actually SFTP.  Say, the two files are:

  BigBuckBunny.avi  600 MB

  Test22.mpg 100 MB

The FTP server will only allow each download ‘stream’ to be about 200 kBps. So I can download the 2 files at the same time, but each one will only be at 200 kBps. That 200 kBps is about 1/7 of my total ADSL bandwidth. So it’s not efficient at whatever 2/7 as a percent is .

Both the PC and the FTP server (running Linux) support the RAR compression program. It’s a bit like zip or winzip, with one difference. It can easily break up 1 or more files into multiple files.  Can you see what’s coming next?

Take the two files (total of 700 MB) and create 7 different RAR files. Make them all the same size (700/7 = 100 MB) and then download all 7 at once, that is 7 FTP streams.  Will download much quicker than just the 2 original files. (Actually there’s also no point in compressing the data as both are already compressed, so we can save time by telling RAR not to pack the data.).  On the server this is a sample of the command:

  rar a -v100m -m0 -R ~/temp/zr.rar BigBuckBunny.avi  Test22.mpg

Won’t go into details, but it creates the multiple (7) x 100 MB rar files  zr.part01.rar, zr.part02.rar etc  (I chose the name “zr” at random, means nothing!  The command does a bit more, like catering for whole directories etc)

Continue reading

Locate32 and Dropbox – offline indexes of remote PCs

I regularly use Locate32 for my fast indexing – and searching – of PC files.  Plus I use Dropbox to store my shared files. So why not see if the two can work together?  A bit of experimenting showed the answer was yes.  Here’s one way to have an Index of one computer’s My Documents folder available on the other. And vice versa. 

I know that Locate32 can support remote shares (aka  \\server\share\) directly, but that has some considerations, not least of which is sharing  out My Documents. So I thought of putting the Locate32 index for one computer (Notebook) on Dropbox and having it used in another (Desktop). The steps, all done using Locate32 3.1 RC3 build 10.5090 :

On your first PC

  1. On the first PC (Notebook), create the new Locate32 index database file. Call it Notebook Docs, with a similar Description. Have it index My Documents or equivalent
  2. Save the index (say notebook.docs.dbs) to your My Dropbox folder
  3. Update the Locate32 index and test that it works locally

These are standard steps for Locate32. Nothing extra is required on this ‘source’ PC.  You can shutdown the Notebook now as we are finished. It is not needed for the following steps, including Searching its contents.

On your other PC

Now, on the other PC (Desktop) do this:

Start Locate32 and set it up to use the above index.

  1. Tools –> Settings –> Database
  2. Click on the Import button
  3. Navigate to My Dropbox and select the above index file (notebook.docs.dbs) and Open it
  4. On the Database options screen make two changes
  1. UncheckUpdate Globally
  2. CheckDo not overwrite existing data (e.g…”

Your screen should look a bit like this:

locate32_1

  1. Click OK a few times to exit

Make the results display more friendly and obvious

Now you can search this Index (and others).  A tip that helps further is to add the Database (name) or Database Description as a column on the main display/results screen. Just Right-Mouse on the column headings and select either the Database or Database Description. Makes it more obvious where the file really lives as can been seen below.

So, I was wondering where my document on my Grampians Trip in 2010 was. I was on the Desktop PC, so did a quick search.

locate32_2

There it is!  Note that the Database of Notebook Docs reminds me it’s on the other system – the Notebook.

Of course you can now do the vice-versa and create an Index for the Desktop and have it (read only) available via Dropbox on the Notebook. Or any other suitable computer.

Men of Rock & other great Geology documentaries

I love Geology. After a very sad start to this geeky affair (see at very end), it’s bloomed over the years. I’ve seen some fascinating documentaries, most of them hosted by Professor Iain Stewart. Here’s two recent series of his:

Men of Rock (2010) – retraces the steps of a band of maverick pioneers who made ground-breaking discoveries in the landscape of Scotland about how our planet works.

How Earth Made Us (2010)-  a five-part BBC documentary television series in which Professor Iain Stewart tells the epic story of how geology, geography and climate have influenced mankind.  (This beauty is in available in high definition too)

If you want to get suggestions on how to get these just use my general contact form.

My quick and sad story: first day at Uni means getting your timetable. I had nominated my 4 subjects:

  1. Physics (major)
  2. Geology
  3. Maths A
  4. Maths B

Smiling, I reached for my timetable printout – knowing I’d never have to touch that much-despised Chemistry ever again.

My timetable simply said “No combination possible”.  I went pale. 

Physics and Geology clashed with their times. I had to drop my major (Physics) to do Geology. Or drop Geology…and replace it via taking a full first year uni-level CHEMISTRY. I felt really sick. But it came to pass. Bye bye uni Geology…

Android Apps I Use (Nov 2010) – Main Ones

Okay will try and be quick. Here’s the main ones. I suppose my Must Have list. These tend to be on the front page of my phone or on a screen either side of it:

App Brain.  No brainer more like. Excellent way of managing apps from the phone or the web site. Yes the 2 can be kept in synch and hence gets a big tick from me. Plus has search and Top Apps listed in various categories and time frames. And it’s free.

Gmail for email

Android Agenda Widget for calendar summary on front home screen. Uses Google’s automatic Birthday calendar too. Nice.  Another screen has Jorte icon for the full Google calendar view etc.

NewsRob for RSS news headlines and summaries. Synchs to Google Reader. Images turned off to save time and bandwidth

SwiftKey Keyboard. Paid for this and am very happy with it. Excellent predictive words – in context – not just text/next-letter.

iSyncr for PC. After trying a few other ones, finally bought this one. Synchs your nominated iTunes playlists from the PC to the Android phone.  It is mainly a Windows App (.exe file) that runs directly from the SD Card; actually from the drive you mount the SD Card as inside Windows.

Zimly. My main music player. Can add playlists on the phone’

Dolphin HD Browser.  Fast web browsing

Handcent SMS. Fast, flexible SMS client

InstaFetch Instapaper client.

ColorNote Ideal for local (non-synched) notes. Like: Car parked in yellow J21 (true)

Got To Do  For not just to do list items but a bit of an info database as per here (replace iPhone with Android etc). It too synchs with Toodledo

BeyondPod. Podcast manager and player.

 

Other keys ones:

ToMarket Lite : shopping lists, sorted by aisle to make shopping quicker

Dropbox: as they say it ‘is the easiest way to store, sync, and, share files online”. Runs on my main PC, laptop and Android phone.

ES File Explorer : local and LAN (Wifi) access to your files. Copy, delete etc

WiFi Manager

Titanium Backup (phone is rooted). Paid for this to enable batching up of operations. Much easier than one-by-one.

Train Trapper: Melbourne train timetables, location aware.

 

That’s it for now. App Brain tells me I have 81 apps installed, some are services and extras so I don’t actually ‘see’ them.  With Android 2.2 – aka Froyo – I have a number that have installed themselves to the SD Card, which increases the total number that can be installed compared to 2.1 (which only used the limited ‘internal’ memory for apps).

1000 Steps Loop walk (Dandenongs)

Sunny day yesterday so went off to the Dandenongs to tackle the 1000 Steps walk again, including the ‘round the back’ extra bits. Total walk is about 11 km and you should allocate 3 to 4 hours.   This walk is not even an hour from my place.

It’s broken up into 4 main bits

  1. The 1000 Steps walk itself, aka Kokoda Memorial aka Tree Fern Gully. Quite steep. Up to the top of One Tree Hill (red, below, click on image to see larger versions at gallery)
  2. From the Hill it’s down then level along the ‘back’ of the mountains (blue)
  3. Up again – sustained up hill on a path – to the top of Chandlers Hill (blue)
  4. Back down and around to the ‘front’, being the start of the walk in the 1000 Steps car park (magenta)

And here’s just a few of the Steps themselves:

A few more photos are available, best to view as a slideshow. 

This is a really good walk particularly on a sunny day. Full details are Walk 22 (Ferntree Gully Forest) in the Day Walks Victoria book. Some tracks have been renamed or merged since this edition was published; but nothing too complicated.

All tracks are in the Melways, starting on Map 74 H4.

Kindle eBook – many editions of the same thing? – consumer confusion

I can’t recall a new technology that offered so many choices for apparently the same thing. And no clear reasons as to why I should get a particular one.

I’m talking about eBooks. In this case how a search for a single out-of-copyright book turned up a multitude of confusing choices.

The book is The First Men in the Moon by H.G. Wells.

As I have an Amazon Kindle I first went to their Kindle eBook web site. And so the fun began. Here are just some of the eBook versions there:

  • For $2.57 there is what seems to be an illustrated eBook, although it’s hard to tell as the reviews appear to be talking about the hardcopy (paper) version. So I’m still not sure.
  • Next was a $1.27 edition from a different publisher.
  • Following that there’s a $0.99 version from yet another publisher. 
  • There were others, but I stopped looking. From memory there was a free one, but it wasn’t available for Australian customers.

With me so far?  All three above are for that one title in a single eBook. Besides some possible illustrations, I can’t find out what is different between them.    So what follows really added to the confusion I felt there at Amazon:

  • For $1.00 a single eBook that had 30+ Adventure novels, including my target Moon one as well as other Wells pieces, such as War of the Worlds and The Time Machine.

This last one clearly stated it has “an active table of contents”, which is good news.  TOCs are not indispensible, but some free/cheap eBooks are just one big, fat plain text file, with no hint of TOC.   Begs the question if the first 3 do, but a quick check showed it wasn’t obvious.

Having done some research I had earlier discovered ManyBooks.net which offers “the best ebooks at the best price: free!”  I am sure these are all out-of-copyright ones (!).  So anyway, The First Men in the Moon was there.  The download dropdown options had Kindle as a format, so I took that and down it came to the PC. I then used the new – for me at least – ability to then email the eBook file from the PC to my Kindle email account…and when I turned the Kindle on, the eBook magically appeared.

It has no table of contents.  But hey, the Kindle lets me add my own notes and I can Go To them if I want.

So there you go. I’m guessing the non-free ones do have a TOC. Maybe they have been professionally edited too. But it’s not clear as to what added-value they bring as most of them don’t spell it out.  And spelling out such things is called marketing.