My first few weeks at ACMI (Australian Centre for the Moving Image) have been a whirlwind of discovery, exploration, planning and excitement around its forthcoming renewal. One of my projects is to continue exploring how machine learning and related AI techniques can be applied to ACMI’s collection of Australian moving image works.

This is not that project.

Or if it is, then it’s more of a fun steam-letting-off part of that project. Drawing from Dan Hon’s experiment in generating British place names, I trained a Recurrent Neural Network (RNN) on titles from ACMI’s collection, and asked it to generate some more.

Method

I used Jeff Thompson’s OS X instructions to get torch-rnn running, trained, and predicting new titles on my Mac. A good way to tell you’re working with bleeding-edge tech is that it is painful to install. I didn’t have to do much textfile editing to get it working, and it didn’t break many other things, but next time round I would definitely try the Docker version first. The moral is: keep your AIs safely contained!

The next step was to get a list of titles to train the AI with. I grabbed the Objects file from ACMI’s Collection Data Starter Kit, and wrote a quick Python script to write the titles to a text file. Here’s the complete list, if you’re interested.
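For anyone wanting to try the same thing, here is a minimal sketch of what such a script could look like. It is not the script used for this post: the tab-separated layout, the `title` column name and the example rows are all assumptions made for illustration, so check them against the actual Objects file before relying on it.

```python
import csv
import io


def extract_titles(source, title_field="title", delimiter="\t"):
    """Return the non-empty titles from a delimited text stream, one per record.

    The column name and delimiter are assumptions, not the real file layout.
    """
    reader = csv.DictReader(source, delimiter=delimiter)
    titles = []
    for row in reader:
        title = (row.get(title_field) or "").strip()
        if title:
            titles.append(title)
    return titles


def write_titles(titles, destination):
    """Write one title per line to an open text stream."""
    for title in titles:
        destination.write(title + "\n")


# Tiny in-memory stand-in for the real Objects file (hypothetical rows).
sample = io.StringIO(
    "id\ttitle\n"
    "1\tExample title one\n"
    "2\t\n"
    "3\tExample title three\n"
)
titles = extract_titles(sample)
out = io.StringIO()
write_titles(titles, out)
print(out.getvalue(), end="")
```

With a real file you would pass `open(path, newline="", encoding="utf-8")` instead of the in-memory sample, and an open output file instead of `out`.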

Training on the titles took an hour or so on my laptop; I didn’t stick around to find out exactly how long. Interestingly, the training process generates ‘checkpoint’ files, which you can use to generate titles from a less-experienced network before the training has completed. I’ll give some example titles from these checkpoints below.
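If you want to sample from several checkpoints in one go, a small helper can build the commands. This is only a sketch: the `cv/acmititles_N.t7` naming and the checkpoint numbers are taken from the commands later in this post, while the helper name `sample_command` is made up for illustration.

```python
def sample_command(iteration, temperature=None, prefix="cv/acmititles"):
    """Build the argument list for sampling from one torch-rnn checkpoint."""
    args = ["th", "sample.lua", "-checkpoint", f"{prefix}_{iteration}.t7"]
    if temperature is not None:
        # Only add the flag when asked, so the tool's default applies otherwise.
        args += ["-temperature", str(temperature)]
    return args


# Checkpoint numbers that appear in this post; 17150 is the fully trained network.
CHECKPOINTS = [1000, 5000, 10000, 17150]

for iteration in CHECKPOINTS:
    print(" ".join(sample_command(iteration)))

# To actually run one (from the torch-rnn directory, with Torch installed):
# import subprocess
# subprocess.run(sample_command(17150, temperature=0.5), check=True)
```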

Once the training had finished, it was a simple matter to generate titles. It’s also possible to provide different ‘Temperature’ values, which change the amount of ‘novelty’ in the system. In the examples below, I explore the effects of more training and less novelty. The results I’ve highlighted are (because I chose them) an improvement on the average quality of results. In the most entertaining scenarios I’ve linked to longer lists of results.
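To get a feel for what temperature does, here is a rough sketch of the usual approach for character-level models: raise each next-character probability to the power 1/T and renormalise. It is not torch-rnn’s actual code, and the three characters and their probabilities are invented for illustration.

```python
import math
import random


def apply_temperature(probs, temperature):
    """Rescale a probability distribution as p_i^(1/T), then renormalise."""
    if temperature <= 0:
        raise ValueError("temperature must be positive")
    scaled = [math.exp(math.log(p) / temperature) if p > 0 else 0.0 for p in probs]
    total = sum(scaled)
    return [s / total for s in scaled]


def sample_char(chars, probs, temperature, rng=random):
    """Pick one character using the temperature-adjusted probabilities."""
    weights = apply_temperature(probs, temperature)
    return rng.choices(chars, weights=weights, k=1)[0]


# Invented next-character probabilities, purely for illustration.
chars = ["e", "t", "z"]
probs = [0.6, 0.3, 0.1]

for t in (1.0, 0.5, 0.2):
    adjusted = apply_temperature(probs, t)
    print(t, [round(p, 3) for p in adjusted])
```

At 1.0 the probabilities are left as they are. As the temperature drops, the most likely character takes up more and more of the distribution, which is why low temperatures produce safer, more repetitive titles.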

The effect of more training

These generation commands vary by checkpoint. A higher number means the network is more trained, and the highest number is 17150.

th sample.lua -checkpoint cv/acmititles_**1000**.t7

This is pretty rudimentary, nearly ‘lorem ipsum’-ish, but clearly better than random characters, and I give it extra points for inventing “Australiation”. It has also adopted ACMI’s protocols for marking Captions and DVDs in titles. I like that it seems to generate nonsense that evokes a particular language: English, German, French, Latin. (N.B. ACMI uses ‘=’ to indicate translated titles. The transformation of ‘No.’ to ‘№’ is Medium’s.)

  • Dishorl (Coption. Volucu (Captioned)
  • Brainou dinawawd Spiture conzer
  • Fing, calle the well
  • France Fogral 2000 = Cray
  • The River sufits
  • Taby’s wildisifs Issoder wald. №1. [Inpact. — 0t = Clactione (A)
  • Mamson of the Cames Afrerzand of three pirm ant mathation of the A Blad york & ukou [DVD]
  • Eye
  • Australiation en
  • Cellle que dreacuat: the Truciona Schore [CVCA)
  • Chaples of the sundy
  • Stucter the Zittrest
  • Ut lenzunes Zreal stire touran
  • …etc.

th sample.lua -checkpoint cv/acmititles_**5000**.t7

Now it’s beginning to look like some thought has gone into it. It may help to imagine some of these being said in an Auld Scots dialect:

  • Black Australia
  • Play shork
  • The tron’s celeve: Christ to Tomorrow
  • Frin’ of Guitz dous
  • War caries
  • dream view
  • The Buildren a languard King dill the swain. Come Reader from Bering
  • South of Bends
  • Kuman wenting: The Poopes with Don at Viglembority
  • The Cundrals City
  • Fackumo
  • Cat Critics: thrawd unges, the changlosia comafrico: sents in actories
  • The Alphont
  • Oration
  • Don’t time (D)
  • Lonster = the West of Tave
  • Chysids: a ginution in the fould to lugar
  • …etc.

th sample.lua -checkpoint cv/acmititles_**10000**.t7

This model seems to have more sustained moments of being on the verge of making sense. I think the _temperature_ value may still be too high to be interesting — too much randomness can be as uninspiring as none.

  • The Craining
  • Gromastard of Apprets (Captione)
  • Beach Germany
  • Dogged with Fine. [№381
  • Mirror of Fire to gavey by Histry Vasie Miroh valuley [B&W]
  • The Tenny is is early white of marming [DVD]
  • Sky people
  • Wonderland Tada: fight of Glater sabute vitoriom
  • Muse and people and experiel: project of the babi
  • The Quoubed film!
  • Les Fluik Steagh: Prenstactic Rives for shong upon lifes
  • The New de city. Part 1. Spedite Nortan vook
  • Mumbler sama
  • …etc. More results here.

th sample.lua -checkpoint cv/acmititles_**17150**.t7

This is the final model, after the training has finished. It’s pretty inventive — if only we knew what all those new words meant. I call this “young adult scifi novel mode”.

  • Making ackerous
  • The Zookboy
  • Sound of a parens
  • The Vision Island
  • The Nemplatage by speeng: a sea Ennital’s sense [Dabb)
  • Sourtelf asimation
  • Space ourth
  • Bangosia story in Australian eterness
  • Modern princes, the labon and lace
  • Zive and the didgent
  • Reductional priving’s Anumie
  • The Metrectival descreecces voice = monkey: Episodes
  • …etc. The full list is here.

The effect of novelty

Remember, this neural network has no knowledge of English words, grammar or punctuation. It has only ever been shown these titles, not any other text, and we haven’t told it that these are necessarily titles, or even words.

Given how little we have told this AI about what it’s supposed to do (just ‘make more of this sort of thing’), it’s doing an admirable job. It’s also being quite brave about taking things in new directions. Perhaps a little too brave.

Luckily there’s a temperature control that allows us to cool down the novelty (and maybe to dial back any sentience that occurs). The default temperature value is 1.0, the most novel setting used in this post, which is what we’ve used until now. Dialling the temperature back will result in more words and word patterns that are already found in the training list. Check it out:

th sample.lua -checkpoint cv/acmititles_17150.t7 **-temperature 0.2**

Very low novelty — the patterns used are the most common ones in the collection. We have lots of ‘stories of’ works, it seems.

  • The Children of the school (TEFC)
  • The Community and the story
  • The Little the world of the story
  • The Beginning the school (TEFC)
  • The Sea and the search of the sea
  • The Story of the story
  • The Story of the search of the search of the search of the search of the world
  • The Story of the story
  • …etc.

th sample.lua -checkpoint cv/acmititles_17150.t7 **-temperature 0.5**

Adding more novelty. In these results, most words are recognisably English and it is easier to select titles that are grammatically correct. I call this “bad concept album titles mode”. We’re close to being intriguingly novel, but the word patterns are a bit too repetitive (“The x in/of/and/to the y”) to be intriguing for long.

  • The Life of the office
  • The Plant community
  • Breaking the earth = The Great growth [DVD]
  • The Body in the clother sea = The Story of the electronical resease
  • Parning the resigners of the Seven (Captioned)
  • The Conside of the Beat
  • The Australian diary. №097 = Deutschland Spiegel
  • The Design to the commander
  • The Art in the lung
  • The Demonstre
  • The Children of the monder and the discoveries
  • Music panstraction
  • The Master of the Flemination
  • The Machination and peace for the spirit of the communication
  • The Strance panorama. [№70/12]
  • … etc. Here are 359 more.

th sample.lua -checkpoint cv/acmititles_17150.t7 **-temperature 0.6**

‘Undergraduate artist mode’. At this point, changing _temperature_ is a tradeoff: we want more novel word patterns, but fewer novel words. If we were doing this seriously, we’d perhaps choose a model that knows more about (Australian) English.

  • A Compilation of the poses
  • The Mazures of Survival (K)
  • Cities of the Grand Pictures of the father = The Allial change
  • The Cup Anternation
  • The Rose in the world
  • New parts of the cat
  • Challenge de Billy
  • How the sheep bun
  • Getters of the story
  • Fattle to three smars
  • Bean story
  • Saga: the killer at you and the learning
  • The Water in industrial program
  • The History of Heart
  • The Words of the babber South
  • Faminations 1990 & 2
  • A New life in the red show
  • Angel of the wind
  • Making out
  • The Interview of the self and compile
  • Blood moves
  • Mirror of Australia
  • … etc. Here are 374 more.

That’s about all. I’ll be exploring these concepts further in my new album, The Conside of the Beat, available in absolutely no good record stores.