The A Project->Getting Started->Mississippi

Mississippi

Home
  First Encounter
  Current Status
  GNU License
Downloads
Getting Started
  Examples
  Mississippi
Overview of A
  Structure of Data
  Syntax
  Relation to other APLs
  Language Reference

System Fns and Vars

Functions

Where next
Quibbles
Materials

Vector Home

The “Mississippi” challenge from Vector 18.2

This example is taken directly from Norman Thomson's J-ottings column in Vector 18.2. It is an interesting comparison of the speed of the 'obvious' outer-product solution and a very sneaky algorithm using upward ranking.

ă The Mississippi challenge (from Vector 18.2 - Jottings)
ă The object is to take a vector and return the occurrence number for each element
 s ű 'mississippi' 

ă The boring way with an outer product
 nub v: { ((vÉv)=ÉŇv)/v }
 unq ű nub s
 tbl ű unq Ę.= s 
ă Cum across and save where we had numbers  
 tbl ű tbl Ť +\@1 tbl   
 oc ű ˘1++/tbl

'The answer is ',îoc

ă Make it a function so we can time it ...
 oca s : { tbl ű (nub s)Ę.=s ; tbl ű tbl Ť +\@1 tbl; (+/tbl)-1}

ă Now for the super-sneaky approach (J-forum written up in Vector)
 t ű (sÉunq)[unqÉs]
 r ű ččt
 occ ű r-r[t]

'The answer is also ',îocc

ă Get a little nearer the J style using index ...
 t ű (unqÉs)#sÉunq
 r ű ččt
 occ ű r-t#r

'The answer is still ',îocc 

ă Finally as a function
 ocb s: { uűnub s; tű(uÉs)#sÉu; rűččt; r-t#r }

' oca s   ă tests the boring one'
' ocb s   ă tests the sneaky version'
' '
'Try some timings with various sized arrays ...' 
' time qq := ocb 500000 rho s   ă Arthur wrote a mean gradeup here!'

The timings are interesting, when we run this in A+ and in the two major Windows APLs:

The outer-product solution is actually the faster of the two algorithms in both Dyalog and +Win, but it runs out of memory fairly quickly in my standard setup of 20M. In A+ it shows very comparable timings, but of course you now have 'infinite' memory as A+ will just grab swapfile on your hard-drive as it needs to allocate space for ever bigger arrays.

The significant line is the bottom one which supports the claim in Jonathan Barman's first impressions of A - that sorting is impressively fast. This set of timings is definitely curving downwards, so the elapsed time is less than linear with the size of the array, which is very impressive.