I have a series of discrete, purportedly random whole numbers like this:
v1 v2 v3 v4 v5 v6 42 23 10 07 01 35 05 02 26 25 49 18 35 18 43 29 26 28 36 59 26 15 34 35
I want to identify if there are any values that co-occcur (in a row) more frequently than one would expect by chance. The co-occurance within a row is of interest regardless of which columns the #'s happen to be in.
Any ideas of a technique? Just keywords I could search for for more info would be highly appreciated.
The answers so far are greatly appreciated and interesting; let me clarify my interest. Pretend there are more lines than given below, and note the #'s 10 and 07, which seem to co-occur within a row a lot (it will not be this obvious, and I have thousands of rows). What I am interested in is identifying two or more numbers that appear in the same row together more frequently that one one expect at random. It should be noted that the numbers have a finite range (e.g. 1:60) and are supposed to be randomly drawn without replacement within each row.
v1 v2 v3 v4 v5 v6 42 23 10 07 01 35 10 02 26 25 07 18 07 10 43 29 26 28 36 59 07 15 34 10