Saturday, April 10, 2010

The current status of CasualConc beta - Cluster/Collocation/Cooccurrence

Only one minor change has been made to Cluster.

When you select Left Only in Span, the result will be aligned to the left.


If you select Tag(s) mode for the search word, the result of searching for 'jj nn1' (adjective + singular noun) will look like this:


If Suppress tags in context is on, the result will look like this.


And Tag Only search will return results like this:



Now for Collocation.  In the current working version, a word's total frequency includes its frequency in the keyword position.  In the new version, the total counts only the word's occurrences in the context positions, so the keyword no longer comes at the top of the list.
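To make the difference concrete, here is a minimal Python sketch of the two ways of totalling.  It is not CasualConc's actual code; the table layout (position 0 for the keyword slot, negative numbers for the left, positive for the right) and the counts are made up for illustration.

```python
from collections import Counter

# Hypothetical per-position counts for three words around the keyword "make".
# Positions: negative = left of the keyword, 0 = keyword slot, positive = right.
position_counts = {
    "make":     Counter({0: 50}),
    "decision": Counter({1: 4, 2: 12}),
    "a":        Counter({-2: 3, 1: 20}),
}

def total(counts, include_keyword):
    """Sum a word's frequencies, optionally counting the keyword slot (position 0)."""
    return sum(n for pos, n in counts.items() if include_keyword or pos != 0)

def ranked(table, include_keyword):
    """Return (word, total) pairs sorted by total, highest first."""
    totals = {word: total(counts, include_keyword) for word, counts in table.items()}
    return sorted(totals.items(), key=lambda item: item[1], reverse=True)

old_style = ranked(position_counts, include_keyword=True)   # keyword slot counted
new_style = ranked(position_counts, include_keyword=False)  # context positions only
```

With the keyword slot counted, "make" tops the list simply because it is the search word; with context positions only, it drops to the bottom and the real collocates move up.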


Another change is variable span.  You can now set the span of the context words separately for the left and right sides, up to 5 words each.


The result will look like this:


Collocation Stats calculations should reflect this to some extent.
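For readers who want to see what a separate left/right span means in practice, the following Python sketch counts context words by offset from the keyword.  The function name, its parameters, and the sample sentence are my own assumptions for illustration, not CasualConc internals; only the limit of 5 words per side comes from the description above.

```python
from collections import Counter, defaultdict

MAX_SPAN = 5  # each side can be set up to 5 words

def collocates_by_position(tokens, keyword, left=4, right=4):
    """Count context words at each offset from every hit of `keyword`."""
    if not (0 <= left <= MAX_SPAN and 0 <= right <= MAX_SPAN):
        raise ValueError("left and right spans must be between 0 and 5")
    table = defaultdict(Counter)  # word -> {offset: count}
    for i, tok in enumerate(tokens):
        if tok != keyword:
            continue
        for offset in range(-left, right + 1):
            j = i + offset
            if offset == 0 or j < 0 or j >= len(tokens):
                continue  # skip the keyword slot and positions outside the text
            table[tokens[j]][offset] += 1
    return table

# Made-up sample text: one word to the left, three to the right
tokens = "we have to make a decision before they make a new plan".split()
table = collocates_by_position(tokens, "make", left=1, right=3)
```

Here "a" is counted twice at R1, while "have" is never counted because it sits two words to the left, outside the left span of 1.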


Now with Tag(s) selected, the result will look like this:


If Treat Keywords As One Word is checked, the result will look like this:


If Suppress tags in context is on, the results with or without Treat Keywords As One Word checked will look like this:

NOT Checked


Checked


Finally, Tag Only search will return results like this:



Next is Cooccurrence.

The new feature of Cooccurrence is sorting.  You can now sort words in each position based on collocation statistics.  To use this feature, you need to run Word Count first.
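Word Count is needed because collocation statistics compare how often a word appears next to the keyword with how often it appears in the corpus overall.  The Python sketch below uses one common formulation of MI, log2(observed / expected); the exact formula CasualConc uses is not stated here, and the function names and numbers are hypothetical.

```python
import math

def mutual_information(pair_freq, keyword_freq, word_freq, corpus_size):
    """MI = log2(observed / expected), with expected = f(keyword) * f(word) / N."""
    expected = keyword_freq * word_freq / corpus_size
    return math.log2(pair_freq / expected)

def sort_position_by_mi(position_freqs, word_counts, keyword, corpus_size):
    """Sort the words found at one position by MI score, highest first."""
    scored = [
        (word, freq, mutual_information(freq, word_counts[keyword],
                                        word_counts[word], corpus_size))
        for word, freq in position_freqs.items()
    ]
    return sorted(scored, key=lambda row: row[2], reverse=True)

# Hypothetical numbers for illustration only
word_counts = {"make": 100, "a": 5000, "decision": 40}  # from a word count pass
corpus_size = 100_000
r1_freqs = {"a": 20, "decision": 4}  # words seen one position right of "make"

by_mi = sort_position_by_mi(r1_freqs, word_counts, "make", corpus_size)
```

In plain frequency order "a" would come first (20 against 4), but under this MI score "decision" ranks higher because it is rare in the corpus as a whole.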


In the normal frequency order, the result looks like this:


With MI (Mutual Information) sort, the result will look like this:



With Tag(s) mode, the result will look like this:


You can now export Cooccurrence results with frequency information.  Check Include frequency info in the Cooccurrence export in Preference -> Others.


The exported CSV file with frequency info will look like this in Excel.




Now for tag handling in Cooccurrence.  With Tag Only mode, the result will look like this:



These are the new features of Cluster/Collocation/Cooccurrence.
