Sunday, May 8, 2016

Current status of CasualConc

I haven't written here for quite a while, but I've been fixing bugs on CasualConc 2.0. Nothing fancy has been added since I posted an entry here on this blog. A few of the new or enhanced features are:

Visualization tools:
label coloring is easier; you can grab label colors from Label Coloring panel or even directly from the Label Coloring Database.

File Info:
tag filtering is available in File modes (simple/advanced); you can set multiple tags to count frequencies of words in different tagged parts; for the first time, you can set the tagged area using XPath (experimental)

This is an example of TF-IDF using XPath option. Speakers are tagged on the Friends scripts and simply frequency counts by each character's utterances are used to create this. If you know Friends, you might be able to see something meaningful.



I've been playing with XML to implement some basic XML features, but I'm not sure how to create a good interface. Any idea?


Finally, this is a kind of important note. If you have used R on CasualConc and recently upgraded R to 3.3, you need to turn off/on the Use R for Visualization on Preferences. It looks like you need to re-install non-default packages after you upgraded to 3.3. I got one report that even after re-installing packages, some packages were not installed properly (not the packages CasualConc tries to install, but the packages those 'packages' depend). In that case, you need to manually reinstall those packages on R or RStudio. I'll see if I can add a function to check them or at least display error message from R (I haven't figured this out yet...).

Sunday, October 18, 2015

New feature showcase

This is just a series of images that showcase the new features of CasualConc 2.0.

Line Chart

Line Chart
Bar Chart

Pie Chart

Radar Chart

Scatter Plot

Word Cloud (frequency)

Word Cloud (TF-IDF)

Hierarchical Cluster Analysis (dendrogram)


Correspondence Analysis

Principal Component Analysis (factor scores)

Principal Component Analysis (factor loadings)

Multi-dimensional Scaling (MDS)

MDS (3D)

Network Analysis

Network Analysis

Concordance Plot

Random Forest (keyword extraction)

Regular Expression Practice

Vocabulary Profiler

Tokenization (Japanese)

Tokenization (Korean)

Tokenization (Chinese)

Tokenization (Thai)




Sunday, October 4, 2015

CasualConc 2.0 beta

I'm pleased to announce the immediate release of CasualConc 2.0 beta, a new version of CasualConc.

For the last couple of years, I've mostly worked on CasualTranscriber, which is now version 2.5. Now it has most of the functions I wanted to use. So I decided to update CasualConc.

CasualConc was first released in 2008 and has been constantly updated up until a couple of years ago. The release of OS X 10.9 Mavericks changed something in the developing environment and made me unable to update the project. This was partly due to the scripting language/bridge I used to write CasualConc.

This new version of CasualConc is almost entirely rewritten. It is written in RubyMotion, which is kind of a successor to RubyCocoa.

Due to the difference in regular expression engine, the speed of text processing is a bit slower with RubyMotion, it is compensated by parallel processing. Also due to the same reason, your search result might be slightly different from the one with the older version. This is because the two engines treat some symbols/marks differently.

This new version may not have all the functions you use with 1.9.x, so if you need a particular feature, please let me know. I'll see if I can bring it back to the new version.

CasualConc 2.0 has the following new and improved features:

  • text mode
  • opening a result window for Collocation and Word Count
  • much faster File Info tools
  • collocation frequency table
  • utilization of build-in tagger and integration of TreeTagger (including an installer)
  • integration of a statistical environment R (needs to be installed separately)
  • keyword extraction with Mann-Whitney-U and Random Forest (requires R)
  • visualization of frequency information (requires R)
  • and more...

A manual, which covers most of the features, is available and help is included in the application.

CasualConc 2.0 beta is compatible with Mac with OS X 10.9 Mavericks or later, including 10.11 El Capitan.

It is very likely that this version still has many problems. Please try it and if you find any problems, your feedback is greatly appreciated.

This slideshow (in Japanese) presents new features. Text is minimal (mostly screenshots).

Sunday, October 26, 2014

Yosemite compatibility report

I released Yosemite-compatible versions of the following application.

CasualConc
CasualPConc
CasualMultiPConc
CasualMecab
CasualTextractor
CasualTreeTagger
IPATypist

CasualTranscriber 2.5 is already compatible with Yosemite.

If you encounter any problem, please let me know.

Tuesday, October 21, 2014

Yosemite issue

As you might have tried, CasualConc and other applications do not run on OS X 10.10 Yosemite. This is because my applications are written in RubyCocoa, which depends on Ruby 1.8.x. But Apple decided to drop Ruby 1.8.7 on Yosemite (Ruby 2.0.0 is there).

I'm working with the maintainer of RubyCocoa to work around this issue (most likely apps will include a port of Ruby 1.8 within the apps). So far, I have successfully run it on Yosemite, but haven't had time to test it. If you are interested in helping testing the functionality on Yosemite, please email me to the address on the site.