tag:blogger.com,1999:blog-6266250906676688622024-03-14T04:14:27.944+09:00CasualConc - a concordancer for macOSThis blog is mostly about a concordance program I have been developing for macOS and some related stuff. CasualConc is designed for exploratory-type text analysis and visualization of frequency data. You can download <b>CasualConc</b> from the main site (<a href="http://sites.google.com/site/casualconc/">English</a> or <a href="http://sites.google.com/site/casualconcj/">Japanese</a>).Yasuhttp://www.blogger.com/profile/08489030458578691142noreply@blogger.comBlogger103125tag:blogger.com,1999:blog-626625090667668862.post-69611034321390374412022-04-01T19:28:00.001+09:002022-05-02T12:45:53.452+09:00CasualConc goes 3.0!<p>After 3 and half years of no activity on this blog, I am happy to announce a new release of CasualConc (though it is still a work-in-progress).</p><p>This version has some new features and bug fixes as well as interface changes.</p><p>Two major additions are basic XML file handling and grammar search using dependency grammar tags with Stanford CoreNLP.</p><p>The XML handling feature is fairly basic, but CasualConc analyzes XML structure and you can select the tags or attributes to use for grouping/filtering.</p><div class="separator" style="clear: both; text-align: center;"><a href="https://blogger.googleusercontent.com/img/b/R29vZ2xl/AVvXsEiLt-cUO0Ta5d1ohsX5Z57kgUNTKTVZaTKb0dQyBLhxCfodHq2YNd2-A-ZjKx-_73js6gqEJcFYcEkWtME-eKZ0m85H4TjFHLHb-GeZzu5CMgFe8rVRgUrWeBB6EbybiF4sNYirGLS0DpW9FjLr1v8F3dNes9r4vE5sCG2ZRy43sQ3OMWFnb-0o3jj6/s1100/xml%20handling.png" style="margin-left: 1em; margin-right: 1em;"><img border="0" data-original-height="652" data-original-width="1100" height="238" src="https://blogger.googleusercontent.com/img/b/R29vZ2xl/AVvXsEiLt-cUO0Ta5d1ohsX5Z57kgUNTKTVZaTKb0dQyBLhxCfodHq2YNd2-A-ZjKx-_73js6gqEJcFYcEkWtME-eKZ0m85H4TjFHLHb-GeZzu5CMgFe8rVRgUrWeBB6EbybiF4sNYirGLS0DpW9FjLr1v8F3dNes9r4vE5sCG2ZRy43sQ3OMWFnb-0o3jj6/w400-h238/xml%20handling.png" width="400" /></a></div><div><br /></div>You can use filters in tools.<div><br /></div><div class="separator" style="clear: both; text-align: center;"><a href="https://blogger.googleusercontent.com/img/b/R29vZ2xl/AVvXsEi_ou4S7QIjRcpRQHDFeAgrShCSzWMrZWBZ00YduPgnrx_UmOGQbwGUeGrLAzIGcJilnDH-ElH2A_Y9TZIcIHrmjqrcTUHID8D6jM3WaDMXULlJ5kddxvSw_WCmmQ7A3lZaWAGXwFnb70IJJhN5W1g3pVCAhwkm3ZlOYnE7Jdx9MWys6VJqaqX9c2-r/s813/xml%20fi%20result.png" style="margin-left: 1em; margin-right: 1em;"><img border="0" data-original-height="248" data-original-width="813" height="123" src="https://blogger.googleusercontent.com/img/b/R29vZ2xl/AVvXsEi_ou4S7QIjRcpRQHDFeAgrShCSzWMrZWBZ00YduPgnrx_UmOGQbwGUeGrLAzIGcJilnDH-ElH2A_Y9TZIcIHrmjqrcTUHID8D6jM3WaDMXULlJ5kddxvSw_WCmmQ7A3lZaWAGXwFnb70IJJhN5W1g3pVCAhwkm3ZlOYnE7Jdx9MWys6VJqaqX9c2-r/w400-h123/xml%20fi%20result.png" width="400" /></a></div><div><br /></div>Grammar search utilizes dependency grammar tags assigned by Stanford CoreNLP. You need to install Java and Stanford CoreNLP separately (the instruction is on a new manual).<div><br /></div><div>You can use this new feature on any of the languages Stanford CoreNLP can handle by specifying dependency grammar tags.</div><div><br /></div><div class="separator" style="clear: both; text-align: center;"><a href="https://blogger.googleusercontent.com/img/b/R29vZ2xl/AVvXsEjTJt32JBYxQ-xLz3w2bHHwA1fkg67afTOWNghEqvSNEJxm5MG7m_XZtaMoDxboSJb5JsnqciMcZBbrLcKoJ_7mHEPad8sAbDhaPIviTGwdKiZkNV8qNnx9zZUJgM_r6neBd4raaE6CSEZddekcGQhjRRh6SypY6dJHbdkrpixFlVmrLtL6DhlQj7OZ/s1100/grammar%20search%20detail.png" style="margin-left: 1em; margin-right: 1em;"><img border="0" data-original-height="652" data-original-width="1100" height="238" src="https://blogger.googleusercontent.com/img/b/R29vZ2xl/AVvXsEjTJt32JBYxQ-xLz3w2bHHwA1fkg67afTOWNghEqvSNEJxm5MG7m_XZtaMoDxboSJb5JsnqciMcZBbrLcKoJ_7mHEPad8sAbDhaPIviTGwdKiZkNV8qNnx9zZUJgM_r6neBd4raaE6CSEZddekcGQhjRRh6SypY6dJHbdkrpixFlVmrLtL6DhlQj7OZ/w400-h238/grammar%20search%20detail.png" width="400" /></a></div><br /><div>For English, you can select a pre-specified grammar structure for search.</div><div><br /></div><div class="separator" style="clear: both; text-align: center;"><a href="https://blogger.googleusercontent.com/img/b/R29vZ2xl/AVvXsEhvj6tN5mxQ8sOFV_1gHxDwckaCz_zvU8fwE6awRiRuaJzw-278xAbkMyqWy3ou1Vs-TjAQ6ay7Eem0txoQ0GdQvbv58TSwYg-o69Ngsk1cBd94enzeMmZw63-ZKv4PCdf1SxoTWrU3j7uS_3LU7FCdPh9-o1AS6WgJ6jCiE0SJF6avk32COwR4ZkjY/s1100/grammar%20search%20preset.png" style="margin-left: 1em; margin-right: 1em;"><img border="0" data-original-height="652" data-original-width="1100" height="237" src="https://blogger.googleusercontent.com/img/b/R29vZ2xl/AVvXsEhvj6tN5mxQ8sOFV_1gHxDwckaCz_zvU8fwE6awRiRuaJzw-278xAbkMyqWy3ou1Vs-TjAQ6ay7Eem0txoQ0GdQvbv58TSwYg-o69Ngsk1cBd94enzeMmZw63-ZKv4PCdf1SxoTWrU3j7uS_3LU7FCdPh9-o1AS6WgJ6jCiE0SJF6avk32COwR4ZkjY/w400-h237/grammar%20search%20preset.png" width="400" /></a></div><div><div><p>I prepared a sample grammar database file (with Sherlock Holmes novels/shorts) on CasualConc site, so you can feel what this tool is like.</p><p>There are other more minor additions, such as filtering files for analysis and lexical diversity measures on Word Count/File Info.</p><p>This is still a beta version, so there may be many issues. Please report any problem you encounter. That will help improve CasualConc.</p><p>In any case, please enjoy playing with grammar search. It's fun!</p></div></div>Yasuhttp://www.blogger.com/profile/08489030458578691142noreply@blogger.com0tag:blogger.com,1999:blog-626625090667668862.post-41357332875579636022018-09-23T05:49:00.001+09:002018-09-23T05:50:33.720+09:00CasualConc 2.1 is released with an updated manual!I just released an updated version of CasualConc along with an updated manual.<br />
<br />
It has many new features and a lot of bug fixes.<br />
<br />
Major changes are an addition of Frequency Plot tool. Now you can draw box plots, violin plots, bean plots, histograms and bar plots with mean frequencies of items (different from bar plots of Chart tool). Also you can plot relative position of items.<br />
<br />
A very simple batch tagging function is also added. If you have TreeTagger, Stanford CoreNLP, or MeCab on your mac, you can create tagged texts on CasualConc.<br />
<br />
There might be some bugs remaining, so please try it and report any bug, problem, or feature request.<br />
<br />
<br />
<div class="separator" style="clear: both; text-align: center;">
<a href="http://2.bp.blogspot.com/-BbEzmOe7Jlc/W6apYwhySbI/AAAAAAAAAg0/KCp7TTlVFfcy2-5F65bcOIi91H0mvh2FgCK4BGAYYCw/s1600/%25E3%2582%25B9%25E3%2582%25AF%25E3%2583%25AA%25E3%2583%25BC%25E3%2583%25B3%25E3%2582%25B7%25E3%2583%25A7%25E3%2583%2583%25E3%2583%2588%2B2018-09-21%2B7.57.51.png" imageanchor="1" style="clear: left; margin-bottom: 1em; margin-right: 1em;"><img alt="" border="0" height="314" src="https://2.bp.blogspot.com/-BbEzmOe7Jlc/W6apYwhySbI/AAAAAAAAAg0/KCp7TTlVFfcy2-5F65bcOIi91H0mvh2FgCK4BGAYYCw/s320/%25E3%2582%25B9%25E3%2582%25AF%25E3%2583%25AA%25E3%2583%25BC%25E3%2583%25B3%25E3%2582%25B7%25E3%2583%25A7%25E3%2583%2583%25E3%2583%2588%2B2018-09-21%2B7.57.51.png" title="" width="320" /></a><a href="http://2.bp.blogspot.com/-tNxd2gCQ_24/W6aqMD9dZ8I/AAAAAAAAAhM/UZrA8NFMWq8_Nm-ceuZY7Kg6XBvVY4PuACK4BGAYYCw/s1600/%25E3%2582%25B9%25E3%2582%25AF%25E3%2583%25AA%25E3%2583%25BC%25E3%2583%25B3%25E3%2582%25B7%25E3%2583%25A7%25E3%2583%2583%25E3%2583%2588%2B2018-09-22%2B11.56.14.png" imageanchor="1" style="margin-left: 1em; margin-right: 1em;"><br /><img border="0" height="312" src="https://2.bp.blogspot.com/-tNxd2gCQ_24/W6aqMD9dZ8I/AAAAAAAAAhM/UZrA8NFMWq8_Nm-ceuZY7Kg6XBvVY4PuACK4BGAYYCw/s320/%25E3%2582%25B9%25E3%2582%25AF%25E3%2583%25AA%25E3%2583%25BC%25E3%2583%25B3%25E3%2582%25B7%25E3%2583%25A7%25E3%2583%2583%25E3%2583%2588%2B2018-09-22%2B11.56.14.png" width="320" /></a></div>
<div class="separator" style="clear: both; text-align: center;">
<a href="http://4.bp.blogspot.com/-1WQC4K_JZVY/W6apjFUb-0I/AAAAAAAAAg8/6C6Y4XIyShIfJgjfk1XsF0y5nH5teP2SwCK4BGAYYCw/s1600/%25E3%2582%25B9%25E3%2582%25AF%25E3%2583%25AA%25E3%2583%25BC%25E3%2583%25B3%25E3%2582%25B7%25E3%2583%25A7%25E3%2583%2583%25E3%2583%2588%2B2018-09-20%2B4.16.52.png" imageanchor="1" style="clear: right; margin-bottom: 1em; margin-left: 1em;"><img border="0" height="305" src="https://4.bp.blogspot.com/-1WQC4K_JZVY/W6apjFUb-0I/AAAAAAAAAg8/6C6Y4XIyShIfJgjfk1XsF0y5nH5teP2SwCK4BGAYYCw/s320/%25E3%2582%25B9%25E3%2582%25AF%25E3%2583%25AA%25E3%2583%25BC%25E3%2583%25B3%25E3%2582%25B7%25E3%2583%25A7%25E3%2583%2583%25E3%2583%2588%2B2018-09-20%2B4.16.52.png" width="320" /></a></div>
<div class="separator" style="clear: both; text-align: center;">
<br /></div>
<div class="separator" style="clear: both; text-align: center;">
<a href="http://4.bp.blogspot.com/-tKcj5Ws0frw/W6aqfQMCXCI/AAAAAAAAAhY/mDHdL_QGgdQkMh_OTkeEZwuqaCcn48B0gCK4BGAYYCw/s1600/%25E3%2582%25B9%25E3%2582%25AF%25E3%2583%25AA%25E3%2583%25BC%25E3%2583%25B3%25E3%2582%25B7%25E3%2583%25A7%25E3%2583%2583%25E3%2583%2588%2B2018-09-22%2B10.02.32.png" imageanchor="1"><img border="0" height="320" src="https://4.bp.blogspot.com/-tKcj5Ws0frw/W6aqfQMCXCI/AAAAAAAAAhY/mDHdL_QGgdQkMh_OTkeEZwuqaCcn48B0gCK4BGAYYCw/s320/%25E3%2582%25B9%25E3%2582%25AF%25E3%2583%25AA%25E3%2583%25BC%25E3%2583%25B3%25E3%2582%25B7%25E3%2583%25A7%25E3%2583%2583%25E3%2583%2588%2B2018-09-22%2B10.02.32.png" width="318" /></a></div>
<div class="separator" style="clear: both; text-align: center;">
<br /></div>
<div class="separator" style="clear: both; text-align: center;">
<br /></div>
<div class="separator" style="clear: both; text-align: center;">
<br /></div>
<div class="separator" style="clear: both; text-align: center;">
<br /></div>
<div class="separator" style="clear: both; text-align: center;">
<br /></div>
<div class="separator" style="clear: both; text-align: center;">
<br /></div>
<div class="separator" style="clear: both; text-align: center;">
<br /></div>
Yasuhttp://www.blogger.com/profile/08489030458578691142noreply@blogger.com1tag:blogger.com,1999:blog-626625090667668862.post-33836970440096511552016-05-08T11:09:00.000+09:002016-05-08T11:09:18.584+09:00Current status of CasualConcI haven't written here for quite a while, but I've been fixing bugs on CasualConc 2.0. Nothing fancy has been added since I posted an entry here on this blog. A few of the new or enhanced features are:<br />
<br />
<b>Visualization tools</b>:<br />
label coloring is easier; you can grab label colors from Label Coloring panel or even directly from the Label Coloring Database.<br />
<br />
<b>File Info</b>:<br />
tag filtering is available in File modes (simple/advanced); you can set multiple tags to count frequencies of words in different tagged parts; for the first time, you can set the tagged area using XPath (experimental)<br />
<br />
This is an example of TF-IDF using XPath option. Speakers are tagged on the Friends scripts and simply frequency counts by each character's utterances are used to create this. If you know Friends, you might be able to see something meaningful.<br />
<br />
<a href="http://1.bp.blogspot.com/-6YmhEYX_ZSk/Vy6daLlkHqI/AAAAAAAAAdM/PhHQnbw36EA_7kkZZMaVyPBZXiNfvpD6ACK4B/s1600/%25E3%2582%25B9%25E3%2582%25AF%25E3%2583%25AA%25E3%2583%25BC%25E3%2583%25B3%25E3%2582%25B7%25E3%2583%25A7%25E3%2583%2583%25E3%2583%2588%2B2016-04-25%2B8.06.16.png" imageanchor="1"><img border="0" height="48" src="https://1.bp.blogspot.com/-6YmhEYX_ZSk/Vy6daLlkHqI/AAAAAAAAAdM/PhHQnbw36EA_7kkZZMaVyPBZXiNfvpD6ACK4B/s320/%25E3%2582%25B9%25E3%2582%25AF%25E3%2583%25AA%25E3%2583%25BC%25E3%2583%25B3%25E3%2582%25B7%25E3%2583%25A7%25E3%2583%2583%25E3%2583%2588%2B2016-04-25%2B8.06.16.png" width="320" /></a><br />
<br />
I've been playing with XML to implement some basic XML features, but I'm not sure how to create a good interface. Any idea?<br />
<br />
<br />
Finally, this is a kind of important note. If you have used R on CasualConc and recently upgraded R to 3.3, you need to turn off/on the Use R for Visualization on Preferences. It looks like you need to re-install non-default packages after you upgraded to 3.3. I got one report that even after re-installing packages, some packages were not installed properly (not the packages CasualConc tries to install, but the packages those 'packages' depend). In that case, you need to manually reinstall those packages on R or RStudio. I'll see if I can add a function to check them or at least display error message from R (I haven't figured this out yet...).<br />
<br />Yasuhttp://www.blogger.com/profile/08489030458578691142noreply@blogger.com0tag:blogger.com,1999:blog-626625090667668862.post-39844317963693552202015-10-18T21:06:00.004+09:002015-10-18T21:06:58.058+09:00New feature showcaseThis is just a series of images that showcase the new features of CasualConc 2.0.<br />
<br />
Line Chart<br />
<div class="separator" style="clear: both; text-align: center;">
<a href="http://4.bp.blogspot.com/-flEeuaC9qdE/ViOJVKrKo-I/AAAAAAAAAYE/YSszxLcxFmA/s1600/2%2Bdemo_lineplot_hp2.png" imageanchor="1" style="margin-left: 1em; margin-right: 1em;"><img border="0" height="305" src="http://4.bp.blogspot.com/-flEeuaC9qdE/ViOJVKrKo-I/AAAAAAAAAYE/YSszxLcxFmA/s320/2%2Bdemo_lineplot_hp2.png" width="320" /></a></div>
<div class="separator" style="clear: both; text-align: center;">
<br /></div>
<div class="separator" style="clear: both; text-align: left;">
Line Chart</div>
<div class="separator" style="clear: both; text-align: center;">
<a href="http://4.bp.blogspot.com/-2AFsaDa6Ik0/ViOJQu50jnI/AAAAAAAAAXU/azN00O_eC9w/s1600/1%2Bdemo_lineplot_hp.png" imageanchor="1" style="margin-left: 1em; margin-right: 1em;"><img border="0" height="308" src="http://4.bp.blogspot.com/-2AFsaDa6Ik0/ViOJQu50jnI/AAAAAAAAAXU/azN00O_eC9w/s320/1%2Bdemo_lineplot_hp.png" width="320" /></a></div>
Bar Chart<br />
<div class="separator" style="clear: both; text-align: center;">
<a href="http://4.bp.blogspot.com/-upF5fp4XcnA/ViOJWvlaVLI/AAAAAAAAAYo/ZJnb8MsgKCA/s1600/3%2Bdemo_barplot_hp.png" imageanchor="1" style="margin-left: 1em; margin-right: 1em;"><img border="0" height="307" src="http://4.bp.blogspot.com/-upF5fp4XcnA/ViOJWvlaVLI/AAAAAAAAAYo/ZJnb8MsgKCA/s320/3%2Bdemo_barplot_hp.png" width="320" /></a></div>
<br />
Pie Chart<br />
<div class="separator" style="clear: both; text-align: center;">
<a href="http://4.bp.blogspot.com/-vnGeqHLpIdE/ViOJXjGm86I/AAAAAAAAAZA/yaUMrafKc-Y/s1600/4%2Bdemo_piechart_hp.png" imageanchor="1" style="margin-left: 1em; margin-right: 1em;"><img border="0" height="320" src="http://4.bp.blogspot.com/-vnGeqHLpIdE/ViOJXjGm86I/AAAAAAAAAZA/yaUMrafKc-Y/s320/4%2Bdemo_piechart_hp.png" width="318" /></a></div>
<br />
Radar Chart<br />
<div class="separator" style="clear: both; text-align: center;">
<a href="http://1.bp.blogspot.com/-RRUPk2PYeQM/ViOJXnmSFmI/AAAAAAAAAZI/tGcr8rwGl98/s1600/5%2Bdemo_radarchart_hp.png" imageanchor="1" style="margin-left: 1em; margin-right: 1em;"><img border="0" height="278" src="http://1.bp.blogspot.com/-RRUPk2PYeQM/ViOJXnmSFmI/AAAAAAAAAZI/tGcr8rwGl98/s320/5%2Bdemo_radarchart_hp.png" width="320" /></a></div>
<br />
Scatter Plot<br />
<div class="separator" style="clear: both; text-align: center;">
<a href="http://3.bp.blogspot.com/-8QD7PrQo0tw/ViOJXsihFHI/AAAAAAAAAZE/hLA3Cjx1ILE/s1600/6%2Bdemo_scatterplot.png" imageanchor="1" style="margin-left: 1em; margin-right: 1em;"><img border="0" height="295" src="http://3.bp.blogspot.com/-8QD7PrQo0tw/ViOJXsihFHI/AAAAAAAAAZE/hLA3Cjx1ILE/s320/6%2Bdemo_scatterplot.png" width="320" /></a></div>
<br />
Word Cloud (frequency)<br />
<div class="separator" style="clear: both; text-align: center;">
<a href="http://1.bp.blogspot.com/-vuxvBxS1lJs/ViOJYV3oIOI/AAAAAAAAAZU/zAOkqjsW56A/s1600/7%2Bdemo_wordcloud%2Bfreq.png" imageanchor="1" style="margin-left: 1em; margin-right: 1em;"><img border="0" height="281" src="http://1.bp.blogspot.com/-vuxvBxS1lJs/ViOJYV3oIOI/AAAAAAAAAZU/zAOkqjsW56A/s320/7%2Bdemo_wordcloud%2Bfreq.png" width="320" /></a></div>
<br />
Word Cloud (TF-IDF)<br />
<div class="separator" style="clear: both; text-align: center;">
<a href="http://4.bp.blogspot.com/-rygeggaBUYQ/ViOJYnaLGdI/AAAAAAAAAZY/5UohFZ22mN4/s1600/8%2Bdemo_wordcloud%2Btfidf.png" imageanchor="1" style="margin-left: 1em; margin-right: 1em;"><img border="0" height="285" src="http://4.bp.blogspot.com/-rygeggaBUYQ/ViOJYnaLGdI/AAAAAAAAAZY/5UohFZ22mN4/s320/8%2Bdemo_wordcloud%2Btfidf.png" width="320" /></a></div>
<br />
Hierarchical Cluster Analysis (dendrogram)<br />
<div class="separator" style="clear: both; text-align: center;">
<a href="http://1.bp.blogspot.com/-QAJeRi5EX0I/ViOJ6d-mREI/AAAAAAAAAZ4/2PL6LaJDmAg/s1600/9%2Bdemo_cluster.png" imageanchor="1" style="margin-left: 1em; margin-right: 1em;"><img border="0" height="315" src="http://1.bp.blogspot.com/-QAJeRi5EX0I/ViOJ6d-mREI/AAAAAAAAAZ4/2PL6LaJDmAg/s320/9%2Bdemo_cluster.png" width="320" /></a></div>
<div class="separator" style="clear: both; text-align: center;">
<br /></div>
<br />
Correspondence Analysis<br />
<div class="separator" style="clear: both; text-align: center;">
<a href="http://3.bp.blogspot.com/-UjQ07x0l8dg/ViOJQsTPGRI/AAAAAAAAAW8/vmzift9Agrw/s1600/10%2Bdemo_corresp.png" imageanchor="1" style="margin-left: 1em; margin-right: 1em;"><img border="0" height="317" src="http://3.bp.blogspot.com/-UjQ07x0l8dg/ViOJQsTPGRI/AAAAAAAAAW8/vmzift9Agrw/s320/10%2Bdemo_corresp.png" width="320" /></a></div>
<br />
Principal Component Analysis (factor scores)<br />
<div class="separator" style="clear: both; text-align: center;">
<a href="http://2.bp.blogspot.com/-mtbOAcQnH2U/ViOJQsO_atI/AAAAAAAAAXA/DVzpy5kqW0E/s1600/11%2Bdemo_pca_lit.png" imageanchor="1" style="margin-left: 1em; margin-right: 1em;"><img border="0" height="246" src="http://2.bp.blogspot.com/-mtbOAcQnH2U/ViOJQsO_atI/AAAAAAAAAXA/DVzpy5kqW0E/s320/11%2Bdemo_pca_lit.png" width="320" /></a></div>
<br />
Principal Component Analysis (factor loadings)<br />
<div class="separator" style="clear: both; text-align: center;">
<a href="http://1.bp.blogspot.com/-TRqenoYNDt8/ViOJRtgQr4I/AAAAAAAAAXM/GWvu3Fk21_w/s1600/12%2Bdemo_pca3.png" imageanchor="1" style="margin-left: 1em; margin-right: 1em;"><img border="0" height="320" src="http://1.bp.blogspot.com/-TRqenoYNDt8/ViOJRtgQr4I/AAAAAAAAAXM/GWvu3Fk21_w/s320/12%2Bdemo_pca3.png" width="319" /></a></div>
<br />
Multi-dimensional Scaling (MDS)<br />
<div class="separator" style="clear: both; text-align: center;">
<a href="http://3.bp.blogspot.com/-ThUGeHfsIXk/ViOJRovDbjI/AAAAAAAAAXE/9kG55Os4mpY/s1600/13%2Bdemo_mds.png" imageanchor="1" style="margin-left: 1em; margin-right: 1em;"><img border="0" height="307" src="http://3.bp.blogspot.com/-ThUGeHfsIXk/ViOJRovDbjI/AAAAAAAAAXE/9kG55Os4mpY/s320/13%2Bdemo_mds.png" width="320" /></a></div>
<br />
MDS (3D)<br />
<div class="separator" style="clear: both; text-align: center;">
<a href="http://3.bp.blogspot.com/-sPGyCvFIp84/ViOJRwOPNyI/AAAAAAAAAXI/f6PQRWGCIqQ/s1600/14%2Bdemo_mds_3d.png" imageanchor="1" style="margin-left: 1em; margin-right: 1em;"><img border="0" height="300" src="http://3.bp.blogspot.com/-sPGyCvFIp84/ViOJRwOPNyI/AAAAAAAAAXI/f6PQRWGCIqQ/s320/14%2Bdemo_mds_3d.png" width="320" /></a></div>
<br />
Network Analysis<br />
<div class="separator" style="clear: both; text-align: center;">
<a href="http://1.bp.blogspot.com/-rEkcTMyNww4/ViOJTLirXJI/AAAAAAAAAXw/BZ6HgT0dsHk/s1600/15%2Bdemo_network_hp.png" imageanchor="1" style="margin-left: 1em; margin-right: 1em;"><img border="0" height="304" src="http://1.bp.blogspot.com/-rEkcTMyNww4/ViOJTLirXJI/AAAAAAAAAXw/BZ6HgT0dsHk/s320/15%2Bdemo_network_hp.png" width="320" /></a></div>
<br />
Network Analysis<br />
<div class="separator" style="clear: both; text-align: center;">
<a href="http://3.bp.blogspot.com/-fQ4Eb_uopP8/ViOJTLXKTdI/AAAAAAAAAXo/36N661yuXaw/s1600/16%2Bdemo_network_hp2.png" imageanchor="1" style="margin-left: 1em; margin-right: 1em;"><img border="0" height="307" src="http://3.bp.blogspot.com/-fQ4Eb_uopP8/ViOJTLXKTdI/AAAAAAAAAXo/36N661yuXaw/s320/16%2Bdemo_network_hp2.png" width="320" /></a></div>
<br />
Concordance Plot<br />
<div class="separator" style="clear: both; text-align: center;">
<a href="http://3.bp.blogspot.com/-SI06o32RdEA/ViOJTKeVQmI/AAAAAAAAAXs/EplPG_aV7QI/s1600/17%2Bdemo_concordplot.png" imageanchor="1" style="margin-left: 1em; margin-right: 1em;"><img border="0" height="320" src="http://3.bp.blogspot.com/-SI06o32RdEA/ViOJTKeVQmI/AAAAAAAAAXs/EplPG_aV7QI/s320/17%2Bdemo_concordplot.png" width="318" /></a></div>
<br />
Random Forest (keyword extraction)<br />
<div class="separator" style="clear: both; text-align: center;">
<a href="http://2.bp.blogspot.com/-28qa61BrhCU/ViOKsX9waMI/AAAAAAAAAaE/iPOBLXSIQTQ/s1600/18%2Bdemo_randomforest.png" imageanchor="1" style="margin-left: 1em; margin-right: 1em;"><img border="0" height="316" src="http://2.bp.blogspot.com/-28qa61BrhCU/ViOKsX9waMI/AAAAAAAAAaE/iPOBLXSIQTQ/s320/18%2Bdemo_randomforest.png" width="320" /></a></div>
<div class="separator" style="clear: both; text-align: center;">
<br /></div>
Regular Expression Practice<br />
<div class="separator" style="clear: both; text-align: center;">
<a href="http://1.bp.blogspot.com/-hvAoP8S3qXc/ViOJUYVh1ZI/AAAAAAAAAX4/3WC2uHOo_8o/s1600/19%2Bdemo_regexpractice.png" imageanchor="1" style="margin-left: 1em; margin-right: 1em;"><img border="0" height="225" src="http://1.bp.blogspot.com/-hvAoP8S3qXc/ViOJUYVh1ZI/AAAAAAAAAX4/3WC2uHOo_8o/s320/19%2Bdemo_regexpractice.png" width="320" /></a></div>
<br />
Vocabulary Profiler<br />
<div class="separator" style="clear: both; text-align: center;">
<a href="http://4.bp.blogspot.com/-KOHycuqzNao/ViOJVbkf7HI/AAAAAAAAAYM/6Brtl7laCb4/s1600/20%2Bdemo_wordprofiler.png" imageanchor="1" style="margin-left: 1em; margin-right: 1em;"><img border="0" height="241" src="http://4.bp.blogspot.com/-KOHycuqzNao/ViOJVbkf7HI/AAAAAAAAAYM/6Brtl7laCb4/s320/20%2Bdemo_wordprofiler.png" width="320" /></a></div>
<br />
Tokenization (Japanese)<br />
<div class="separator" style="clear: both; text-align: center;">
<a href="http://3.bp.blogspot.com/-z-3zbh6FxNQ/ViOJVqtN8GI/AAAAAAAAAYQ/h5R9W1d8q-g/s1600/21%2Bdemo_token_j.png" imageanchor="1" style="margin-left: 1em; margin-right: 1em;"><img border="0" height="320" src="http://3.bp.blogspot.com/-z-3zbh6FxNQ/ViOJVqtN8GI/AAAAAAAAAYQ/h5R9W1d8q-g/s320/21%2Bdemo_token_j.png" width="290" /></a></div>
<br />
Tokenization (Korean)<br />
<div class="separator" style="clear: both; text-align: center;">
<a href="http://1.bp.blogspot.com/-Q-4vjjtI4BE/ViOJV9yYxAI/AAAAAAAAAYU/-SmC5azLurE/s1600/22%2Bdemo_token_k.png" imageanchor="1" style="margin-left: 1em; margin-right: 1em;"><img border="0" height="320" src="http://1.bp.blogspot.com/-Q-4vjjtI4BE/ViOJV9yYxAI/AAAAAAAAAYU/-SmC5azLurE/s320/22%2Bdemo_token_k.png" width="275" /></a></div>
<br />
Tokenization (Chinese)<br />
<div class="separator" style="clear: both; text-align: center;">
<a href="http://2.bp.blogspot.com/-C4gdwruckhg/ViOJWS-AzcI/AAAAAAAAAYc/IWvxg7pt3VU/s1600/23%2Bdemo_token_c2.png" imageanchor="1" style="margin-left: 1em; margin-right: 1em;"><img border="0" height="320" src="http://2.bp.blogspot.com/-C4gdwruckhg/ViOJWS-AzcI/AAAAAAAAAYc/IWvxg7pt3VU/s320/23%2Bdemo_token_c2.png" width="277" /></a></div>
<br />
Tokenization (Thai)<br />
<div class="separator" style="clear: both; text-align: center;">
<a href="http://4.bp.blogspot.com/-Fv9t1LqhB2o/ViOJWqTCyxI/AAAAAAAAAYk/sZzj3fZtSVI/s1600/24%2Bdemo_token_thai.png" imageanchor="1" style="margin-left: 1em; margin-right: 1em;"><img border="0" height="320" src="http://4.bp.blogspot.com/-Fv9t1LqhB2o/ViOJWqTCyxI/AAAAAAAAAYk/sZzj3fZtSVI/s320/24%2Bdemo_token_thai.png" width="261" /></a></div>
<br />
<div class="separator" style="clear: both; text-align: center;">
<br /></div>
<br />
<br />Yasuhttp://www.blogger.com/profile/08489030458578691142noreply@blogger.com0tag:blogger.com,1999:blog-626625090667668862.post-73620833464729327882015-10-04T22:23:00.000+09:002015-10-05T08:10:59.003+09:00CasualConc 2.0 betaI'm pleased to announce the immediate release of CasualConc 2.0 beta, a new version of CasualConc.<br />
<br />
For the last couple of years, I've mostly worked on CasualTranscriber, which is now version 2.5. Now it has most of the functions I wanted to use. So I decided to update CasualConc.<br />
<br />
CasualConc was first released in 2008 and has been constantly updated up until a couple of years ago. The release of OS X 10.9 Mavericks changed something in the developing environment and made me unable to update the project. This was partly due to the scripting language/bridge I used to write CasualConc.<br />
<br />
This new version of CasualConc is almost entirely rewritten. It is written in RubyMotion, which is kind of a successor to RubyCocoa.<br />
<br />
Due to the difference in regular expression engine, the speed of text processing is a bit slower with RubyMotion, it is compensated by parallel processing. Also due to the same reason, your search result might be slightly different from the one with the older version. This is because the two engines treat some symbols/marks differently.<br />
<br />
This new version may not have all the functions you use with 1.9.x, so if you need a particular feature, please let me know. I'll see if I can bring it back to the new version.<br />
<br />
CasualConc 2.0 has the following new and improved features:<br />
<br />
<ul>
<li>text mode</li>
<li>opening a result window for Collocation and Word Count</li>
<li>much faster File Info tools</li>
<li>collocation frequency table</li>
<li>utilization of build-in tagger and integration of TreeTagger (including an installer)</li>
<li>integration of a statistical environment R (needs to be installed separately)</li>
<li>keyword extraction with Mann-Whitney-U and Random Forest (requires R)</li>
<li>visualization of frequency information (requires R)</li>
<li>and more...</li>
</ul>
<br />
A manual, which covers most of the features, is available and help is included in the application.<br />
<br />
CasualConc 2.0 beta is compatible with Mac with OS X 10.9 Mavericks or later, including 10.11 El Capitan.<br />
<br />
It is very likely that this version still has many problems. Please try it and if you find any problems, your feedback is greatly appreciated.<br />
<br />
<b><a href="http://www.slideshare.net/yasuimao/casualconc" target="_blank">This slideshow</a></b> (in Japanese) presents new features. Text is minimal (mostly screenshots).Yasuhttp://www.blogger.com/profile/08489030458578691142noreply@blogger.com0tag:blogger.com,1999:blog-626625090667668862.post-76801939469799859252014-10-26T18:51:00.001+09:002014-10-26T18:51:24.082+09:00Yosemite compatibility reportI released Yosemite-compatible versions of the following application.<br />
<br />
CasualConc<br />
CasualPConc<br />
CasualMultiPConc<br />
CasualMecab<br />
CasualTextractor<br />
CasualTreeTagger<br />
IPATypist<br />
<br />
CasualTranscriber 2.5 is already compatible with Yosemite.<br />
<br />
If you encounter any problem, please let me know.Yasuhttp://www.blogger.com/profile/08489030458578691142noreply@blogger.com0tag:blogger.com,1999:blog-626625090667668862.post-57892564490995864182014-10-21T17:24:00.002+09:002014-10-21T22:11:16.975+09:00Yosemite issueAs you might have tried, CasualConc and other applications do not run on OS X 10.10 Yosemite. This is because my applications are written in RubyCocoa, which depends on Ruby 1.8.x. But Apple decided to drop Ruby 1.8.7 on Yosemite (Ruby 2.0.0 is there).<br />
<br />
I'm working with the maintainer of RubyCocoa to work around this issue (most likely apps will include a port of Ruby 1.8 within the apps). So far, I have successfully run it on Yosemite, but haven't had time to test it. If you are interested in helping testing the functionality on Yosemite, please email me to the address on the site.<br />
<br />
<br />Yasuhttp://www.blogger.com/profile/08489030458578691142noreply@blogger.com0tag:blogger.com,1999:blog-626625090667668862.post-14466122226503323132013-10-12T03:40:00.002+09:002013-10-12T03:40:40.237+09:00Another long while...I haven't updated this blog for a long time mostly because I haven't done much to CasualConc. I've made a few changes and fixed a few bugs. You can find them on What's new page on the main site.<br />
<br />
I've done some work on CasualTranscriber. I've added a database function and made some changes to the subtitle function. Those are mainly for language pedagogy purposes, so if you use CasualTranscriber for other purposes, they are of not much use. I've also written a manual in Japanese. I'll try to write an English version of it in the coming months. <br />
<br />
What I've spent most of the time before the summer was on a GUI front-end of the statistical environment R. It only has basic functions and not very flexible. My plan is to make it a bridge to learn R. I'm not sure how long this will take before it is useful, but I'll see.<br />
<br />
<br />Yasuhttp://www.blogger.com/profile/08489030458578691142noreply@blogger.com0tag:blogger.com,1999:blog-626625090667668862.post-32574008274443378522013-01-30T22:41:00.001+09:002013-01-30T22:41:20.183+09:00It's been a while...I haven't posted here for a long time. I've been very busy this semester and I also started a new personal project. I've been updating CasualConc and a few other apps since I last posted here. So if you haven't updated CasualConc in the last 3-4 months, please try the latest build.<br />
<br />
Other bug fixes, I mostly make changes based on users' feedback, which I receive not very often. So if you have some ideas, please let me know. I might not be able to answer all your requests, but I'll try.<br />
<br />
Also, though I really want to update the documentation, I must admit I'm a way behind. I started updating the CasualConc 'how to use', but haven't been able to work on it since last summer. I hope I can make a progress during the spring break...Yasuhttp://www.blogger.com/profile/08489030458578691142noreply@blogger.com0tag:blogger.com,1999:blog-626625090667668862.post-30832241676526273092012-09-26T04:07:00.001+09:002012-09-26T04:07:39.662+09:00CasualConc bug fixes and moreSince I last posted here, I made a few changes and fixed bugs.<br />
<br />
<b>Features<br /></b><b>Concord</b><br />- some minor interface changes (sort, context word, etc)<br />
-
you can now save the Concord result table as PDF (Print or Command + P)<br />
- more accurate keyword coloring in the context view when tags are suppressed <br />
<br />
<b>Word Count</b><br />- Phrase-frames list (i.e. in * of, etc.): Preferences->Others, Word Count, Advanced Mode<br />
<br />
<br />
<b>General</b><br />- the minimum window size is set to 800 x 550px<br />- some minor interface changes to accommodate the minimum window size<br /><br /><br /><b>Bug fixes<br /><br />Concord</b>- saving the results should work now<br />- keyword coloring in the context view in the Database mode<br />
<br />
<b>General</b><br />
- should run in non-Japanese environment (it didn't)<br />
<br />
<br />
I also started to update the documentation. The ones with (updated) on the How to Use page are partially updated.Yasuhttp://www.blogger.com/profile/08489030458578691142noreply@blogger.com0tag:blogger.com,1999:blog-626625090667668862.post-72990143778483907912012-06-28T09:23:00.000+09:002012-06-28T09:23:42.754+09:00CasualConc beta bug fixWith the recent changes, I introduced one serious bug to the Concord data save function, so I fixed it. Now it should work fine.<br />
<br />
I also made a few more changes to the Concord Plot function.<br />
<br />
<b>Concord Plot</b><br />
- you can divide the plots by a fixed number of units (characters/words)<br />
<br />
Let me know if you find any problem.<br />
<br />Yasuhttp://www.blogger.com/profile/08489030458578691142noreply@blogger.com0tag:blogger.com,1999:blog-626625090667668862.post-5376696870928570772012-06-16T23:43:00.001+09:002012-06-16T23:43:57.283+09:00CasualConc beta minor updateUpon requests, I added a few minor features to CasualConc beta. <br /><br /><b>Concord</b><br />-
If you enable lemmatization and apply lemmatization to Concord search
words, you can sort the Concord results by lemmas of the key.<br />- When you export Concord results, you can insert characters before and after the key.<br /><br /><b>Concord Plot<br />
</b>- export plot data (File -> Save Table)<br />
- import to the plots of the same corpus/files (File -> Open Saved Data)<br />
- variable plot widths for each file<br />
- fixed widths for the plot view<br />
- refresh to reflect changes to the Concord results (by deletion) <br />
<br />
<br />
<br />
<b>Bug fix</b> <br />
- context tag handling in Concord <br />
<br />
<br />
With the changes to Concord, you can now sort by lemma, and then by each word. This is not available to the keyword group function at the moment. I'll see if I can apply to it if I get requests.<br />
<br />
If you use the lemmatization function with word family lists, you can sort by the word family first, and then by each word.<br />
<br />
If you want to insert characters before and after the key when you export Concord results, go to Preferences -> Concord <br />
<br />
With the Concord Plot changes, you can now put three different search results on one plot. First, search any word(s)/phrase(s) on Concord with the plotting on, then save the data on the Plot view. If you do this with another search and save the data. Then with the same corpus/files, search new word(s)/phrase(s) and then on the Plot view, open the saved data one by one. You can set the color of plots in Preferences -> Others and check Insert markers and type any characters before and after KEY.<br />
<br />
To make the plot widths relative to the file length, Preferences -> Others -> Concordance Plot, and select Relative (or Same Width for the same plot width for all the files).<br />
<br />
If you want to make the plots width wider, set Width (print) to whatever number (pixels) you want and check Apply to On-screen.<br />
<br />
I might move the settings to new tab later, but for now, the settings are under Others.<br />
<br />
The new beta version is 1.9.5. <br />
<br />
If you find any problem, please let me know.Yasuhttp://www.blogger.com/profile/08489030458578691142noreply@blogger.com0tag:blogger.com,1999:blog-626625090667668862.post-75792341604930855282012-04-23T20:08:00.002+09:002012-04-23T20:08:20.275+09:00Bug fix to CasualConc betaWith a few minor changes I made in the last couple of weeks, I introduced yet another bugs.<br />
<br />
If you tried to search anything in Concord, you had a warning message,
though you still should have been able to search words/phrases. Now I
addressed this bug with the latest build (20120423).<br />
<br />
<br />The change I made was to allow wildcard character only search in Concord in Word(s) mode and non-word character search in Character/Regular Expression search modes. <br />
<br />
Also I made one minor feature addition upon request. If you add <sec="x"> (x is any string of your choice) to your corpus files and search Concord with Concord Plot, you will see red lines where the tag(s) is inserted. So if you have any section break in your file(s), you can mark them on plots. This is still an experimental feature and hopefully I can make it more a little more flexible as soon as I have time to work on it more.</sec="x"><br />
<br />
In any case, if you have downloaded CasualConc in the last couple of weeks, please get the latest beta build.<br />
<br />Yasuhttp://www.blogger.com/profile/08489030458578691142noreply@blogger.com0tag:blogger.com,1999:blog-626625090667668862.post-41013428553856053892012-04-04T02:18:00.000+09:002012-04-04T02:18:02.384+09:00Well, another bug fix...I think I fixed a bug in the Database mode last time, but it turned out I introduced another bug. Last time I fixed the process of converting search strings with wild card characters to SQL query strings, but it the change I implemented causes SQL query errors. But it should be fixed now unless your search string is full of wild card characters.<br />
<br />
So, if you use the Database mode with wild card characters, please download the latest beta (1.9.2, 20120404).<br />
<br />
And please let me know if you find any other bugs.Yasuhttp://www.blogger.com/profile/08489030458578691142noreply@blogger.com0tag:blogger.com,1999:blog-626625090667668862.post-10697490248266127872012-03-28T22:15:00.003+09:002012-03-28T22:15:59.881+09:00CasualConc beta bug fixI've been fixing some minor bugs and adding a few minor features in the past few weeks, but I found a bit more serious bug, so I fixed it.<br />
<br />
This only applies to the Database mode searching with wild card characters. When you search in Concord (and probably in Cluster and Collocation) using wild card characters, depending on the combination of words and wild card characters, a search could have taken very long time. This was a bug and I thought I fixed it long time ago, but it looks like I only did it for the File mode. I applied the same fix to the Database mode, so this shouldn't be an issue any more.<br />
<br />
Since I updated the blog last time, I made a few minor changes and various minor bug fixes. A couple of notable features are on the Corpus File Information. I think I posted either here or on the main site that I added a new frequency table feature to count groups of words for each corpus/database. Now you can select either you count all the words in a corpus/database or in each file in a corpus/database. I also added a function to save the Corpus File Information table results as a file (not export as a CSV file) and later import it back to CasualConc.<br />
<br />
Anyway, if you use the Database mode frequently, you might want to update to this version (1.9.2, 20120328 or later).<br />
<br />
If you find any other problem, please let me know.Yasuhttp://www.blogger.com/profile/08489030458578691142noreply@blogger.com2tag:blogger.com,1999:blog-626625090667668862.post-76716215510003809302012-03-11T01:30:00.001+09:002012-03-11T01:30:19.345+09:00CasualConc beta updateIt has been a known issue that CasualConc crashes when viewing results on tables. It is actually not a bug of CasualConc itself, but a bug of RubyCocoa which CasualConc depends on. To address this issue, I decided turn off garbage collection on Ruby side when not processing texts, which means when you are viewing results. This might increase memory usage, but will provide much better experience (or at least I hope so).<br />
<br />
Also, I fixed a few minor bugs related to the lemmatization, spelling variation, and stop word processing. <br />
<br />Please try this new beta version 1.9.1 and let me know if you encounter any problem. It is available on the CasualConc site. If this is much more problematic than the table view bug, I revert it back to the previous build.Yasuhttp://www.blogger.com/profile/08489030458578691142noreply@blogger.com0tag:blogger.com,1999:blog-626625090667668862.post-1919690201678114862012-02-26T23:55:00.000+09:002012-02-26T23:55:26.500+09:00CasualConc beta update and moreI haven't updated this blog for a while, but I have made a few bug fixes and a few feature additions. I also added a small utility program that accompanies CasualConc.<br />
<br />
<b>Bug fixes</b><br />
<b>Concord</b><br />
- Tag search mode should work now<br />
<br />
<b>Word Count</b><br />
- The counts of files that particular words appear are not correctly displayed in the Database mode with the lemmatize option on<br />
- Keyness statistics are correctly calculated in the Database mode with lemmatize option on<br />
- Specified string search mode is functional in the Database mode<br />
- Tag list creation should work now<br />
<br />
<b>Enhancements</b><br />
<b>General</b><br />
- Corpus/database switching in the Advanced Corpus Handling mode is now available in Concord and Collocation; you can switch them directly on each tool<br />
- When exporting results in CSV or Tab-delimited format, you can select .txt in addition to .csv, though the default is still .csv.<br />
- You can specify context tags (or any strings) to limit the search only to specific section(s) (See Preferences -> Tags)<br />
- Enhancements on Context Tags to Ignore Settings in Preferences<br />
- You can now specify ** to ignore any character in brackets (i.e. <>)<br />
<div class="separator" style="clear: both; text-align: center;">
<a href="http://3.bp.blogspot.com/-epTq8ymSwP0/T0pAiEmwVZI/AAAAAAAAAR4/bGc-sbk0ouo/s1600/tag+ignore+special+chars.png" imageanchor="1" style="margin-left: 1em; margin-right: 1em;"><img border="0" height="22" src="http://3.bp.blogspot.com/-epTq8ymSwP0/T0pAiEmwVZI/AAAAAAAAAR4/bGc-sbk0ouo/s320/tag+ignore+special+chars.png" width="320" /></a></div>
- You can now add files to a selected corpus/database by drag&drop when files are shown in Advanced Mode<br />
<br />
<b>Concord</b><br />
- You can edit the preset sort orders in Preferences -> Concord (this might have been introduced before)<br />
<br />
<b>Word Count</b><br />
- Specified strong search has search history<br />
- Search function on the result table is enhanced<br />
<br />
<b>Corpus File Information</b><br />
- You can count sets of words/phrases for each selected corpus/database in Word Group Freq Table; the format is as follows: Group Name->word1,word2,...<br />
<br />
<div class="separator" style="clear: both; text-align: center;">
<a href="https://sites.google.com/site/casualconc/what-s-new/casualconcbetabugfix-1/read%20words.png?attredirects=0" style="margin-left: 1em; margin-right: 1em;"><img border="0" src="https://sites.google.com/site/casualconc/_/rsrc/1329680141256/what-s-new/casualconcbetabugfix-1/read%20words.png" /></a></div>
<br />
<b>CasualConc Viewer</b><br />
It's been reported that CasualConc crashes when scrolling fast on result tables. This is not a bug of this particular program but a bug of the program environment CasualConc depends on (RubyCocoa). I've asked the maintainer of RubyCocoa to fix the bug, but it hasn't been resolved yet, so I decided to create a viewer app. This viewer is written in MacRuby and table view is much more stable.<br />
<b> </b><br />
To use the viewer, after you create KWIC results or any lists on a table, go to Misc -> Open with Viewer. If you want to view the results on the right table on Cluster and Word Count, go to Misc -> Open with Viewer (Right).<br />
<br />
<br />
The viewer is just a viewer, so you can't do much. I might add a few more functions, but if you want to export results or calculate statistics, you should do it on CasualConc.<br />
<br />
<br />
<b>e-lemma file</b><br />
With the current beta (1.9.0), e-lemma file is included in the disk image (with a permission). e-lemma file is a lemma list file created by Prof. Yasumasa Someya at Kansai University. You can import the list for lemmatization on CasualConc.<br />
<br />
Another file, a-e spelling differences, is a list of American/British spelling pairs. You can also import this list to CasualConc.<br />
<br />
You can use the lemmatization function and the spelling variation function. When applied to a search word, you can search words of the same lemma as well as spelling variants.<br />
<br />
<br />
If you have any other bugs, please let me know.<br />
<br />
<br />
<br />
<div style="display: block; text-align: left;">
</div>
<div style="display: block; text-align: left;">
</div>
<div style="display: block; text-align: left;">
</div>
<div style="display: block; text-align: left;">
</div>
<div style="display: block; text-align: left;">
</div>Yasuhttp://www.blogger.com/profile/08489030458578691142noreply@blogger.com0tag:blogger.com,1999:blog-626625090667668862.post-73442101740585359212011-12-04T09:50:00.001+09:002011-12-04T09:54:27.910+09:00CasualConc beta bug fixI got a bug report, so I fixed it.<br />
<br />
The problem was in File Info. When exporting a Word Freq Info result, low frequency counts of individual files were sometime omitted. This was because the cells with no numbers were not skipped. So when it reached the number of types in a file, CasualConc stopped handling the data for that file for exporting. Internally, the frequency counts were stored (you could see them on the window), so I made it sure that CasualConc handle all the data properly.<br />
<br />
If you find any problem, please let me know.Yasuhttp://www.blogger.com/profile/08489030458578691142noreply@blogger.com2tag:blogger.com,1999:blog-626625090667668862.post-71313165964437593312011-11-27T03:05:00.001+09:002011-11-27T03:10:40.385+09:00CasualTranscriber alphaSince the current version of CasualTranscriber is quite buggy because of the programming language I use, I decided to rewrite it in another language. Now, the new version has most of the basic functions for transcribing, so I decided to release it as an alpha version. It does not have all the functions that the current version has and is more likely to have bugs, but it is, in a sense, more stable (at least in my environment).<br />
<br />
If you are using CasualTranscriber, especially on Lion, please try it and let me know what you think.Yasuhttp://www.blogger.com/profile/08489030458578691142noreply@blogger.com0tag:blogger.com,1999:blog-626625090667668862.post-74378199784968869522011-09-30T23:16:00.000+09:002011-09-30T23:16:24.688+09:00Lion compatibility and ...It's been a couple of months after Lion was released. Today, I made some changes to application so that they run on Lion (at least). I don't have time to test all the features, so if you use them on Lion, please let me know if they work fine or have some problems.<br />
<br />
And finally, I decided to drop the Leopard support. They might run, but I will not test the compatibility any more (I finally upgraded my Leopard machine to Snow Leopard).<br />
<br />
If you really need to use them on a Leopard machine, let me know.<br />
<br />
Also, I've added a few minor features to CasualConc. It now has a function to test your regular expressions. Go to Main Menu -> Window -> Regex Test Panel. This is an experimental feature, so let me know what you think.<br />
<br />
Another minor feature is a support of multi-line regular expression search. You can enable this on the Preference window.<br />
<br />
I've also started to rewrite CasualTranscriber. I labeled it as alpha, but most of the basic features of the current version are added. It only runs on 64-bit Mac with Snow Leopard or Lion. If you have chance to try it, let me know.Yasuhttp://www.blogger.com/profile/08489030458578691142noreply@blogger.com0tag:blogger.com,1999:blog-626625090667668862.post-27170451121579684292011-07-28T00:26:00.000+09:002011-07-28T00:26:11.471+09:00CasualConc on Lion updateI figured out how to include sqlite3-ruby in CasualConc, so I made changes to the latest beta build and labeled it as CasualConc beta for Lion. It is available on the CasualConc download page. Please try it and let me know if it doesn't work (or it does).Yasuhttp://www.blogger.com/profile/08489030458578691142noreply@blogger.com2tag:blogger.com,1999:blog-626625090667668862.post-13573946787236032012011-07-25T00:13:00.003+09:002011-07-28T00:31:15.484+09:00CasualConc on LionI haven't been able to check the compatibility personally, but I got a couple of reports that CasualConc 1.0.x does not run on Lion. <br />
<br />
<b>UPDATE</b>: As you can see above (at least for now), I figured out how to include sqlite3-ruby, so you don't have to follow this process unless the beta for Lion doesn't run.<br />
<br />
<b>To use CasualConc beta on Lion, you need to install 'sqlite3-ruby' and run a beta build 2011/07/21 or later.<br />
</b>1. open Mac <b>App Store.app</b> and download/install Xcode (free) [you need Mac App Store account]<br />
2. open <b>Terminal.app</b><br />
3. type <b>sudo gem install sqlite3-ruby --version "= 1.2.5"</b> and hit the enter key (and enter the password of your account on Mac)<br />
<br />
<br />
I will check this as soon as I can access Lion (hopefully within a couple of weeks), but I heard the beta ran after installing sqlite3-ruby.<br />
<br />
<br />
I will also check other apps and see if I can make them run on Lion (if they don't).Yasuhttp://www.blogger.com/profile/08489030458578691142noreply@blogger.com0tag:blogger.com,1999:blog-626625090667668862.post-7206223313207644302011-03-10T13:41:00.000+09:002011-03-10T13:41:30.749+09:00Cumulative bug fixes and feature additions to CasualConc betaI haven't posted anything here for a while because I've been busy with developing a new version of CasualConc and preparing for the classes I start teaching next month. <br />
<br />
In the last few months, I added a few new features and fixed bugs (mostly the ones I introduced when I added these new features).<br />
<br />
<b>Bug fixes</b><br />
<br />
<b>General</b><br />
- searching multiple words/phrases in Word (wildcard) mode using a slash (/) matched words/phrases that included searched words/phrases<br />
<br />
<b>Concord</b><br />
- keyword was not correctly colored in the context view when searched in the database mode with File as Scope of Context<br />
- keyword was not colored correctly in the context view when used in the database mode (introduced with the above change)<br />
- saved results did not display correctly<br />
<br />
<b>Feature additions</b><br />
<br />
<b>Concord</b><br />
- added positions in a file as an sort option when File is selected as Scope of Context<br />
<br />
<b>File info</b><br />
- added standardized TTR (TTR in every 1000 words)<br />
<br />
<b>Concordance Plot</b><br />
- you can reflect the changes you made to Concord results<br />
- able to export selected/all plots as a single PDF file.<br />
<br />
<br />
If you find any other bugs or want to see some other features, please let me know. I'll try to fix bugs as soon as possible and I will try to add requested features as much as possible (unless they are technically too difficult).Yasuhttp://www.blogger.com/profile/08489030458578691142noreply@blogger.com3tag:blogger.com,1999:blog-626625090667668862.post-20903355656650504052010-11-04T17:03:00.000+09:002010-11-04T17:03:51.248+09:00Bug fixes to CasualConc Beta 1.8I found a few bugs related to Corpus/Database file handling in the Advanced mode. Also, a serious bug was reported, so I fixed them and uploaded the latest beta to the site. If you have downloaded beta 1.8, please visit the site and download the latest beta. The latest beta is also version 1.8, but if you could run and check About CasualConc, it should say Version 1.8 (20101104).<br />
<br />
If you still cannot run this version or find another bug, please report it to me. You can email me or post comment on the blog or send message on Twitter.Yasuhttp://www.blogger.com/profile/08489030458578691142noreply@blogger.com0tag:blogger.com,1999:blog-626625090667668862.post-52326388829995239132010-11-01T12:59:00.002+09:002010-11-01T15:00:04.781+09:00A minor update to CasualConc BetaI added a few features (some new and some enhanced) to CasualConc Beta. I also enabled an experimental gapped n-gram list feature.<br />
<br />
Documentation is not updated, so you might need to figure out how to use some of the features.<br />
<br />
<b>General</b><br />
- Spelling variation feature. You can register spelling variations (i.e. analyze-analyse) and use them in Concord/Cluster/Collocation searches as well as in Word List.<br />
- You can assign different corpus/database file to the left and right tables in Cluster/Word Count in Advanced Corpus Handling Mode.<b><br />
</b><br />
<b><br />
Concord</b><br />
- Sorting now has 4th elements. <br />
<br />
<b>Cluster</b><br />
- longer cluster search (up to 8 words)<br />
<br />
<b>Word List</b>- Gapped n-gram (3-5 gram)<br />
<b><br />
Concordance Plot</b><br />
- You can export selected concordance plot as JPEG files (individually) or print them (= export as PDF).<br />
<br />
<br />
The spelling variation feature is to accommodate word search/word list creation in languages that have spelling variations (i.e. American/British English). Once you create a spelling variation list, you can use information on it when searching a word/creating a word list. There is one problem with this feature. Unless your corpus and spelling variation list are not pos-tagged, CasualConc cannot distinguish the same spelling of different word classes (i.e. analyses [v] vs analyses [n]). A sample list of spelling variations is included in the disk image. You can import it to CasualConc to see how this feature works. You might want to create your own list (there might be some errors in my list and it is far from complete).<br />
<br />
For corpus handling, I added a feature to assign different corpus/database file to the left/right tables in Cluster and Word Count. This is available in Advanced Corpus Handling Mode. If you have more than one corpus or database file registered on the table and check more than one corpora or database files, you can select one corpus/database file or All for each of the two tables in Cluster and Word Count. You need to do this in File view.<br />
<br />
In Concord, you can select 4th sorting position. I don't know how useful this feature is, but I wanted this from time to time, so I added it. Also related to Concord is Concordance Plot export. Now you can export Concordance Plots. There are two types. One is to export selected plots individually as JPEG files. You can select the ones you want to export on the plots. Another is to print the selected plots. Thanks to OS X's "Save as PDF" feature, you can save the plots as a single PDF file. Now that you can export plots, you might want to change the size of plots, I guess. So I added a feature to change the hight and width of plot boxes. You can set them in Preferences -> Others.<br />
<br />
In Cluster, you can create a Cluster list of up to 8 words (= 7 + search word).<br />
<br />
In Word List, you can search gapped n-grams (2-5 grams). This feature was already introduced as an experimental feature. I enhanced it a bit and enabled it. I'm thinking about enhancing this feature, but I decided to release it before that hoping to get some feedback (to decide how to enhance this feature).<br />
<br />
There might be bugs related to these new features. Any feedback and/or bug report is welcome.Yasuhttp://www.blogger.com/profile/08489030458578691142noreply@blogger.com0