Posts

Showing posts from July, 2009

Project Torngat: Building Large-Scale Semantic 'Maps of Science' with LuSql, Lucene, Semantic Vectors, R and Processing from Full-Text

Image
Project Torngat is a research project here at NRC - CISTI   [ Note that I am no longer at CISTI and that I am now continuing this work at Carleton University - GN 2010 04 07 ] that looks to use the full-text of journal articles to construct semantic journal maps for use in -- among other things -- projecting article search results onto the map to visualize the results and support interactive exploration and discovery of related articles, term and journals. Starting with 5.7 million full-text articles from 2200+ journals (mostly science, technology and medical (STM)), and using LuSql , Lucene , Semantic Vectors , R , and processing , a two dimensional mapping of a 512 dimension semantic space was created which revealed an excellent correspondence with the 23 human-created journal categories: Semantic Journal Space of 2231 Journals Scaled to Two Dimensions This initial work was initiated to find a technique that would scale, and follow-up work is looking at integrating t

Springer LNCS, or, How not to do alerts!

Image
I subscribe to Springer Lecture Notes in Computer Science (LNCS) alerts. Several times now, I have received alerts when there was no web content at the URLs that they sent me. Very annoying, wasting my time. Please, this is 2009: try and make things work! . The latest was yesterday: at Mon, 20 Jul 2009 14:53:35 -0700 (PDT) I got an alert email from Springer LNCS: Dear Glen Newton, We are pleased to deliver your requested table of contents alert for a new volume of "Lecture Notes in Computer Science", subseries: "Lecture Notes in Artificial Intelligence". Volume 5632: Machine Learning and Data Mining in Pattern Recognition by Petra Perner is now available on the SpringerLink web site at http://springer.r.delivery.net/r/r?2.1.Ee.2Tp.1gRdiL.ByTshW..N.I9y2.3DBm.bW89MQ%5f%5fDJNcFRf0 Going to this page (as of Tues 14:26 ET July 21 2009, ~24hrs later), or to any of the URLs for the articles (including DOIs, like http://dx.doi.org/10.1007/978-3-642-03070-3_5 ) gives me -

Emacs 'mode' and learning `modes`

Image
I've used emacs as my primary editor, (emersive?) environment and de fact o almost-OS for about 20 years now. I read and send my email in it ( vm ), write/run/debug Java in it ( JDEE ), edit and compile my LaTeX in it, edit all other files with it, sometimes with complex macros that others would use Perl to do, and interact with shells inside of it. In the past I've edited and debugged C and C++, HTML and XML in various foo ML modes. The only other major thing I have running on my workstation is my web browser (//and occasionally OpenOffice for reading Word files//). Of course, I will have additional emacs windows open on the 3 or 4 servers I am editing and running code on (and also use tramp to transparently edit remote files). Yes, I've tried Eclipse . I know it quite well: I've even written an Eclipse plugin and published a paper about it ( Takaka: Eclipse Image Processing Plug-in ) . But it does not work for me like emacs does. If Eclipse works for you : that