Log of /pkg/R/corpus.R

Revision 1481 - (view) (download) (annotate) - [select for diffs]
Modified Sat May 20 10:28:00 2017 UTC (23 months ago) by feinerer
File length: 8767 byte(s)
Diff to previous 1467 , to selected 1357
Support TIF for DataframeSource

See Text Interchange Formats (TIF, https://github.com/ropensci/tif) and
readtext (https://github.com/kbenoit/readtext).

Revision 1467 - (view) (download) (annotate) - [select for diffs]
Modified Sun Jan 22 18:06:19 2017 UTC (2 years, 3 months ago) by khornik
File length: 7858 byte(s)
Diff to previous 1460 , to selected 1357
Register native routines.

Revision 1460 - (view) (download) (annotate) - [select for diffs]
Modified Mon Jan 9 17:01:04 2017 UTC (2 years, 3 months ago) by feinerer
File length: 7852 byte(s)
Diff to previous 1458 , to selected 1357
Check before forcing as.character()

as.character() strips attributes including names, so avoid the call if the
content is already a character vector.
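
The behaviour this commit message describes can be reproduced in base R. The sketch below is illustrative only: the helper name ensure_character() is hypothetical and is not taken from corpus.R, and the document names are made up.

```r
# A named character vector, standing in for document content with IDs.
x <- c(doc1 = "first text", doc2 = "second text")

# as.character() returns the values but drops the names attribute.
stripped <- as.character(x)
is.null(names(stripped))

# Hypothetical helper: only coerce when the input is not already character,
# so that names on character input survive.
ensure_character <- function(content) {
  if (is.character(content)) content else as.character(content)
}

kept <- ensure_character(x)
names(kept)

# Non-character input is still coerced.
ensure_character(1:3)
```

The guard costs one is.character() check and avoids an unnecessary copy when the content is already in the right type.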

Revision 1458 - (view) (download) (annotate) - [select for diffs]
Modified Sun Jan 8 18:46:50 2017 UTC (2 years, 3 months ago) by feinerer
File length: 7804 byte(s)
Diff to previous 1445 , to selected 1357
Ensure character content

Revision 1445 - (view) (download) (annotate) - [select for diffs]
Modified Sun Oct 9 09:30:58 2016 UTC (2 years, 6 months ago) by feinerer
File length: 7790 byte(s)
Diff to previous 1440 , to selected 1357
Speed up termFreq(), general cleanup

- Avoid parallel::mclapply()
- Use custom .table()
- Use rep.int(), rep_len() and lengths()
- Fix typos
- Shorten overlong lines
- Consistent formatting

Revision 1440 - (view) (download) (annotate) - [select for diffs]
Modified Sat Jul 30 06:34:57 2016 UTC (2 years, 8 months ago) by feinerer
File length: 7792 byte(s)
Diff to previous 1437 , to selected 1357
Corpus() now chooses between SimpleCorpus and VCorpus based on its arguments

Revision 1437 - (view) (download) (annotate) - [select for diffs]
Modified Wed Jul 13 19:23:49 2016 UTC (2 years, 9 months ago) by feinerer
File length: 7372 byte(s)
Diff to previous 1419 , to selected 1357
Add SimpleCorpus

SimpleCorpus provides a corpus which is optimized for the most common usage
scenario: importing plain texts from files in a directory or directly from a
vector in R, preprocessing and transforming the texts, and finally exporting
them to a term-document matrix. The aim is to boost performance and minimize
memory pressure. It loads all documents into memory, and is designed for
medium-sized to large data sets.

Revision 1419 - (view) (download) (annotate) - [select for diffs]
Modified Sat May 2 17:23:47 2015 UTC (3 years, 11 months ago) by feinerer
File length: 6081 byte(s)
Diff to previous 1411 , to selected 1357
Sync format()/print() with NLP

Revision 1411 - (view) (download) (annotate) - [select for diffs]
Modified Sat Feb 28 18:16:54 2015 UTC (4 years, 1 month ago) by feinerer
File length: 6097 byte(s)
Diff to previous 1409 , to selected 1357
Let as.list.[PV]Corpus() return names

Revision 1409 - (view) (download) (annotate) - [select for diffs]
Modified Fri Feb 27 16:10:18 2015 UTC (4 years, 1 month ago) by feinerer
File length: 6041 byte(s)
Diff to previous 1404 , to selected 1357
Add as.VCorpus.list()

Revision 1404 - (view) (download) (annotate) - [select for diffs]
Modified Tue Feb 17 18:04:22 2015 UTC (4 years, 2 months ago) by feinerer
File length: 5990 byte(s)
Diff to previous 1397 , to selected 1357
Avoid (rather expensive) structure()

Revision 1397 - (view) (download) (annotate) - [select for diffs]
Modified Fri Sep 12 19:30:27 2014 UTC (4 years, 7 months ago) by feinerer
File length: 6051 byte(s)
Diff to previous 1383 , to selected 1357
Add open() and close() for sources

Useful for sources with complex or expensive setup, e.g., database connections
or file handles.

Revision 1383 - (view) (download) (annotate) - [select for diffs]
Modified Thu May 29 07:32:14 2014 UTC (4 years, 10 months ago) by feinerer
File length: 5981 byte(s)
Diff to previous 1379 , to selected 1357
Remove handling of undocumented reader options

Revision 1379 - (view) (download) (annotate) - [select for diffs]
Modified Tue May 27 17:55:29 2014 UTC (4 years, 10 months ago) by feinerer
File length: 6283 byte(s)
Diff to previous 1377 , to selected 1357
Provide names<-() for VCorpus and PCorpus

Revision 1377 - (view) (download) (annotate) - [select for diffs]
Modified Wed May 21 17:15:56 2014 UTC (4 years, 11 months ago) by feinerer
File length: 6161 byte(s)
Diff to previous 1376 , to selected 1357
Provide names() for corpora

Revision 1376 - (view) (download) (annotate) - [select for diffs]
Modified Wed May 21 14:36:35 2014 UTC (4 years, 11 months ago) by feinerer
File length: 6073 byte(s)
Diff to previous 1366 , to selected 1357
Remove names() from Source API

Revision 1366 - (view) (download) (annotate) - [select for diffs]
Modified Mon Apr 28 14:48:37 2014 UTC (4 years, 11 months ago) by feinerer
File length: 6393 byte(s)
Diff to previous 1363 , to selected 1357
Support setting a named list as meta in PlainTextDocument and XMLTextDocument

Revision 1363 - (view) (download) (annotate) - [select for diffs]
Modified Mon Apr 28 09:49:46 2014 UTC (4 years, 11 months ago) by feinerer
File length: 6443 byte(s)
Diff to previous 1357
Keep Corpus() alias

Revision 1357 - (view) (download) (annotate) - [selected]
Modified Thu Apr 24 06:33:35 2014 UTC (5 years ago) by feinerer
File length: 6433 byte(s)
Diff to previous 1350
Simplify SimpleSource() default arguments

Revision 1350 - (view) (download) (annotate) - [select for diffs]
Modified Tue Apr 22 07:41:14 2014 UTC (5 years ago) by feinerer
File length: 6722 byte(s)
Diff to previous 1348 , to selected 1357
Reorder, fix typo

Revision 1348 - (view) (download) (annotate) - [select for diffs]
Modified Tue Apr 22 07:09:41 2014 UTC (5 years ago) by feinerer
File length: 6722 byte(s)
Diff to previous 1342 , to selected 1357
Provide as.VCorpus() generic

Revision 1342 - (view) (download) (annotate) - [select for diffs]
Modified Sat Apr 19 17:06:45 2014 UTC (5 years ago) by feinerer
File length: 6636 byte(s)
Diff to previous 1336 , to selected 1357
Sync vignette with code, small bug fix for access with names via [[

Revision 1336 - (view) (download) (annotate) - [select for diffs]
Modified Sat Apr 19 08:59:39 2014 UTC (5 years ago) by feinerer
File length: 6616 byte(s)
Diff to previous 1333 , to selected 1357
Implement and describe Source API

Revision 1333 - (view) (download) (annotate) - [select for diffs]
Modified Fri Apr 18 10:38:46 2014 UTC (5 years ago) by feinerer
File length: 6694 byte(s)
Diff to previous 1329 , to selected 1357
Update Corpus documentation

Revision 1329 - (view) (download) (annotate) - [select for diffs]
Modified Tue Apr 15 17:16:03 2014 UTC (5 years ago) by feinerer
File length: 6704 byte(s)
Diff to previous 1328 , to selected 1357
Synchronize print() appearance with NLP

Revision 1328 - (view) (download) (annotate) - [select for diffs]
Modified Tue Apr 15 09:46:28 2014 UTC (5 years ago) by feinerer
File length: 6892 byte(s)
Diff to previous 1327 , to selected 1357
Rearrange, update manual

Revision 1327 - (view) (download) (annotate) - [select for diffs]
Modified Mon Apr 14 15:35:38 2014 UTC (5 years ago) by feinerer
File length: 7338 byte(s)
Diff to previous 1315 , to selected 1357
Simplify c.VCorpus()

Revision 1315 - (view) (download) (annotate) - [select for diffs]
Modified Mon Mar 31 08:38:05 2014 UTC (5 years ago) by feinerer
File length: 8831 byte(s)
Diff to previous 1313 , to selected 1357
Simplify tm_map, tm_filter, and tm_index; remove makeChunks; rework lazy maps

Revision 1313 - (view) (download) (annotate) - [select for diffs]
Modified Sun Mar 30 09:28:00 2014 UTC (5 years ago) by feinerer
File length: 10157 byte(s)
Diff to previous 1312 , to selected 1357
content() and as.list() now give the full documents

Revision 1312 - (view) (download) (annotate) - [select for diffs]
Modified Sat Mar 29 09:35:44 2014 UTC (5 years ago) by feinerer
File length: 10522 byte(s)
Diff to previous 1311 , to selected 1357
Simplify corpus metadata and PCorpus metadata storage

Revision 1311 - (view) (download) (annotate) - [select for diffs]
Modified Thu Mar 27 14:15:08 2014 UTC (5 years ago) by feinerer
File length: 11295 byte(s)
Diff to previous 1308 , to selected 1357
Some bug fixes

Revision 1308 - (view) (download) (annotate) - [select for diffs]
Modified Tue Mar 25 15:02:15 2014 UTC (5 years ago) by feinerer
File length: 11286 byte(s)
Diff to previous 1307 , to selected 1357
Bug fixes. More to come ...

Revision 1307 - (view) (download) (annotate) - [select for diffs]
Modified Tue Mar 25 12:15:51 2014 UTC (5 years, 1 month ago) by feinerer
File length: 11279 byte(s)
Diff to previous 1306 , to selected 1357
Redesign corpora

Revision 1306 - (view) (download) (annotate) - [select for diffs]
Modified Tue Mar 25 08:37:05 2014 UTC (5 years, 1 month ago) by feinerer
File length: 11715 byte(s)
Diff to previous 1300 , to selected 1357
Improve writeCorpus, use lower case in internal data structures

Revision 1300 - (view) (download) (annotate) - [select for diffs]
Modified Fri Mar 21 14:30:05 2014 UTC (5 years, 1 month ago) by feinerer
File length: 11720 byte(s)
Diff to previous 1297 , to selected 1357
Redesign text documents

This is a major change and causes fallout. Soon to be fixed ...

Revision 1297 - (view) (download) (annotate) - [select for diffs]
Modified Thu Mar 20 18:43:22 2014 UTC (5 years, 1 month ago) by feinerer
File length: 11688 byte(s)
Diff to previous 1285 , to selected 1357
Redesign sources

Revision 1285 - (view) (download) (annotate) - [select for diffs]
Modified Sat Jan 11 09:31:10 2014 UTC (5 years, 3 months ago) by feinerer
File length: 11670 byte(s)
Diff to previous 1274 , to selected 1357
Simplify checks

Revision 1274 - (view) (download) (annotate) - [select for diffs]
Modified Sun Jan 5 10:51:18 2014 UTC (5 years, 3 months ago) by feinerer
File length: 11678 byte(s)
Diff to previous 1273 , to selected 1357
More sanity checks

Revision 1273 - (view) (download) (annotate) - [select for diffs]
Modified Sun Jan 5 08:42:02 2014 UTC (5 years, 3 months ago) by feinerer
File length: 11696 byte(s)
Diff to previous 1261 , to selected 1357
Some sanity checks

Revision 1261 - (view) (download) (annotate) - [select for diffs]
Modified Fri Sep 27 09:37:35 2013 UTC (5 years, 6 months ago) by feinerer
File length: 11620 byte(s)
Diff to previous 1259 , to selected 1357
Allow multiple URIs for URISource, default to vectorized sources, simplify eoi()

Revision 1259 - (view) (download) (annotate) - [select for diffs]
Modified Sat Sep 21 07:36:25 2013 UTC (5 years, 7 months ago) by feinerer
File length: 11520 byte(s)
Diff to previous 1258 , to selected 1357
Remove unused arguments, sync VCorpus and PCorpus constructors

Revision 1258 - (view) (download) (annotate) - [select for diffs]
Modified Fri Sep 20 12:15:42 2013 UTC (5 years, 7 months ago) by feinerer
File length: 11558 byte(s)
Diff to previous 1242 , to selected 1357
Remove GmaneSource() and readGmane(), simplify readers, improve documentation

Revision 1242 - (view) (download) (annotate) - [select for diffs]
Modified Mon Aug 19 05:33:57 2013 UTC (5 years, 8 months ago) by feinerer
File length: 11457 byte(s)
Diff to previous 1203 , to selected 1357
Do not register VCorpus and PlainTextDocument as S4 classes anymore

Revision 1203 - (view) (download) (annotate) - [select for diffs]
Modified Fri Jan 11 19:43:37 2013 UTC (6 years, 3 months ago) by khornik
File length: 11780 byte(s)
Diff to previous 1114 , to selected 1357
Improve formals for c() methods.

Revision 1114 - (view) (download) (annotate) - [select for diffs]
Modified Fri Nov 26 14:05:54 2010 UTC (8 years, 4 months ago) by feinerer
File length: 11666 byte(s)
Diff to previous 1108 , to selected 1357
Allow init and exit hooks for readers

Revision 1108 - (view) (download) (annotate) - [select for diffs]
Modified Fri Oct 22 18:32:47 2010 UTC (8 years, 6 months ago) by feinerer
File length: 11364 byte(s)
Diff to previous 1102 , to selected 1357
Change Weighting from list element to attribute, access documents by name

Revision 1102 - (view) (download) (annotate) - [select for diffs]
Modified Sat Oct 16 10:01:09 2010 UTC (8 years, 6 months ago) by feinerer
File length: 11191 byte(s)
Diff to previous 1095 , to selected 1357
Access documents by their document ID

Revision 1095 - (view) (download) (annotate) - [select for diffs]
Modified Wed Aug 25 19:05:38 2010 UTC (8 years, 7 months ago) by feinerer
File length: 11035 byte(s)
Diff to previous 1074 , to selected 1357
Use the \code{recursive} argument to determine whether meta data is used when merging corpora

Revision 1074 - (view) (download) (annotate) - [select for diffs]
Modified Fri May 28 12:58:59 2010 UTC (8 years, 10 months ago) by feinerer
File length: 10785 byte(s)
Diff to previous 1073 , to selected 1357
Fix typo

Revision 1073 - (view) (download) (annotate) - [select for diffs]
Modified Fri May 28 12:32:46 2010 UTC (8 years, 10 months ago) by feinerer
File length: 10786 byte(s)
Diff to previous 1070 , to selected 1357
Use IETF language tags for language codes

Revision 1070 - (view) (download) (annotate) - [select for diffs]
Modified Tue May 18 08:58:22 2010 UTC (8 years, 11 months ago) by feinerer
File length: 10787 byte(s)
Diff to previous 1064 , to selected 1357
Use element names as document IDs if provided by a source

Revision 1064 - (view) (download) (annotate) - [select for diffs]
Modified Fri Apr 9 10:43:22 2010 UTC (9 years ago) by feinerer
File length: 10664 byte(s)
Diff to previous 1063 , to selected 1357
Use document names provided by a source

Revision 1063 - (view) (download) (annotate) - [select for diffs]
Modified Fri Apr 9 10:36:39 2010 UTC (9 years ago) by feinerer
File length: 10638 byte(s)
Diff to previous 1025 , to selected 1357
Sources can now provide document names

Revision 1025 - (view) (download) (annotate) - [select for diffs]
Modified Fri Dec 11 08:56:22 2009 UTC (9 years, 4 months ago) by feinerer
File length: 10593 byte(s)
Diff to previous 1021 , to selected 1357
Register S3 document classes to be recognized by S4 methods.

Revision 1021 - (view) (download) (annotate) - [select for diffs]
Modified Tue Nov 17 16:37:22 2009 UTC (9 years, 5 months ago) by feinerer
File length: 10590 byte(s)
Diff to previous 1004 , to selected 1357
Register S3 corpus classes to be recognized by S4 methods.

Revision 1004 - (view) (download) (annotate) - [select for diffs]
Modified Tue Sep 8 10:28:28 2009 UTC (9 years, 7 months ago) by feinerer
File length: 10267 byte(s)
Diff to previous 995 , to selected 1357
Improve vignette.

Revision 995 - (view) (download) (annotate) - [select for diffs]
Modified Mon Sep 7 07:54:08 2009 UTC (9 years, 7 months ago) by feinerer
File length: 10357 byte(s)
Diff to previous 988 , to selected 1357
Use NextMethod().

Revision 988 - (view) (download) (annotate) - [select for diffs]
Modified Fri Sep 4 12:27:12 2009 UTC (9 years, 7 months ago) by feinerer
File length: 10534 byte(s)
Diff to previous 987 , to selected 1357
Update documentation.

Revision 987 - (view) (download) (annotate) - [select for diffs]
Modified Wed Sep 2 17:54:45 2009 UTC (9 years, 7 months ago) by feinerer
File length: 11364 byte(s)
Diff to previous 986 , to selected 1357
Update documentation.

Revision 986 - (view) (download) (annotate) - [select for diffs]
Modified Tue Sep 1 15:33:30 2009 UTC (9 years, 7 months ago) by feinerer
File length: 11632 byte(s)
Diff to previous 985 , to selected 1357
Further changes due to S3 class system.

Revision 985 - (view) (download) (annotate) - [select for diffs]
Modified Thu Aug 27 18:09:05 2009 UTC (9 years, 7 months ago) by feinerer
File length: 11781 byte(s)
Diff to previous 984 , to selected 1357
Use S3 instead of S4 class system.

Revision 984 - (view) (download) (annotate) - [select for diffs]
Modified Fri Aug 14 16:32:35 2009 UTC (9 years, 8 months ago) by feinerer
File length: 23441 byte(s)
Diff to previous 982 , to selected 1357
Remove obsolete appendElem() method (use c() instead).

Revision 982 - (view) (download) (annotate) - [select for diffs]
Modified Tue Aug 11 07:48:04 2009 UTC (9 years, 8 months ago) by feinerer
File length: 24336 byte(s)
Diff to previous 973 , to selected 1357
Moved readMail and MailDocument class from tm to tm.plugin.mail.

Revision 973 - (view) (download) (annotate) - [select for diffs]
Modified Sat Jul 4 08:10:25 2009 UTC (9 years, 9 months ago) by feinerer
File length: 24433 byte(s)
Diff to previous 966 , to selected 1357
Rename readNewsgroup to readMail.

Revision 966 - (view) (download) (annotate) - [select for diffs]
Modified Mon Jun 29 09:05:14 2009 UTC (9 years, 9 months ago) by feinerer
File length: 24784 byte(s)
Diff to previous 963 , to selected 1357
Remove TODO item.

Revision 963 - (view) (download) (annotate) - [select for diffs]
Modified Mon Jun 29 07:01:19 2009 UTC (9 years, 9 months ago) by feinerer
File length: 24830 byte(s)
Diff to previous 962 , to selected 1357
Rename SCorpus to VCorpus (Volatile Corpus).

Revision 962 - (view) (download) (annotate) - [select for diffs]
Modified Sun Jun 28 15:52:33 2009 UTC (9 years, 9 months ago) by feinerer
File length: 24830 byte(s)
Diff to previous 960 , to selected 1357
Fix documentation.

Revision 960 - (view) (download) (annotate) - [select for diffs]
Modified Fri Jun 26 17:43:45 2009 UTC (9 years, 9 months ago) by feinerer
File length: 24632 byte(s)
Diff to previous 958 , to selected 1357
Add slam dependency and readReut21578XMLasPlain reader.

Revision 958 - (view) (download) (annotate) - [select for diffs]
Modified Sat Jun 13 06:06:42 2009 UTC (9 years, 10 months ago) by feinerer
File length: 24780 byte(s)
Diff to previous 952 , to selected 1357
Code cleanup.

Revision 952 - (view) (download) (annotate) - [select for diffs]
Modified Mon May 18 13:43:01 2009 UTC (9 years, 11 months ago) by feinerer
File length: 24807 byte(s)
Diff to previous 950 , to selected 1357
Further work on FCorpus integration.

Revision 950 - (view) (download) (annotate) - [select for diffs]
Modified Thu May 14 15:17:18 2009 UTC (9 years, 11 months ago) by feinerer
File length: 24281 byte(s)
Diff to previous 946 , to selected 1357
Experimental FCorpus (fast corpus).

Revision 946 - (view) (download) (annotate) - [select for diffs]
Modified Wed May 13 18:07:35 2009 UTC (9 years, 11 months ago) by feinerer
File length: 23422 byte(s)
Copied from: pkg/R/textdoccol.R revision 945
Diff to previous 940 , to selected 1357
A lot of major improvements (see NEWS).

Revision 940 - (view) (download) (annotate) - [select for diffs]
Modified Sun Apr 26 12:35:25 2009 UTC (9 years, 11 months ago) by feinerer
Original Path: pkg/R/textdoccol.R
File length: 31633 byte(s)
Diff to previous 938 , to selected 1357
Fix codetools warnings.

Revision 938 - (view) (download) (annotate) - [select for diffs]
Modified Sat Apr 25 19:05:50 2009 UTC (10 years ago) by feinerer
Original Path: pkg/R/textdoccol.R
File length: 31613 byte(s)
Diff to previous 909 , to selected 1357
Get rid of Matrix package dependency.

Revision 909 - (view) (download) (annotate) - [select for diffs]
Modified Sun Mar 22 12:45:59 2009 UTC (10 years, 1 month ago) by feinerer
Original Path: pkg/R/textdoccol.R
File length: 31803 byte(s)
Diff to previous 905 , to selected 1357
Sources now can be vectorized.

Revision 905 - (view) (download) (annotate) - [select for diffs]
Modified Sat Mar 21 10:13:08 2009 UTC (10 years, 1 month ago) by feinerer
Original Path: pkg/R/textdoccol.R
File length: 31363 byte(s)
Diff to previous 902 , to selected 1357
Always use UTC time zone for DateTimeStamps.

Revision 902 - (view) (download) (annotate) - [select for diffs]
Modified Fri Mar 20 20:08:39 2009 UTC (10 years, 1 month ago) by feinerer
Original Path: pkg/R/textdoccol.R
File length: 31195 byte(s)
Diff to previous 900 , to selected 1357
Fix arguments to Reduce() call.

Revision 900 - (view) (download) (annotate) - [select for diffs]
Modified Fri Mar 20 16:50:27 2009 UTC (10 years, 1 month ago) by feinerer
Original Path: pkg/R/textdoccol.R
File length: 31186 byte(s)
Diff to previous 894 , to selected 1357
Add URL to DESCRIPTION. Use Reduce() function.

Revision 894 - (view) (download) (annotate) - [select for diffs]
Modified Tue Mar 3 15:33:22 2009 UTC (10 years, 1 month ago) by feinerer
Original Path: pkg/R/textdoccol.R
File length: 31303 byte(s)
Diff to previous 886 , to selected 1357
Use the notion corpus instead of text document collection.

Revision 886 - (view) (download) (annotate) - [select for diffs]
Modified Thu Jan 29 22:47:34 2009 UTC (10 years, 2 months ago) by feinerer
Original Path: pkg/R/textdoccol.R
File length: 31375 byte(s)
Diff to previous 885 , to selected 1357
Speed up package loading (Depends -> Suggests).

Revision 885 - (view) (download) (annotate) - [select for diffs]
Modified Thu Jan 29 09:34:44 2009 UTC (10 years, 2 months ago) by stefan7th
Original Path: pkg/R/textdoccol.R
File length: 31006 byte(s)
Diff to previous 884 , to selected 1357
moved package to /pkg

Revision 884 - (view) (download) (annotate) - [select for diffs]
Modified Wed Jan 28 10:24:27 2009 UTC (10 years, 2 months ago) by stefan7th
Original Path: pkg/tm/R/textdoccol.R
File length: 31006 byte(s)
Diff to previous 875 , to selected 1357
R-Forge transition completed

Revision 875 - (view) (download) (annotate) - [select for diffs]
Modified Sat Dec 6 13:25:03 2008 UTC (10 years, 4 months ago) by feinerer
Original Path: trunk/tm/R/textdoccol.R
File length: 31006 byte(s)
Diff to previous 870 , to selected 1357
Fixed non-standard call evaluation.

Revision 870 - (view) (download) (annotate) - [select for diffs]
Modified Mon Nov 10 15:29:22 2008 UTC (10 years, 5 months ago) by feinerer
Original Path: trunk/tm/R/textdoccol.R
File length: 30992 byte(s)
Diff to previous 869 , to selected 1357
Fix documentation and codoc mismatches.

Revision 869 - (view) (download) (annotate) - [select for diffs]
Modified Sat Nov 8 09:16:37 2008 UTC (10 years, 5 months ago) by feinerer
Original Path: trunk/tm/R/textdoccol.R
File length: 30974 byte(s)
Diff to previous 861 , to selected 1357
Sources now have a Length slot. Knowing the length in advance makes corpus construction a lot faster (~ 8 times faster).
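
The commit message does not say why a known length helps. One common reason in R, assumed here rather than confirmed by the source, is that a container of known size can be allocated once and filled by index instead of being grown element by element. A minimal sketch with made-up data:

```r
# Hypothetical stand-in for a source whose length is known up front.
texts <- c("first document", "second document", "third document")
n <- length(texts)

# Growing: the list is extended by one element on every iteration.
grown <- list()
for (i in seq_len(n)) {
  grown[[length(grown) + 1L]] <- texts[i]
}

# Preallocating: allocate n slots once, then fill each slot by index.
filled <- vector("list", n)
for (i in seq_len(n)) {
  filled[[i]] <- texts[i]
}

# Both approaches yield the same result; only the allocation pattern differs.
identical(grown, filled)
```

The two loops are equivalent in output, so any speed difference comes from how often the list has to be reallocated.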

Revision 861 - (view) (download) (annotate) - [select for diffs]
Modified Thu Jul 24 09:55:09 2008 UTC (10 years, 9 months ago) by feinerer
Original Path: trunk/tm/R/textdoccol.R
File length: 30517 byte(s)
Diff to previous 860 , to selected 1357
tmIndex(), tmFilter(), tmMap(), and TermDocMatrix() now use an MPI cluster if available.

Revision 860 - (view) (download) (annotate) - [select for diffs]
Modified Fri Jul 18 05:05:20 2008 UTC (10 years, 9 months ago) by feinerer
Original Path: trunk/tm/R/textdoccol.R
File length: 29904 byte(s)
Diff to previous 859 , to selected 1357
Removed some forgotten debug print out.

Revision 859 - (view) (download) (annotate) - [select for diffs]
Modified Wed Jul 9 13:39:52 2008 UTC (10 years, 9 months ago) by feinerer
Original Path: trunk/tm/R/textdoccol.R
File length: 29934 byte(s)
Diff to previous 856 , to selected 1357
Improved documentation.

Revision 856 - (view) (download) (annotate) - [select for diffs]
Modified Fri Jun 6 11:45:39 2008 UTC (10 years, 10 months ago) by feinerer
Original Path: trunk/tm/R/textdoccol.R
File length: 29925 byte(s)
Diff to previous 854 , to selected 1357
Improved meta data extraction from Reuters Corpus Volume 1 documents.

Revision 854 - (view) (download) (annotate) - [select for diffs]
Modified Sun May 25 13:15:06 2008 UTC (10 years, 11 months ago) by feinerer
Original Path: trunk/tm/R/textdoccol.R
File length: 30190 byte(s)
Diff to previous 853 , to selected 1357
searchFullText is now the default function used for tmFilter and tmIndex.

Revision 853 - (view) (download) (annotate) - [select for diffs]
Modified Sun May 18 13:09:35 2008 UTC (10 years, 11 months ago) by feinerer
Original Path: trunk/tm/R/textdoccol.R
File length: 31191 byte(s)
Diff to previous 837 , to selected 1357
Improved stem completion. Some documentation fixes.

Revision 837 - (view) (download) (annotate) - [select for diffs]
Modified Wed Apr 23 09:16:25 2008 UTC (11 years ago) by feinerer
Original Path: trunk/tm/R/textdoccol.R
File length: 31119 byte(s)
Diff to previous 836 , to selected 1357
Improved show methods.

Revision 836 - (view) (download) (annotate) - [select for diffs]
Modified Sat Apr 19 17:08:07 2008 UTC (11 years ago) by feinerer
Original Path: trunk/tm/R/textdoccol.R
File length: 31108 byte(s)
Diff to previous 833 , to selected 1357
Improved meta data handling. Added coerce method from list to corpus. Updated CITATION file.

Revision 833 - (view) (download) (annotate) - [select for diffs]
Modified Fri Mar 21 10:55:11 2008 UTC (11 years, 1 month ago) by feinerer
Original Path: trunk/tm/R/textdoccol.R
File length: 30169 byte(s)
Diff to previous 831 , to selected 1357
Included improvements suggested by Christian Buchta. Added CITATION file.

Revision 831 - (view) (download) (annotate) - [select for diffs]
Modified Wed Mar 12 09:10:46 2008 UTC (11 years, 1 month ago) by feinerer
Original Path: trunk/tm/R/textdoccol.R
File length: 30003 byte(s)
Diff to previous 830 , to selected 1357
Fixed bug in [[<- (reported by Christian Buchta).

Revision 830 - (view) (download) (annotate) - [select for diffs]
Modified Tue Mar 11 15:23:28 2008 UTC (11 years, 1 month ago) by feinerer
Original Path: trunk/tm/R/textdoccol.R
File length: 29897 byte(s)
Diff to previous 829 , to selected 1357
Finished work on lazy mapping.

Revision 829 - (view) (download) (annotate) - [select for diffs]
Modified Mon Mar 10 22:55:39 2008 UTC (11 years, 1 month ago) by feinerer
Original Path: trunk/tm/R/textdoccol.R
File length: 28877 byte(s)
Diff to previous 828 , to selected 1357
First version of working lazy mapping.

Revision 828 - (view) (download) (annotate) - [select for diffs]
Modified Sun Mar 9 07:47:15 2008 UTC (11 years, 1 month ago) by feinerer
Original Path: trunk/tm/R/textdoccol.R
File length: 28047 byte(s)
Diff to previous 826 , to selected 1357
Some preliminary code for lazy mapping.

Revision 826 - (view) (download) (annotate) - [select for diffs]
Modified Sat Feb 23 14:38:15 2008 UTC (11 years, 2 months ago) by feinerer
Original Path: trunk/tm/R/textdoccol.R
File length: 27318 byte(s)
Diff to previous 819 , to selected 1357
asPlain(): Preserve local meta data.

Revision 819 - (view) (download) (annotate) - [select for diffs]
Modified Thu Jan 31 09:09:18 2008 UTC (11 years, 2 months ago) by feinerer
Original Path: trunk/tm/R/textdoccol.R
File length: 27204 byte(s)
Diff to previous 817 , to selected 1357
Added writeCorpus function.

Revision 817 - (view) (download) (annotate) - [select for diffs]
Modified Wed Jan 30 11:25:20 2008 UTC (11 years, 2 months ago) by feinerer
Original Path: trunk/tm/R/textdoccol.R
File length: 26605 byte(s)
Diff to previous 816 , to selected 1357
Ensure that dimnames are always set correctly when generating a TermDocMatrix.

Revision 816 - (view) (download) (annotate) - [select for diffs]
Modified Thu Jan 24 14:36:41 2008 UTC (11 years, 3 months ago) by feinerer
Original Path: trunk/tm/R/textdoccol.R
File length: 26611 byte(s)
Diff to previous 812 , to selected 1357
Renamed TextDocCol to Corpus, and Corpus to Content.

Revision 812 - (view) (download) (annotate) - [select for diffs]
Modified Tue Jan 22 13:36:33 2008 UTC (11 years, 3 months ago) by feinerer
Original Path: trunk/tm/R/textdoccol.R
File length: 26714 byte(s)
Diff to previous 809 , to selected 1357
Ensure that tmUpdate uses provided encoding.

Revision 809 - (view) (download) (annotate) - [select for diffs]
Modified Mon Jan 14 07:16:25 2008 UTC (11 years, 3 months ago) by feinerer
Original Path: trunk/tm/R/textdoccol.R
File length: 26626 byte(s)
Diff to previous 808 , to selected 1357
Improved handling of default readers.

Revision 808 - (view) (download) (annotate) - [select for diffs]
Modified Sun Jan 13 16:18:27 2008 UTC (11 years, 3 months ago) by feinerer
Original Path: trunk/tm/R/textdoccol.R
File length: 26325 byte(s)
Diff to previous 799 , to selected 1357
Fixed bug regarding default reader selection when no reader argument is given.

Revision 799 - (view) (download) (annotate) - [select for diffs]
Modified Thu Nov 29 11:05:23 2007 UTC (11 years, 4 months ago) by feinerer
Original Path: trunk/tm/R/textdoccol.R
File length: 26320 byte(s)
Diff to previous 791 , to selected 1357
Better handling of empty arguments in TextDocCol. Exported readDOC.

Revision 791 - (view) (download) (annotate) - [select for diffs]
Modified Sun Oct 21 11:51:42 2007 UTC (11 years, 6 months ago) by feinerer
Original Path: trunk/tm/R/textdoccol.R
File length: 26014 byte(s)
Diff to previous 780 , to selected 1357
New tmIntersect filter.

Revision 780 - (view) (download) (annotate) - [select for diffs]
Modified Sat Sep 29 13:24:17 2007 UTC (11 years, 6 months ago) by feinerer
Original Path: trunk/tm/R/textdoccol.R
File length: 26325 byte(s)
Diff to previous 777 , to selected 1357
Added three transformations often used for e-mail analyses.

Revision 777 - (view) (download) (annotate) - [select for diffs]
Modified Tue Aug 28 07:19:12 2007 UTC (11 years, 7 months ago) by feinerer
Original Path: trunk/tm/R/textdoccol.R
File length: 28799 byte(s)
Diff to previous 775 , to selected 1357
Function generators are now real S4 classes instead of S3 attributes.

Revision 775 - (view) (download) (annotate) - [select for diffs]
Modified Sat Jul 28 13:57:02 2007 UTC (11 years, 8 months ago) by feinerer
Original Path: trunk/tm/R/textdoccol.R
File length: 28803 byte(s)
Diff to previous 772 , to selected 1357
Added conversion (asPlain) from StructuredTextDocuments to PlainTextDocuments.

Revision 772 - (view) (download) (annotate) - [select for diffs]
Modified Fri Jul 20 14:00:58 2007 UTC (11 years, 9 months ago) by feinerer
Original Path: trunk/tm/R/textdoccol.R
File length: 28327 byte(s)
Diff to previous 769 , to selected 1357
Updated TODO list.

Revision 769 - (view) (download) (annotate) - [select for diffs]
Modified Sun Jul 15 16:31:59 2007 UTC (11 years, 9 months ago) by feinerer
Original Path: trunk/tm/R/textdoccol.R
File length: 28313 byte(s)
Diff to previous 767 , to selected 1357
Fixed bug in tmUpdate.

Revision 767 - (view) (download) (annotate) - [select for diffs]
Modified Sat Jul 14 16:50:44 2007 UTC (11 years, 9 months ago) by feinerer
Original Path: trunk/tm/R/textdoccol.R
File length: 28317 byte(s)
Diff to previous 760 , to selected 1357
Added simple HTML reader to produce StructuredTextDocuments.

Revision 760 - (view) (download) (annotate) - [select for diffs]
Modified Thu Jun 21 22:40:15 2007 UTC (11 years, 10 months ago) by feinerer
Original Path: trunk/tm/R/textdoccol.R
File length: 27977 byte(s)
Diff to previous 757 , to selected 1357
require() uses the quietly option to suppress loading messages.

Revision 757 - (view) (download) (annotate) - [select for diffs]
Modified Thu Jun 7 17:41:56 2007 UTC (11 years, 10 months ago) by feinerer
Original Path: trunk/tm/R/textdoccol.R
File length: 27961 byte(s)
Diff to previous 755 , to selected 1357
Added classes for Reuters21578 XML and RCV1 documents.

Revision 755 - (view) (download) (annotate) - [select for diffs]
Modified Sun Jun 3 17:20:40 2007 UTC (11 years, 10 months ago) by feinerer
Original Path: trunk/tm/R/textdoccol.R
File length: 27111 byte(s)
Diff to previous 747 , to selected 1357
Added replaceWords function.

Revision 747 - (view) (download) (annotate) - [select for diffs]
Modified Fri Apr 27 18:16:53 2007 UTC (11 years, 11 months ago) by feinerer
Original Path: trunk/tm/R/textdoccol.R
File length: 26694 byte(s)
Diff to previous 744 , to selected 1357
Removed dbDisconnect calls since deprecated by last filehash release.

Revision 744 - (view) (download) (annotate) - [select for diffs]
Modified Mon Apr 23 00:35:10 2007 UTC (12 years ago) by feinerer
Original Path: trunk/tm/R/textdoccol.R
File length: 27044 byte(s)
Diff to previous 741 , to selected 1357
TermDocMatrix is now built by direct stepwise insertion, which saves a lot of memory during construction.

Revision 741 - (view) (download) (annotate) - [select for diffs]
Modified Sat Apr 21 18:35:16 2007 UTC (12 years ago) by feinerer
Original Path: trunk/tm/R/textdoccol.R
File length: 27046 byte(s)
Diff to previous 733 , to selected 1357
Switched back to filehash instead of filehashSQLite.

Revision 733 - (view) (download) (annotate) - [select for diffs]
Modified Wed Apr 11 18:53:49 2007 UTC (12 years ago) by feinerer
Original Path: trunk/tm/R/textdoccol.R
File length: 27058 byte(s)
Diff to previous 730 , to selected 1357
Removed a codetools warning.

Revision 730 - (view) (download) (annotate) - [select for diffs]
Modified Wed Apr 11 02:15:10 2007 UTC (12 years ago) by feinerer
Original Path: trunk/tm/R/textdoccol.R
File length: 27005 byte(s)
Diff to previous 729 , to selected 1357
Updated documentation.

Revision 729 - (view) (download) (annotate) - [select for diffs]
Modified Tue Apr 10 17:08:52 2007 UTC (12 years ago) by feinerer
Original Path: trunk/tm/R/textdoccol.R
File length: 27041 byte(s)
Diff to previous 728 , to selected 1357
Improved database support.

Revision 728 - (view) (download) (annotate) - [select for diffs]
Modified Sun Apr 8 23:23:29 2007 UTC (12 years ago) by feinerer
Original Path: trunk/tm/R/textdoccol.R
File length: 27060 byte(s)
Diff to previous 727 , to selected 1357
Updated database code.

Revision 727 - (view) (download) (annotate) - [select for diffs]
Modified Sun Apr 8 19:36:41 2007 UTC (12 years ago) by feinerer
Original Path: trunk/tm/R/textdoccol.R
File length: 26843 byte(s)
Diff to previous 725 , to selected 1357
Fixed some bugs related to database support.

Revision 725 - (view) (download) (annotate) - [select for diffs]
Modified Fri Apr 6 01:10:28 2007 UTC (12 years ago) by feinerer
Original Path: trunk/tm/R/textdoccol.R
File length: 26823 byte(s)
Diff to previous 724 , to selected 1357
Updated parts of the documentation.

Revision 724 - (view) (download) (annotate) - [select for diffs]
Modified Sun Apr 1 21:13:36 2007 UTC (12 years ago) by feinerer
Original Path: trunk/tm/R/textdoccol.R
File length: 26378 byte(s)
Diff to previous 723 , to selected 1357
Finished experimental database support.

Revision 723 - (view) (download) (annotate) - [select for diffs]
Modified Sun Apr 1 16:12:26 2007 UTC (12 years ago) by feinerer
Original Path: trunk/tm/R/textdoccol.R
File length: 25714 byte(s)
Diff to previous 722 , to selected 1357
Now each source has its own default reader.

Revision 722 - (view) (download) (annotate) - [select for diffs]
Modified Sun Apr 1 15:53:58 2007 UTC (12 years ago) by feinerer
Original Path: trunk/tm/R/textdoccol.R
File length: 25670 byte(s)
Diff to previous 721 , to selected 1357
Prettyprint summary, print method for plain text docs, removePunctuation.

Revision 721 - (view) (download) (annotate) - [select for diffs]
Modified Wed Mar 21 13:54:43 2007 UTC (12 years, 1 month ago) by feinerer
Original Path: trunk/tm/R/textdoccol.R
File length: 25238 byte(s)
Diff to previous 720 , to selected 1357
Simplified sFilter.

Revision 720 - (view) (download) (annotate) - [select for diffs]
Modified Tue Mar 20 10:43:11 2007 UTC (12 years, 1 month ago) by feinerer
Original Path: trunk/tm/R/textdoccol.R
File length: 26362 byte(s)
Diff to previous 719 , to selected 1357
Some bug fixes.

Revision 719 - (view) (download) (annotate) - [select for diffs]
Modified Sun Mar 18 09:24:47 2007 UTC (12 years, 1 month ago) by feinerer
Original Path: trunk/tm/R/textdoccol.R
File length: 25853 byte(s)
Diff to previous 717 , to selected 1357
Improved database support.

Revision 717 - (view) (download) (annotate) - [select for diffs]
Modified Fri Mar 16 11:13:04 2007 UTC (12 years, 1 month ago) by feinerer
Original Path: trunk/tm/R/textdoccol.R
File length: 23828 byte(s)
Diff to previous 715 , to selected 1357
Added Language slot to text documents. Refactored TextDocCol constructor.

Revision 715 - (view) (download) (annotate) - [select for diffs]
Modified Wed Mar 14 15:16:27 2007 UTC (12 years, 1 month ago) by feinerer
Original Path: trunk/tm/R/textdoccol.R
File length: 23564 byte(s)
Diff to previous 713 , to selected 1357
Datasets acq and crude can now be created on the fly with tmDataSetup.R.

Revision 713 - (view) (download) (annotate) - [select for diffs]
Modified Wed Mar 14 13:44:11 2007 UTC (12 years, 1 month ago) by feinerer
Original Path: trunk/tm/R/textdoccol.R
File length: 23581 byte(s)
Diff to previous 712 , to selected 1357
Added Snowball support. Added function returning stopwords (English, German, French).

Revision 712 - (view) (download) (annotate) - [select for diffs]
Modified Sun Mar 4 15:18:36 2007 UTC (12 years, 1 month ago) by feinerer
Original Path: trunk/tm/R/textdoccol.R
File length: 23452 byte(s)
Diff to previous 702 , to selected 1357
Started implementing database support to reduce RAM usage when necessary.

Revision 702 - (view) (download) (annotate) - [select for diffs]
Modified Tue Jan 9 09:39:33 2007 UTC (12 years, 3 months ago) by feinerer
Original Path: trunk/tm/R/textdoccol.R
File length: 21164 byte(s)
Diff to previous 698 , to selected 1357
wordStem now explicitly uses Rstem namespace.

Revision 698 - (view) (download) (annotate) - [select for diffs]
Modified Sat Jan 6 17:05:44 2007 UTC (12 years, 3 months ago) by feinerer
Original Path: trunk/tm/R/textdoccol.R
File length: 21157 byte(s)
Diff to previous 697 , to selected 1357
Changes due to Kurt's review.

Revision 697 - (view) (download) (annotate) - [select for diffs]
Modified Fri Jan 5 23:09:12 2007 UTC (12 years, 3 months ago) by feinerer
Original Path: trunk/tm/R/textdoccol.R
File length: 21254 byte(s)
Diff to previous 696 , to selected 1357
Fixed codetools warnings.

Revision 696 - (view) (download) (annotate) - [select for diffs]
Modified Fri Jan 5 15:04:53 2007 UTC (12 years, 3 months ago) by hornik
Original Path: trunk/tm/R/textdoccol.R
File length: 21189 byte(s)
Diff to previous 694 , to selected 1357
Avoid non-standard eval (makes codetools happier).

Revision 694 - (view) (download) (annotate) - [select for diffs]
Modified Sun Dec 31 14:47:46 2006 UTC (12 years, 3 months ago) by feinerer
Original Path: trunk/tm/R/textdoccol.R
File length: 21185 byte(s)
Diff to previous 693 , to selected 1357
Implemented improvements based upon comments by David.

Revision 693 - (view) (download) (annotate) - [select for diffs]
Modified Fri Dec 22 13:21:30 2006 UTC (12 years, 4 months ago) by feinerer
Original Path: trunk/tm/R/textdoccol.R
File length: 21246 byte(s)
Diff to previous 690 , to selected 1357
Renamed the textmin directory to tm since the package name changed.

Revision 690 - (view) (download) (annotate) - [select for diffs]
Modified Sat Dec 16 17:22:56 2006 UTC (12 years, 4 months ago) by feinerer
Original Path: trunk/textmin/R/textdoccol.R
File length: 21246 byte(s)
Diff to previous 689 , to selected 1357
Renamed package to 'tm'. Updated documentation (man) for CRAN release.

Revision 689 - (view) (download) (annotate) - [select for diffs]
Modified Fri Dec 8 14:21:46 2006 UTC (12 years, 4 months ago) by feinerer
Original Path: trunk/textmin/R/textdoccol.R
File length: 21402 byte(s)
Diff to previous 78 , to selected 1357
Implemented changes as proposed at the Forschungsklausur on 01.12.2006.

Revision 78 - (view) (download) (annotate) - [select for diffs]
Modified Wed Nov 29 14:56:36 2006 UTC (12 years, 4 months ago) by zeileis
Original Path: trunk/textmin/R/textdoccol.R
File length: 32515 byte(s)
Diff to previous 77 , to selected 1357
removed old repos structure, now only R packages

Revision 77 - (view) (download) (annotate) - [select for diffs]
Modified Sun Nov 26 13:32:16 2006 UTC (12 years, 4 months ago) by feinerer
Original Path: trunk/R/textmin/R/textdoccol.R
File length: 32515 byte(s)
Diff to previous 76 , to selected 1357
See ChangeLog.

Revision 76 - (view) (download) (annotate) - [select for diffs]
Modified Thu Nov 23 16:29:02 2006 UTC (12 years, 5 months ago) by feinerer
Original Path: trunk/R/textmin/R/textdoccol.R
File length: 32582 byte(s)
Diff to previous 75 , to selected 1357
Various bug fixes. Data and vignette update.

Revision 75 - (view) (download) (annotate) - [select for diffs]
Modified Wed Nov 22 14:37:18 2006 UTC (12 years, 5 months ago) by feinerer
Original Path: trunk/R/textmin/R/textdoccol.R
File length: 32532 byte(s)
Diff to previous 74 , to selected 1357
Improved s_filter and prescind_meta.

Revision 74 - (view) (download) (annotate) - [select for diffs]
Modified Tue Nov 21 20:04:17 2006 UTC (12 years, 5 months ago) by feinerer
Original Path: trunk/R/textmin/R/textdoccol.R
File length: 32028 byte(s)
Diff to previous 73 , to selected 1357
Text documents' slot metadata is now accessible in s_filter.

Revision 73 - (view) (download) (annotate) - [select for diffs]
Modified Tue Nov 21 15:52:39 2006 UTC (12 years, 5 months ago) by feinerer
Original Path: trunk/R/textmin/R/textdoccol.R
File length: 31354 byte(s)
Diff to previous 72 , to selected 1357
Rewrote s_filter function.

Revision 72 - (view) (download) (annotate) - [select for diffs]
Modified Mon Nov 20 20:43:34 2006 UTC (12 years, 5 months ago) by feinerer
Original Path: trunk/R/textmin/R/textdoccol.R
File length: 28834 byte(s)
Diff to previous 71 , to selected 1357
Added update mechanism for document collections. Various fixes for metadata handling.

Revision 71 - (view) (download) (annotate) - [select for diffs]
Modified Sun Nov 19 17:30:26 2006 UTC (12 years, 5 months ago) by feinerer
Original Path: trunk/R/textmin/R/textdoccol.R
File length: 27454 byte(s)
Diff to previous 70 , to selected 1357
Added sophisticated merging for document collections.

Revision 70 - (view) (download) (annotate) - [select for diffs]
Modified Tue Nov 7 18:18:51 2006 UTC (12 years, 5 months ago) by feinerer
Original Path: trunk/R/textmin/R/textdoccol.R
File length: 24227 byte(s)
Diff to previous 69 , to selected 1357
Messages now use \code{ngettext}.

Revision 69 - (view) (download) (annotate) - [select for diffs]
Modified Fri Nov 3 10:50:39 2006 UTC (12 years, 5 months ago) by feinerer
Original Path: trunk/R/textmin/R/textdoccol.R
File length: 24061 byte(s)
Diff to previous 68 , to selected 1357
Added functions for modifying and removing metadata.

Revision 68 - (view) (download) (annotate) - [select for diffs]
Modified Thu Nov 2 14:06:42 2006 UTC (12 years, 5 months ago) by feinerer
Original Path: trunk/R/textmin/R/textdoccol.R
File length: 23404 byte(s)
Diff to previous 67 , to selected 1357
Wrote vignette.

Revision 67 - (view) (download) (annotate) - [select for diffs]
Modified Wed Nov 1 17:29:59 2006 UTC (12 years, 5 months ago) by feinerer
Original Path: trunk/R/textmin/R/textdoccol.R
File length: 23332 byte(s)
Diff to previous 66 , to selected 1357
See ChangeLog

Revision 66 - (view) (download) (annotate) - [select for diffs]
Modified Tue Oct 31 22:03:33 2006 UTC (12 years, 5 months ago) by feinerer
Original Path: trunk/R/textmin/R/textdoccol.R
File length: 21924 byte(s)
Diff to previous 65 , to selected 1357
See ChangeLog.

Revision 65 - (view) (download) (annotate) - [select for diffs]
Modified Tue Oct 31 17:10:24 2006 UTC (12 years, 5 months ago) by feinerer
Original Path: trunk/R/textmin/R/textdoccol.R
File length: 22035 byte(s)
Diff to previous 64 , to selected 1357


Revision 64 - (view) (download) (annotate) - [select for diffs]
Modified Sun Oct 29 14:29:43 2006 UTC (12 years, 5 months ago) by feinerer
Original Path: trunk/R/textmin/R/textdoccol.R
File length: 20004 byte(s)
Diff to previous 63 , to selected 1357
Corrected NAMESPACE. Some minor improvements in the code.

Revision 63 - (view) (download) (annotate) - [select for diffs]
Modified Thu Oct 26 14:59:09 2006 UTC (12 years, 5 months ago) by feinerer
Original Path: trunk/R/textmin/R/textdoccol.R
File length: 20004 byte(s)
Diff to previous 62 , to selected 1357
See ChangeLog.

Revision 62 - (view) (download) (annotate) - [select for diffs]
Modified Tue Oct 24 10:08:58 2006 UTC (12 years, 6 months ago) by feinerer
Original Path: trunk/R/textmin/R/textdoccol.R
File length: 14363 byte(s)
Diff to previous 61 , to selected 1357
See ChangeLog.

Revision 61 - (view) (download) (annotate) - [select for diffs]
Modified Mon Oct 23 20:07:05 2006 UTC (12 years, 6 months ago) by feinerer
Original Path: trunk/R/textmin/R/textdoccol.R
File length: 14099 byte(s)
Diff to previous 60 , to selected 1357
See ChangeLog.

Revision 60 - (view) (download) (annotate) - [select for diffs]
Modified Sun Oct 22 17:57:47 2006 UTC (12 years, 6 months ago) by feinerer
Original Path: trunk/R/textmin/R/textdoccol.R
File length: 13019 byte(s)
Diff to previous 57 , to selected 1357
See ChangeLog.

Revision 57 - (view) (download) (annotate) - [select for diffs]
Modified Sun Sep 24 14:27:54 2006 UTC (12 years, 7 months ago) by feinerer
Original Path: trunk/R/textmin/R/textdoccol.R
File length: 19765 byte(s)
Diff to previous 56 , to selected 1357
Eliminated tm_filter bug.

Revision 56 - (view) (download) (annotate) - [select for diffs]
Modified Sun Sep 24 14:12:28 2006 UTC (12 years, 7 months ago) by feinerer
Original Path: trunk/R/textmin/R/textdoccol.R
File length: 19884 byte(s)
Diff to previous 55 , to selected 1357
See ChangeLog.

Revision 55 - (view) (download) (annotate) - [select for diffs]
Modified Thu Sep 14 12:31:07 2006 UTC (12 years, 7 months ago) by feinerer
Original Path: trunk/R/textmin/R/textdoccol.R
File length: 16601 byte(s)
Diff to previous 54 , to selected 1357
Minor improvements.

Revision 54 - (view) (download) (annotate) - [select for diffs]
Modified Wed Sep 13 09:08:20 2006 UTC (12 years, 7 months ago) by feinerer
Original Path: trunk/R/textmin/R/textdoccol.R
File length: 16258 byte(s)
Diff to previous 53 , to selected 1357
length, show and summary functions. Renamed transformXXX and filterXXX to tm_transform and tm_filter.

Revision 53 - (view) (download) (annotate) - [select for diffs]
Modified Thu Aug 24 13:06:50 2006 UTC (12 years, 8 months ago) by feinerer
Original Path: trunk/R/textmin/R/textdoccol.R
File length: 15303 byte(s)
Diff to previous 52 , to selected 1357
See ChangeLog for changes.

Revision 52 - (view) (download) (annotate) - [select for diffs]
Modified Sat Aug 12 12:43:39 2006 UTC (12 years, 8 months ago) by feinerer
Original Path: trunk/R/textmin/R/textdoccol.R
File length: 13701 byte(s)
Diff to previous 51 , to selected 1357
Various updates. See ChangeLog and diff source code.

Revision 51 - (view) (download) (annotate) - [select for diffs]
Modified Mon Aug 7 12:14:09 2006 UTC (12 years, 8 months ago) by feinerer
Original Path: trunk/R/textmin/R/textdoccol.R
File length: 12942 byte(s)
Diff to previous 49 , to selected 1357
Various changes due to new layout.

Revision 49 - (view) (download) (annotate) - [select for diffs]
Modified Sun Aug 6 10:12:13 2006 UTC (12 years, 8 months ago) by feinerer
Original Path: trunk/R/textmin/R/textdoccol.R
File length: 13212 byte(s)
Diff to previous 48 , to selected 1357
Improved design. See ChangeLog for details.

Revision 48 - (view) (download) (annotate) - [select for diffs]
Modified Thu Jul 13 13:47:31 2006 UTC (12 years, 9 months ago) by feinerer
Original Path: trunk/R/textmin/R/textdoccol.R
File length: 8671 byte(s)
Diff to previous 47 , to selected 1357
Clean up of old stuff.

Revision 47 - (view) (download) (annotate) - [select for diffs]
Modified Mon Jul 10 12:22:35 2006 UTC (12 years, 9 months ago) by feinerer
Original Path: trunk/R/textmin/R/textdoccol.R
File length: 8747 byte(s)
Diff to previous 46 , to selected 1357
Renamed the tm directory to textmin.

Revision 46 - (view) (download) (annotate) - [select for diffs]
Modified Wed Jul 5 18:08:41 2006 UTC (12 years, 9 months ago) by meyer
Original Path: trunk/R/tm/R/textdoccol.R
File length: 8747 byte(s)
Diff to previous 45 , to selected 1357
move


Revision 45 - (view) (download) (annotate) - [select for diffs]
Modified Wed Jul 5 17:27:29 2006 UTC (12 years, 9 months ago) by meyer
Original Path: trunk/R/trunk/tm/R/textdoccol.R
File length: 8747 byte(s)
Diff to previous 42 , to selected 1357
move in subdir


Revision 42 - (view) (download) (annotate) - [select for diffs]
Modified Sat Jul 1 08:42:26 2006 UTC (12 years, 9 months ago) by feinerer
Original Path: trunk/R/trunk/R/textdoccol.R
File length: 8747 byte(s)
Diff to previous 41 , to selected 1357
Changed S4 method signatures.

Revision 41 - (view) (download) (annotate) - [select for diffs]
Modified Sun Mar 12 17:14:15 2006 UTC (13 years, 1 month ago) by feinerer
Original Path: trunk/R/trunk/R/textdoccol.R
File length: 8665 byte(s)
Diff to previous 40 , to selected 1357
Automatic RIS import implemented

Revision 40 - (view) (download) (annotate) - [select for diffs]
Modified Tue Feb 14 15:02:45 2006 UTC (13 years, 2 months ago) by feinerer
Original Path: trunk/R/trunk/R/textdoccol.R
File length: 8025 byte(s)
Diff to previous 39 , to selected 1357
See ChangeLog

Revision 39 - (view) (download) (annotate) - [select for diffs]
Modified Sat Jan 21 09:37:39 2006 UTC (13 years, 3 months ago) by feinerer
Original Path: trunk/R/trunk/R/textdoccol.R
File length: 6439 byte(s)
Diff to previous 37 , to selected 1357
Removed bug

Revision 37 - (view) (download) (annotate) - [select for diffs]
Modified Wed Jan 11 17:49:17 2006 UTC (13 years, 3 months ago) by feinerer
Original Path: trunk/R/trunk/R/textdoccol.R
File length: 6022 byte(s)
Diff to previous 36 , to selected 1357


Revision 36 - (view) (download) (annotate) - [select for diffs]
Modified Wed Jan 11 15:42:56 2006 UTC (13 years, 3 months ago) by feinerer
Original Path: trunk/R/trunk/R/textdoccol.R
File length: 5143 byte(s)
Diff to previous 33 , to selected 1357
See ChangeLog

Revision 33 - (view) (download) (annotate) - [select for diffs]
Modified Thu Dec 15 13:29:17 2005 UTC (13 years, 4 months ago) by feinerer
Original Path: trunk/R/trunk/R/textdoccol.R
File length: 4995 byte(s)
Diff to previous 32 , to selected 1357
See ChangeLog

Revision 32 - (view) (download) (annotate) - [select for diffs]
Modified Thu Dec 15 13:13:54 2005 UTC (13 years, 4 months ago) by feinerer
Original Path: trunk/R/trunk/R/textdoccol.R
File length: 5115 byte(s)
Diff to previous 26 , to selected 1357


Revision 26 - (view) (download) (annotate) - [select for diffs]
Modified Sat Dec 3 15:20:17 2005 UTC (13 years, 4 months ago) by feinerer
Original Path: trunk/R/trunk/R/textdoccol.R
File length: 5876 byte(s)
Diff to previous 24 , to selected 1357
See ChangeLog

Revision 24 - (view) (download) (annotate) - [select for diffs]
Modified Sun Nov 20 15:31:34 2005 UTC (13 years, 5 months ago) by feinerer
Original Path: trunk/R/trunk/R/textdoccol.R
File length: 5874 byte(s)
Diff to previous 23 , to selected 1357


Revision 23 - (view) (download) (annotate) - [select for diffs]
Modified Sat Nov 19 18:25:41 2005 UTC (13 years, 5 months ago) by feinerer
Original Path: trunk/R/trunk/R/textdoccol.R
File length: 5451 byte(s)
Diff to previous 22 , to selected 1357
Enabled import of files in Reuters-21578 XML format

Revision 22 - (view) (download) (annotate) - [select for diffs]
Modified Sat Nov 19 16:58:34 2005 UTC (13 years, 5 months ago) by feinerer
Original Path: trunk/R/trunk/R/textdoccol.R
File length: 3655 byte(s)
Diff to previous 21 , to selected 1357
See ChangeLog

Revision 21 - (view) (download) (annotate) - [select for diffs]
Modified Sat Nov 19 10:23:19 2005 UTC (13 years, 5 months ago) by feinerer
Original Path: trunk/R/trunk/R/textdoccol.R
File length: 2120 byte(s)
Diff to previous 20 , to selected 1357
See ChangeLog

Revision 20 - (view) (download) (annotate) - [select for diffs]
Modified Tue Nov 8 16:40:52 2005 UTC (13 years, 5 months ago) by feinerer
Original Path: trunk/R/trunk/R/textdoccol.R
File length: 1714 byte(s)
Diff to previous 19 , to selected 1357
See ChangeLog

Revision 19 - (view) (download) (annotate) - [select for diffs]
Modified Sun Nov 6 15:38:48 2005 UTC (13 years, 5 months ago) by feinerer
Original Path: trunk/R/trunk/R/textdoccol.R
File length: 1526 byte(s)
Diff to previous 18 , to selected 1357


Revision 18 - (view) (download) (annotate) - [select for diffs]
Modified Sat Nov 5 19:00:05 2005 UTC (13 years, 5 months ago) by feinerer
Original Path: trunk/R/trunk/R/textdoccol.R
File length: 1500 byte(s)
Diff to previous 17 , to selected 1357


Revision 17 - (view) (download) (annotate) - [select for diffs]
Added Sat Nov 5 14:47:12 2005 UTC (13 years, 5 months ago) by feinerer
Original Path: trunk/R/trunk/R/textdoccol.R
File length: 1442 byte(s)
Diff to selected 1357
For details, see ChangeLog.
