SCM

SCM Repository

[tm] Log of /pkg/R/corpus.R
[tm] / pkg / R / corpus.R  
ViewVC logotype

Log of /pkg/R/corpus.R

Parent Directory Parent Directory


Links to HEAD: (view) (download) (annotate)
Sticky Revision:

Revision 1481 - (view) (download) (annotate) - [select for diffs]
Modified Sat May 20 10:28:00 2017 UTC (17 months ago) by feinerer
File length: 8767 byte(s)
Diff to previous 1467
Support TIF for DataframeSource

See Text Interchange Formats (TIF, https://github.com/ropensci/tif) and
readtext (https://github.com/kbenoit/readtext).

Revision 1467 - (view) (download) (annotate) - [select for diffs]
Modified Sun Jan 22 18:06:19 2017 UTC (20 months, 4 weeks ago) by khornik
File length: 7858 byte(s)
Diff to previous 1460
Register native routines.

Revision 1460 - (view) (download) (annotate) - [select for diffs]
Modified Mon Jan 9 17:01:04 2017 UTC (21 months, 1 week ago) by feinerer
File length: 7852 byte(s)
Diff to previous 1458
Check before forcing as.character()

as.character() strips attributes including names, so avoid the call if the
content is already a character vector.

Revision 1458 - (view) (download) (annotate) - [select for diffs]
Modified Sun Jan 8 18:46:50 2017 UTC (21 months, 1 week ago) by feinerer
File length: 7804 byte(s)
Diff to previous 1445
Ensure character content

Revision 1445 - (view) (download) (annotate) - [select for diffs]
Modified Sun Oct 9 09:30:58 2016 UTC (2 years ago) by feinerer
File length: 7790 byte(s)
Diff to previous 1440
Speed up termFreq(), general cleanup

- Avoid parallel::mclapply()
- Use custom .table()
- Use rep.int(), rep_len() and lengths()
- Fix typos
- Shorten overlong lines
- Consistent formatting

Revision 1440 - (view) (download) (annotate) - [select for diffs]
Modified Sat Jul 30 06:34:57 2016 UTC (2 years, 2 months ago) by feinerer
File length: 7792 byte(s)
Diff to previous 1437
Corpus() now chooses between SimpleCorpus and VCorpus based on its arguments

Revision 1437 - (view) (download) (annotate) - [select for diffs]
Modified Wed Jul 13 19:23:49 2016 UTC (2 years, 3 months ago) by feinerer
File length: 7372 byte(s)
Diff to previous 1419
Add SimpleCorpus

SimpleCorpus provides a corpus which is optimized for the most common usage
scenario: importing plain texts from files in a directory or directly from a
vector in R, preprocessing and transforming the texts, and finally exporting
them to a term-document matrix. The aim is to boost performance and minimize
memory pressure. It loads all documents into memory, and is designed for
medium-sized to large data sets.

Revision 1419 - (view) (download) (annotate) - [select for diffs]
Modified Sat May 2 17:23:47 2015 UTC (3 years, 5 months ago) by feinerer
File length: 6081 byte(s)
Diff to previous 1411
Sync format()/print() with NLP

Revision 1411 - (view) (download) (annotate) - [select for diffs]
Modified Sat Feb 28 18:16:54 2015 UTC (3 years, 7 months ago) by feinerer
File length: 6097 byte(s)
Diff to previous 1409
Let as.list.[PV]Corpus() return names

Revision 1409 - (view) (download) (annotate) - [select for diffs]
Modified Fri Feb 27 16:10:18 2015 UTC (3 years, 7 months ago) by feinerer
File length: 6041 byte(s)
Diff to previous 1404
Add as.VCorpus.list()

Revision 1404 - (view) (download) (annotate) - [select for diffs]
Modified Tue Feb 17 18:04:22 2015 UTC (3 years, 8 months ago) by feinerer
File length: 5990 byte(s)
Diff to previous 1397
Avoid (rather expensive) structure()

Revision 1397 - (view) (download) (annotate) - [select for diffs]
Modified Fri Sep 12 19:30:27 2014 UTC (4 years, 1 month ago) by feinerer
File length: 6051 byte(s)
Diff to previous 1383
Add open() and close() for sources

Useful for sources with complex or expensive setup, e.g., database connections
or file handles.

Revision 1383 - (view) (download) (annotate) - [select for diffs]
Modified Thu May 29 07:32:14 2014 UTC (4 years, 4 months ago) by feinerer
File length: 5981 byte(s)
Diff to previous 1379
Remove handling of undocumented reader options

Revision 1379 - (view) (download) (annotate) - [select for diffs]
Modified Tue May 27 17:55:29 2014 UTC (4 years, 4 months ago) by feinerer
File length: 6283 byte(s)
Diff to previous 1377
Provide names<-() for VCorpus and PCorpus

Revision 1377 - (view) (download) (annotate) - [select for diffs]
Modified Wed May 21 17:15:56 2014 UTC (4 years, 5 months ago) by feinerer
File length: 6161 byte(s)
Diff to previous 1376
Provide names() for corpora

Revision 1376 - (view) (download) (annotate) - [select for diffs]
Modified Wed May 21 14:36:35 2014 UTC (4 years, 5 months ago) by feinerer
File length: 6073 byte(s)
Diff to previous 1366
Remove names() from Source API

Revision 1366 - (view) (download) (annotate) - [select for diffs]
Modified Mon Apr 28 14:48:37 2014 UTC (4 years, 5 months ago) by feinerer
File length: 6393 byte(s)
Diff to previous 1363
Support to set a named list as meta in PlainTextDocument and XMLTextDocument

Revision 1363 - (view) (download) (annotate) - [select for diffs]
Modified Mon Apr 28 09:49:46 2014 UTC (4 years, 5 months ago) by feinerer
File length: 6443 byte(s)
Diff to previous 1357
Keep Corpus() alias

Revision 1357 - (view) (download) (annotate) - [select for diffs]
Modified Thu Apr 24 06:33:35 2014 UTC (4 years, 5 months ago) by feinerer
File length: 6433 byte(s)
Diff to previous 1350
Simplify SimpleSource() default arguments

Revision 1350 - (view) (download) (annotate) - [select for diffs]
Modified Tue Apr 22 07:41:14 2014 UTC (4 years, 6 months ago) by feinerer
File length: 6722 byte(s)
Diff to previous 1348
Reorder, fix typo

Revision 1348 - (view) (download) (annotate) - [select for diffs]
Modified Tue Apr 22 07:09:41 2014 UTC (4 years, 6 months ago) by feinerer
File length: 6722 byte(s)
Diff to previous 1342
Provide as.VCorpus() generic

Revision 1342 - (view) (download) (annotate) - [select for diffs]
Modified Sat Apr 19 17:06:45 2014 UTC (4 years, 6 months ago) by feinerer
File length: 6636 byte(s)
Diff to previous 1336
Sync vignette with code, small bug fix for access with names via [[

Revision 1336 - (view) (download) (annotate) - [select for diffs]
Modified Sat Apr 19 08:59:39 2014 UTC (4 years, 6 months ago) by feinerer
File length: 6616 byte(s)
Diff to previous 1333
Implement and describe Source API

Revision 1333 - (view) (download) (annotate) - [select for diffs]
Modified Fri Apr 18 10:38:46 2014 UTC (4 years, 6 months ago) by feinerer
File length: 6694 byte(s)
Diff to previous 1329
Update Corpus documentation

Revision 1329 - (view) (download) (annotate) - [select for diffs]
Modified Tue Apr 15 17:16:03 2014 UTC (4 years, 6 months ago) by feinerer
File length: 6704 byte(s)
Diff to previous 1328
Synchronize print() appearance with NLP

Revision 1328 - (view) (download) (annotate) - [select for diffs]
Modified Tue Apr 15 09:46:28 2014 UTC (4 years, 6 months ago) by feinerer
File length: 6892 byte(s)
Diff to previous 1327
Rearrange, update manual

Revision 1327 - (view) (download) (annotate) - [select for diffs]
Modified Mon Apr 14 15:35:38 2014 UTC (4 years, 6 months ago) by feinerer
File length: 7338 byte(s)
Diff to previous 1315
Simplify c.VCorpus()

Revision 1315 - (view) (download) (annotate) - [select for diffs]
Modified Mon Mar 31 08:38:05 2014 UTC (4 years, 6 months ago) by feinerer
File length: 8831 byte(s)
Diff to previous 1313
Simplify tm_map, tm_filter, and tm_index; remove makeChunks; rework lazy maps

Revision 1313 - (view) (download) (annotate) - [select for diffs]
Modified Sun Mar 30 09:28:00 2014 UTC (4 years, 6 months ago) by feinerer
File length: 10157 byte(s)
Diff to previous 1312
content() and as.list() now give the full documents

Revision 1312 - (view) (download) (annotate) - [select for diffs]
Modified Sat Mar 29 09:35:44 2014 UTC (4 years, 6 months ago) by feinerer
File length: 10522 byte(s)
Diff to previous 1311
Simplify corpus metadata and PCorpus metadata storage

Revision 1311 - (view) (download) (annotate) - [select for diffs]
Modified Thu Mar 27 14:15:08 2014 UTC (4 years, 6 months ago) by feinerer
File length: 11295 byte(s)
Diff to previous 1308
Some bug fixes

Revision 1308 - (view) (download) (annotate) - [select for diffs]
Modified Tue Mar 25 15:02:15 2014 UTC (4 years, 6 months ago) by feinerer
File length: 11286 byte(s)
Diff to previous 1307
Bug fixes. More to come ...

Revision 1307 - (view) (download) (annotate) - [select for diffs]
Modified Tue Mar 25 12:15:51 2014 UTC (4 years, 6 months ago) by feinerer
File length: 11279 byte(s)
Diff to previous 1306
Redesign corpora

Revision 1306 - (view) (download) (annotate) - [select for diffs]
Modified Tue Mar 25 08:37:05 2014 UTC (4 years, 6 months ago) by feinerer
File length: 11715 byte(s)
Diff to previous 1300
Improve writeCorpus, use lower case in internal data structures

Revision 1300 - (view) (download) (annotate) - [select for diffs]
Modified Fri Mar 21 14:30:05 2014 UTC (4 years, 7 months ago) by feinerer
File length: 11720 byte(s)
Diff to previous 1297
Redesign text documents

This is a major change and causes fallout. Soon to be fixed ...

Revision 1297 - (view) (download) (annotate) - [select for diffs]
Modified Thu Mar 20 18:43:22 2014 UTC (4 years, 7 months ago) by feinerer
File length: 11688 byte(s)
Diff to previous 1285
Redesign sources

Revision 1285 - (view) (download) (annotate) - [select for diffs]
Modified Sat Jan 11 09:31:10 2014 UTC (4 years, 9 months ago) by feinerer
File length: 11670 byte(s)
Diff to previous 1274
Simplify checks

Revision 1274 - (view) (download) (annotate) - [select for diffs]
Modified Sun Jan 5 10:51:18 2014 UTC (4 years, 9 months ago) by feinerer
File length: 11678 byte(s)
Diff to previous 1273
More sanity checks

Revision 1273 - (view) (download) (annotate) - [select for diffs]
Modified Sun Jan 5 08:42:02 2014 UTC (4 years, 9 months ago) by feinerer
File length: 11696 byte(s)
Diff to previous 1261
Some sanity checks

Revision 1261 - (view) (download) (annotate) - [select for diffs]
Modified Fri Sep 27 09:37:35 2013 UTC (5 years ago) by feinerer
File length: 11620 byte(s)
Diff to previous 1259
Allow multiple URIs for URISource, default to vectorized sources, simplify eoi()

Revision 1259 - (view) (download) (annotate) - [select for diffs]
Modified Sat Sep 21 07:36:25 2013 UTC (5 years, 1 month ago) by feinerer
File length: 11520 byte(s)
Diff to previous 1258
Remove unused arguments, sync VCorpus and PCorpus constructors

Revision 1258 - (view) (download) (annotate) - [select for diffs]
Modified Fri Sep 20 12:15:42 2013 UTC (5 years, 1 month ago) by feinerer
File length: 11558 byte(s)
Diff to previous 1242
Remove GmaneSource() and readGmane(), simplify readers, improve documentation

Revision 1242 - (view) (download) (annotate) - [select for diffs]
Modified Mon Aug 19 05:33:57 2013 UTC (5 years, 2 months ago) by feinerer
File length: 11457 byte(s)
Diff to previous 1203
Do not register VCorpus and PlainTextDocument as S4 classes anymore

Revision 1203 - (view) (download) (annotate) - [select for diffs]
Modified Fri Jan 11 19:43:37 2013 UTC (5 years, 9 months ago) by khornik
File length: 11780 byte(s)
Diff to previous 1114
Improve formals for c() methods.

Revision 1114 - (view) (download) (annotate) - [select for diffs]
Modified Fri Nov 26 14:05:54 2010 UTC (7 years, 10 months ago) by feinerer
File length: 11666 byte(s)
Diff to previous 1108
Allow init and exit hooks for readers

Revision 1108 - (view) (download) (annotate) - [select for diffs]
Modified Fri Oct 22 18:32:47 2010 UTC (8 years ago) by feinerer
File length: 11364 byte(s)
Diff to previous 1102
Change Weighting from list element to attribute, access documents by name

Revision 1102 - (view) (download) (annotate) - [select for diffs]
Modified Sat Oct 16 10:01:09 2010 UTC (8 years ago) by feinerer
File length: 11191 byte(s)
Diff to previous 1095
Access documents by their document ID

Revision 1095 - (view) (download) (annotate) - [select for diffs]
Modified Wed Aug 25 19:05:38 2010 UTC (8 years, 1 month ago) by feinerer
File length: 11035 byte(s)
Diff to previous 1074
Use the \code{recursive} argument to determine whether meta data is used when merging corpora

Revision 1074 - (view) (download) (annotate) - [select for diffs]
Modified Fri May 28 12:58:59 2010 UTC (8 years, 4 months ago) by feinerer
File length: 10785 byte(s)
Diff to previous 1073
Fix typo

Revision 1073 - (view) (download) (annotate) - [select for diffs]
Modified Fri May 28 12:32:46 2010 UTC (8 years, 4 months ago) by feinerer
File length: 10786 byte(s)
Diff to previous 1070
Use IETF language tags for language codes

Revision 1070 - (view) (download) (annotate) - [select for diffs]
Modified Tue May 18 08:58:22 2010 UTC (8 years, 5 months ago) by feinerer
File length: 10787 byte(s)
Diff to previous 1064
Use element names as document IDs if provided by a source

Revision 1064 - (view) (download) (annotate) - [select for diffs]
Modified Fri Apr 9 10:43:22 2010 UTC (8 years, 6 months ago) by feinerer
File length: 10664 byte(s)
Diff to previous 1063
Use document names provided by a source

Revision 1063 - (view) (download) (annotate) - [select for diffs]
Modified Fri Apr 9 10:36:39 2010 UTC (8 years, 6 months ago) by feinerer
File length: 10638 byte(s)
Diff to previous 1025
Sources can now provide document names

Revision 1025 - (view) (download) (annotate) - [select for diffs]
Modified Fri Dec 11 08:56:22 2009 UTC (8 years, 10 months ago) by feinerer
File length: 10593 byte(s)
Diff to previous 1021
Register S3 document classes to be recognized by S4 methods.

Revision 1021 - (view) (download) (annotate) - [select for diffs]
Modified Tue Nov 17 16:37:22 2009 UTC (8 years, 11 months ago) by feinerer
File length: 10590 byte(s)
Diff to previous 1004
Register S3 corpus classes to be recognized by S4 methods.

Revision 1004 - (view) (download) (annotate) - [select for diffs]
Modified Tue Sep 8 10:28:28 2009 UTC (9 years, 1 month ago) by feinerer
File length: 10267 byte(s)
Diff to previous 995
Improve vignette.

Revision 995 - (view) (download) (annotate) - [select for diffs]
Modified Mon Sep 7 07:54:08 2009 UTC (9 years, 1 month ago) by feinerer
File length: 10357 byte(s)
Diff to previous 988
Use NextMethod().

Revision 988 - (view) (download) (annotate) - [select for diffs]
Modified Fri Sep 4 12:27:12 2009 UTC (9 years, 1 month ago) by feinerer
File length: 10534 byte(s)
Diff to previous 987
Update documentation.

Revision 987 - (view) (download) (annotate) - [select for diffs]
Modified Wed Sep 2 17:54:45 2009 UTC (9 years, 1 month ago) by feinerer
File length: 11364 byte(s)
Diff to previous 986
Update documentation.

Revision 986 - (view) (download) (annotate) - [select for diffs]
Modified Tue Sep 1 15:33:30 2009 UTC (9 years, 1 month ago) by feinerer
File length: 11632 byte(s)
Diff to previous 985
Further changes due to S3 class system.

Revision 985 - (view) (download) (annotate) - [select for diffs]
Modified Thu Aug 27 18:09:05 2009 UTC (9 years, 1 month ago) by feinerer
File length: 11781 byte(s)
Diff to previous 984
Use S3 instead of S4 class system.

Revision 984 - (view) (download) (annotate) - [select for diffs]
Modified Fri Aug 14 16:32:35 2009 UTC (9 years, 2 months ago) by feinerer
File length: 23441 byte(s)
Diff to previous 982
Remove obsolete appendElem() method (use c() instead).

Revision 982 - (view) (download) (annotate) - [select for diffs]
Modified Tue Aug 11 07:48:04 2009 UTC (9 years, 2 months ago) by feinerer
File length: 24336 byte(s)
Diff to previous 973
Moved readMail and MailDocument class from tm to tm.plugin.mail.

Revision 973 - (view) (download) (annotate) - [select for diffs]
Modified Sat Jul 4 08:10:25 2009 UTC (9 years, 3 months ago) by feinerer
File length: 24433 byte(s)
Diff to previous 966
Rename readNewsgroup to readMail.

Revision 966 - (view) (download) (annotate) - [select for diffs]
Modified Mon Jun 29 09:05:14 2009 UTC (9 years, 3 months ago) by feinerer
File length: 24784 byte(s)
Diff to previous 963
Remove TODO item.

Revision 963 - (view) (download) (annotate) - [select for diffs]
Modified Mon Jun 29 07:01:19 2009 UTC (9 years, 3 months ago) by feinerer
File length: 24830 byte(s)
Diff to previous 962
Rename SCorpus to VCorpus (Volatile Corpus).

Revision 962 - (view) (download) (annotate) - [select for diffs]
Modified Sun Jun 28 15:52:33 2009 UTC (9 years, 3 months ago) by feinerer
File length: 24830 byte(s)
Diff to previous 960
Fix documentation.

Revision 960 - (view) (download) (annotate) - [select for diffs]
Modified Fri Jun 26 17:43:45 2009 UTC (9 years, 3 months ago) by feinerer
File length: 24632 byte(s)
Diff to previous 958
Add slam dependency and readReut21578XMLasPlain reader.

Revision 958 - (view) (download) (annotate) - [select for diffs]
Modified Sat Jun 13 06:06:42 2009 UTC (9 years, 4 months ago) by feinerer
File length: 24780 byte(s)
Diff to previous 952
Code cleanup.

Revision 952 - (view) (download) (annotate) - [select for diffs]
Modified Mon May 18 13:43:01 2009 UTC (9 years, 5 months ago) by feinerer
File length: 24807 byte(s)
Diff to previous 950
Further work on FCorpus integration.

Revision 950 - (view) (download) (annotate) - [select for diffs]
Modified Thu May 14 15:17:18 2009 UTC (9 years, 5 months ago) by feinerer
File length: 24281 byte(s)
Diff to previous 946
Experimental FCorpus (fast corpus).

Revision 946 - (view) (download) (annotate) - [select for diffs]
Modified Wed May 13 18:07:35 2009 UTC (9 years, 5 months ago) by feinerer
File length: 23422 byte(s)
Copied from: pkg/R/textdoccol.R revision 945
Diff to previous 940
A lot of major improvements (see NEWS).

Revision 940 - (view) (download) (annotate) - [select for diffs]
Modified Sun Apr 26 12:35:25 2009 UTC (9 years, 5 months ago) by feinerer
Original Path: pkg/R/textdoccol.R
File length: 31633 byte(s)
Diff to previous 938
Fix codetools warnings.

Revision 938 - (view) (download) (annotate) - [select for diffs]
Modified Sat Apr 25 19:05:50 2009 UTC (9 years, 5 months ago) by feinerer
Original Path: pkg/R/textdoccol.R
File length: 31613 byte(s)
Diff to previous 909
Get rid of Matrix package dependency.

Revision 909 - (view) (download) (annotate) - [select for diffs]
Modified Sun Mar 22 12:45:59 2009 UTC (9 years, 7 months ago) by feinerer
Original Path: pkg/R/textdoccol.R
File length: 31803 byte(s)
Diff to previous 905
Sources now can be vectorized.

Revision 905 - (view) (download) (annotate) - [select for diffs]
Modified Sat Mar 21 10:13:08 2009 UTC (9 years, 7 months ago) by feinerer
Original Path: pkg/R/textdoccol.R
File length: 31363 byte(s)
Diff to previous 902
Always use UTC time zone for DateTimeStamps.

Revision 902 - (view) (download) (annotate) - [select for diffs]
Modified Fri Mar 20 20:08:39 2009 UTC (9 years, 7 months ago) by feinerer
Original Path: pkg/R/textdoccol.R
File length: 31195 byte(s)
Diff to previous 900
Fix arguments to Reduce() call.

Revision 900 - (view) (download) (annotate) - [select for diffs]
Modified Fri Mar 20 16:50:27 2009 UTC (9 years, 7 months ago) by feinerer
Original Path: pkg/R/textdoccol.R
File length: 31186 byte(s)
Diff to previous 894
Add URL to DESCRIPTION. Use Reduce() function.

Revision 894 - (view) (download) (annotate) - [select for diffs]
Modified Tue Mar 3 15:33:22 2009 UTC (9 years, 7 months ago) by feinerer
Original Path: pkg/R/textdoccol.R
File length: 31303 byte(s)
Diff to previous 886
Use the notion corpus instead of text document collection.

Revision 886 - (view) (download) (annotate) - [select for diffs]
Modified Thu Jan 29 22:47:34 2009 UTC (9 years, 8 months ago) by feinerer
Original Path: pkg/R/textdoccol.R
File length: 31375 byte(s)
Diff to previous 885
Speed up package loading (Depends -> Suggests).

Revision 885 - (view) (download) (annotate) - [select for diffs]
Modified Thu Jan 29 09:34:44 2009 UTC (9 years, 8 months ago) by stefan7th
Original Path: pkg/R/textdoccol.R
File length: 31006 byte(s)
Diff to previous 884
moved package to /pkg

Revision 884 - (view) (download) (annotate) - [select for diffs]
Modified Wed Jan 28 10:24:27 2009 UTC (9 years, 8 months ago) by stefan7th
Original Path: pkg/tm/R/textdoccol.R
File length: 31006 byte(s)
Diff to previous 875
R-Forge transition completed

Revision 875 - (view) (download) (annotate) - [select for diffs]
Modified Sat Dec 6 13:25:03 2008 UTC (9 years, 10 months ago) by feinerer
Original Path: trunk/tm/R/textdoccol.R
File length: 31006 byte(s)
Diff to previous 870
Fixed non-standard call evaluation.

Revision 870 - (view) (download) (annotate) - [select for diffs]
Modified Mon Nov 10 15:29:22 2008 UTC (9 years, 11 months ago) by feinerer
Original Path: trunk/tm/R/textdoccol.R
File length: 30992 byte(s)
Diff to previous 869
Fix documentation and codoc mismatches.

Revision 869 - (view) (download) (annotate) - [select for diffs]
Modified Sat Nov 8 09:16:37 2008 UTC (9 years, 11 months ago) by feinerer
Original Path: trunk/tm/R/textdoccol.R
File length: 30974 byte(s)
Diff to previous 861
Sources now have a Length slot. Knowing the length in advance makes corpus construction a lot faster (~ 8 times faster).

Revision 861 - (view) (download) (annotate) - [select for diffs]
Modified Thu Jul 24 09:55:09 2008 UTC (10 years, 3 months ago) by feinerer
Original Path: trunk/tm/R/textdoccol.R
File length: 30517 byte(s)
Diff to previous 860
tmIndex(), tmFilter(), tmMap(), and TermDocMatrix() now use a MPI cluster if available.

Revision 860 - (view) (download) (annotate) - [select for diffs]
Modified Fri Jul 18 05:05:20 2008 UTC (10 years, 3 months ago) by feinerer
Original Path: trunk/tm/R/textdoccol.R
File length: 29904 byte(s)
Diff to previous 859
Removed some forgotten debug print out.

Revision 859 - (view) (download) (annotate) - [select for diffs]
Modified Wed Jul 9 13:39:52 2008 UTC (10 years, 3 months ago) by feinerer
Original Path: trunk/tm/R/textdoccol.R
File length: 29934 byte(s)
Diff to previous 856
Improved documentation.

Revision 856 - (view) (download) (annotate) - [select for diffs]
Modified Fri Jun 6 11:45:39 2008 UTC (10 years, 4 months ago) by feinerer
Original Path: trunk/tm/R/textdoccol.R
File length: 29925 byte(s)
Diff to previous 854
Improved meta data extraction from Reuters Corpus Volume 1 documents.

Revision 854 - (view) (download) (annotate) - [select for diffs]
Modified Sun May 25 13:15:06 2008 UTC (10 years, 4 months ago) by feinerer
Original Path: trunk/tm/R/textdoccol.R
File length: 30190 byte(s)
Diff to previous 853
searchFullText is now the default function used for tmFilter and tmIndex.

Revision 853 - (view) (download) (annotate) - [select for diffs]
Modified Sun May 18 13:09:35 2008 UTC (10 years, 5 months ago) by feinerer
Original Path: trunk/tm/R/textdoccol.R
File length: 31191 byte(s)
Diff to previous 837
Improved stem completion. Some documentation fixes.

Revision 837 - (view) (download) (annotate) - [select for diffs]
Modified Wed Apr 23 09:16:25 2008 UTC (10 years, 6 months ago) by feinerer
Original Path: trunk/tm/R/textdoccol.R
File length: 31119 byte(s)
Diff to previous 836
Improved show methods.

Revision 836 - (view) (download) (annotate) - [select for diffs]
Modified Sat Apr 19 17:08:07 2008 UTC (10 years, 6 months ago) by feinerer
Original Path: trunk/tm/R/textdoccol.R
File length: 31108 byte(s)
Diff to previous 833
Improved meta data handling. Added coerce method from list to corpus. Updated CITATION file.

Revision 833 - (view) (download) (annotate) - [select for diffs]
Modified Fri Mar 21 10:55:11 2008 UTC (10 years, 7 months ago) by feinerer
Original Path: trunk/tm/R/textdoccol.R
File length: 30169 byte(s)
Diff to previous 831
Included improvements suggested by Christian Buchta. Added CITATION file.

Revision 831 - (view) (download) (annotate) - [select for diffs]
Modified Wed Mar 12 09:10:46 2008 UTC (10 years, 7 months ago) by feinerer
Original Path: trunk/tm/R/textdoccol.R
File length: 30003 byte(s)
Diff to previous 830
Fixed bug in [[<- (reported by Christian Buchta).

Revision 830 - (view) (download) (annotate) - [select for diffs]
Modified Tue Mar 11 15:23:28 2008 UTC (10 years, 7 months ago) by feinerer
Original Path: trunk/tm/R/textdoccol.R
File length: 29897 byte(s)
Diff to previous 829
Finished work on lazy mapping.

Revision 829 - (view) (download) (annotate) - [select for diffs]
Modified Mon Mar 10 22:55:39 2008 UTC (10 years, 7 months ago) by feinerer
Original Path: trunk/tm/R/textdoccol.R
File length: 28877 byte(s)
Diff to previous 828
First version of working lazy mapping.

Revision 828 - (view) (download) (annotate) - [select for diffs]
Modified Sun Mar 9 07:47:15 2008 UTC (10 years, 7 months ago) by feinerer
Original Path: trunk/tm/R/textdoccol.R
File length: 28047 byte(s)
Diff to previous 826
Some preliminary code for lazy mapping.

Revision 826 - (view) (download) (annotate) - [select for diffs]
Modified Sat Feb 23 14:38:15 2008 UTC (10 years, 8 months ago) by feinerer
Original Path: trunk/tm/R/textdoccol.R
File length: 27318 byte(s)
Diff to previous 819
asPlain(): Preserve local meta data.

Revision 819 - (view) (download) (annotate) - [select for diffs]
Modified Thu Jan 31 09:09:18 2008 UTC (10 years, 8 months ago) by feinerer
Original Path: trunk/tm/R/textdoccol.R
File length: 27204 byte(s)
Diff to previous 817
Added writeCorpus function.

Revision 817 - (view) (download) (annotate) - [select for diffs]
Modified Wed Jan 30 11:25:20 2008 UTC (10 years, 8 months ago) by feinerer
Original Path: trunk/tm/R/textdoccol.R
File length: 26605 byte(s)
Diff to previous 816
Ensure that dimnames are always set correctly when generating a TermDocMatrix.

Revision 816 - (view) (download) (annotate) - [select for diffs]
Modified Thu Jan 24 14:36:41 2008 UTC (10 years, 8 months ago) by feinerer
Original Path: trunk/tm/R/textdoccol.R
File length: 26611 byte(s)
Diff to previous 812
Renamed TextDocCol to Corpus, and Corpus to Content.

Revision 812 - (view) (download) (annotate) - [select for diffs]
Modified Tue Jan 22 13:36:33 2008 UTC (10 years, 9 months ago) by feinerer
Original Path: trunk/tm/R/textdoccol.R
File length: 26714 byte(s)
Diff to previous 809
Ensure that tmUpdate uses provided encoding.

Revision 809 - (view) (download) (annotate) - [select for diffs]
Modified Mon Jan 14 07:16:25 2008 UTC (10 years, 9 months ago) by feinerer
Original Path: trunk/tm/R/textdoccol.R
File length: 26626 byte(s)
Diff to previous 808
Improved handling of default readers.

Revision 808 - (view) (download) (annotate) - [select for diffs]
Modified Sun Jan 13 16:18:27 2008 UTC (10 years, 9 months ago) by feinerer
Original Path: trunk/tm/R/textdoccol.R
File length: 26325 byte(s)
Diff to previous 799
Fixed bug regarding default reader selection when no reader argument is given.

Revision 799 - (view) (download) (annotate) - [select for diffs]
Modified Thu Nov 29 11:05:23 2007 UTC (10 years, 10 months ago) by feinerer
Original Path: trunk/tm/R/textdoccol.R
File length: 26320 byte(s)
Diff to previous 791
Better handling of empty arguments in TextDocCol. Exported readDOC.

Revision 791 - (view) (download) (annotate) - [select for diffs]
Modified Sun Oct 21 11:51:42 2007 UTC (11 years ago) by feinerer
Original Path: trunk/tm/R/textdoccol.R
File length: 26014 byte(s)
Diff to previous 780
New tmIntersect filter.

Revision 780 - (view) (download) (annotate) - [select for diffs]
Modified Sat Sep 29 13:24:17 2007 UTC (11 years ago) by feinerer
Original Path: trunk/tm/R/textdoccol.R
File length: 26325 byte(s)
Diff to previous 777
Added three transformations often used for e-mail analyses.

Revision 777 - (view) (download) (annotate) - [select for diffs]
Modified Tue Aug 28 07:19:12 2007 UTC (11 years, 1 month ago) by feinerer
Original Path: trunk/tm/R/textdoccol.R
File length: 28799 byte(s)
Diff to previous 775
Function generators are now real S4 classes instead of S3 attributes.

Revision 775 - (view) (download) (annotate) - [select for diffs]
Modified Sat Jul 28 13:57:02 2007 UTC (11 years, 2 months ago) by feinerer
Original Path: trunk/tm/R/textdoccol.R
File length: 28803 byte(s)
Diff to previous 772
Added conversion (asPlain) from StructuredTextDocuments to PlainTextDocuments.

Revision 772 - (view) (download) (annotate) - [select for diffs]
Modified Fri Jul 20 14:00:58 2007 UTC (11 years, 3 months ago) by feinerer
Original Path: trunk/tm/R/textdoccol.R
File length: 28327 byte(s)
Diff to previous 769
Updated TODO list.

Revision 769 - (view) (download) (annotate) - [select for diffs]
Modified Sun Jul 15 16:31:59 2007 UTC (11 years, 3 months ago) by feinerer
Original Path: trunk/tm/R/textdoccol.R
File length: 28313 byte(s)
Diff to previous 767
Fixed bug in tmUpdate.

Revision 767 - (view) (download) (annotate) - [select for diffs]
Modified Sat Jul 14 16:50:44 2007 UTC (11 years, 3 months ago) by feinerer
Original Path: trunk/tm/R/textdoccol.R
File length: 28317 byte(s)
Diff to previous 760
Added simple HTML reader to produce StructuredTextDocuments.

Revision 760 - (view) (download) (annotate) - [select for diffs]
Modified Thu Jun 21 22:40:15 2007 UTC (11 years, 4 months ago) by feinerer
Original Path: trunk/tm/R/textdoccol.R
File length: 27977 byte(s)
Diff to previous 757
require() uses the quietly option to suppress loading messages.

Revision 757 - (view) (download) (annotate) - [select for diffs]
Modified Thu Jun 7 17:41:56 2007 UTC (11 years, 4 months ago) by feinerer
Original Path: trunk/tm/R/textdoccol.R
File length: 27961 byte(s)
Diff to previous 755
Added classes for Reuters21578 XML and RCV1 documents.

Revision 755 - (view) (download) (annotate) - [select for diffs]
Modified Sun Jun 3 17:20:40 2007 UTC (11 years, 4 months ago) by feinerer
Original Path: trunk/tm/R/textdoccol.R
File length: 27111 byte(s)
Diff to previous 747
Added replaceWords function.

Revision 747 - (view) (download) (annotate) - [select for diffs]
Modified Fri Apr 27 18:16:53 2007 UTC (11 years, 5 months ago) by feinerer
Original Path: trunk/tm/R/textdoccol.R
File length: 26694 byte(s)
Diff to previous 744
Removed dbDisconnect calls since deprecated by last filehash release.

Revision 744 - (view) (download) (annotate) - [select for diffs]
Modified Mon Apr 23 00:35:10 2007 UTC (11 years, 6 months ago) by feinerer
Original Path: trunk/tm/R/textdoccol.R
File length: 27044 byte(s)
Diff to previous 741
TermDocMatrix is now built by direct stepwise insertion, i.e., we save a lot of memory on construction.

Revision 741 - (view) (download) (annotate) - [select for diffs]
Modified Sat Apr 21 18:35:16 2007 UTC (11 years, 6 months ago) by feinerer
Original Path: trunk/tm/R/textdoccol.R
File length: 27046 byte(s)
Diff to previous 733
Switched back to filehash instead of filehashSQLite.

Revision 733 - (view) (download) (annotate) - [select for diffs]
Modified Wed Apr 11 18:53:49 2007 UTC (11 years, 6 months ago) by feinerer
Original Path: trunk/tm/R/textdoccol.R
File length: 27058 byte(s)
Diff to previous 730
Removed a codetools warning.

Revision 730 - (view) (download) (annotate) - [select for diffs]
Modified Wed Apr 11 02:15:10 2007 UTC (11 years, 6 months ago) by feinerer
Original Path: trunk/tm/R/textdoccol.R
File length: 27005 byte(s)
Diff to previous 729
Updated documentation.

Revision 729 - (view) (download) (annotate) - [select for diffs]
Modified Tue Apr 10 17:08:52 2007 UTC (11 years, 6 months ago) by feinerer
Original Path: trunk/tm/R/textdoccol.R
File length: 27041 byte(s)
Diff to previous 728
Improved database support.

Revision 728 - (view) (download) (annotate) - [select for diffs]
Modified Sun Apr 8 23:23:29 2007 UTC (11 years, 6 months ago) by feinerer
Original Path: trunk/tm/R/textdoccol.R
File length: 27060 byte(s)
Diff to previous 727
Updated database code.

Revision 727 - (view) (download) (annotate) - [select for diffs]
Modified Sun Apr 8 19:36:41 2007 UTC (11 years, 6 months ago) by feinerer
Original Path: trunk/tm/R/textdoccol.R
File length: 26843 byte(s)
Diff to previous 725
Fixed some bugs related to database support.

Revision 725 - (view) (download) (annotate) - [select for diffs]
Modified Fri Apr 6 01:10:28 2007 UTC (11 years, 6 months ago) by feinerer
Original Path: trunk/tm/R/textdoccol.R
File length: 26823 byte(s)
Diff to previous 724
Updated parts of the documentation.

Revision 724 - (view) (download) (annotate) - [select for diffs]
Modified Sun Apr 1 21:13:36 2007 UTC (11 years, 6 months ago) by feinerer
Original Path: trunk/tm/R/textdoccol.R
File length: 26378 byte(s)
Diff to previous 723
Finished experimental database support.

Revision 723 - (view) (download) (annotate) - [select for diffs]
Modified Sun Apr 1 16:12:26 2007 UTC (11 years, 6 months ago) by feinerer
Original Path: trunk/tm/R/textdoccol.R
File length: 25714 byte(s)
Diff to previous 722
Now each source has its own default reader.

Revision 722 - (view) (download) (annotate) - [select for diffs]
Modified Sun Apr 1 15:53:58 2007 UTC (11 years, 6 months ago) by feinerer
Original Path: trunk/tm/R/textdoccol.R
File length: 25670 byte(s)
Diff to previous 721
Prettyprint summary, print method for plain text docs, removePunctuation.

Revision 721 - (view) (download) (annotate) - [select for diffs]
Modified Wed Mar 21 13:54:43 2007 UTC (11 years, 7 months ago) by feinerer
Original Path: trunk/tm/R/textdoccol.R
File length: 25238 byte(s)
Diff to previous 720
Simplified sFilter.

Revision 720 - (view) (download) (annotate) - [select for diffs]
Modified Tue Mar 20 10:43:11 2007 UTC (11 years, 7 months ago) by feinerer
Original Path: trunk/tm/R/textdoccol.R
File length: 26362 byte(s)
Diff to previous 719
Some bug fixes.

Revision 719 - (view) (download) (annotate) - [select for diffs]
Modified Sun Mar 18 09:24:47 2007 UTC (11 years, 7 months ago) by feinerer
Original Path: trunk/tm/R/textdoccol.R
File length: 25853 byte(s)
Diff to previous 717
Improved database support.

Revision 717 - (view) (download) (annotate) - [select for diffs]
Modified Fri Mar 16 11:13:04 2007 UTC (11 years, 7 months ago) by feinerer
Original Path: trunk/tm/R/textdoccol.R
File length: 23828 byte(s)
Diff to previous 715
Added Language slot to text documents. Refactored TextDocCol constructor.

Revision 715 - (view) (download) (annotate) - [select for diffs]
Modified Wed Mar 14 15:16:27 2007 UTC (11 years, 7 months ago) by feinerer
Original Path: trunk/tm/R/textdoccol.R
File length: 23564 byte(s)
Diff to previous 713
Datasets acq and crude can now be created on the fly with tmDataSetup.R.

Revision 713 - (view) (download) (annotate) - [select for diffs]
Modified Wed Mar 14 13:44:11 2007 UTC (11 years, 7 months ago) by feinerer
Original Path: trunk/tm/R/textdoccol.R
File length: 23581 byte(s)
Diff to previous 712
Added Snowball support. Added function returning stopwords (English, German, French).

Revision 712 - (view) (download) (annotate) - [select for diffs]
Modified Sun Mar 4 15:18:36 2007 UTC (11 years, 7 months ago) by feinerer
Original Path: trunk/tm/R/textdoccol.R
File length: 23452 byte(s)
Diff to previous 702
Started to implement database support to optimize RAM usage, i.e., minimize RAM demand if necessary.

Revision 702 - (view) (download) (annotate) - [select for diffs]
Modified Tue Jan 9 09:39:33 2007 UTC (11 years, 9 months ago) by feinerer
Original Path: trunk/tm/R/textdoccol.R
File length: 21164 byte(s)
Diff to previous 698
wordStem now explicitly uses Rstem namespace.

Revision 698 - (view) (download) (annotate) - [select for diffs]
Modified Sat Jan 6 17:05:44 2007 UTC (11 years, 9 months ago) by feinerer
Original Path: trunk/tm/R/textdoccol.R
File length: 21157 byte(s)
Diff to previous 697
Changes due to Kurt's review.

Revision 697 - (view) (download) (annotate) - [select for diffs]
Modified Fri Jan 5 23:09:12 2007 UTC (11 years, 9 months ago) by feinerer
Original Path: trunk/tm/R/textdoccol.R
File length: 21254 byte(s)
Diff to previous 696
Fixed codetools warnings.

Revision 696 - (view) (download) (annotate) - [select for diffs]
Modified Fri Jan 5 15:04:53 2007 UTC (11 years, 9 months ago) by hornik
Original Path: trunk/tm/R/textdoccol.R
File length: 21189 byte(s)
Diff to previous 694
Avoid non-standard eval (makes codetools happier).

Revision 694 - (view) (download) (annotate) - [select for diffs]
Modified Sun Dec 31 14:47:46 2006 UTC (11 years, 9 months ago) by feinerer
Original Path: trunk/tm/R/textdoccol.R
File length: 21185 byte(s)
Diff to previous 693
Implemented improvements based upon comments by David.

Revision 693 - (view) (download) (annotate) - [select for diffs]
Modified Fri Dec 22 13:21:30 2006 UTC (11 years, 10 months ago) by feinerer
Original Path: trunk/tm/R/textdoccol.R
File length: 21246 byte(s)
Diff to previous 690
Renamed textmin to tm directory since the package name changed.

Revision 690 - (view) (download) (annotate) - [select for diffs]
Modified Sat Dec 16 17:22:56 2006 UTC (11 years, 10 months ago) by feinerer
Original Path: trunk/textmin/R/textdoccol.R
File length: 21246 byte(s)
Diff to previous 689
Renamed package to 'tm'. Updated documentation (man) for CRAN release.

Revision 689 - (view) (download) (annotate) - [select for diffs]
Modified Fri Dec 8 14:21:46 2006 UTC (11 years, 10 months ago) by feinerer
Original Path: trunk/textmin/R/textdoccol.R
File length: 21402 byte(s)
Diff to previous 78
Implemented changes as proposed at the Forschungsklausur on 01.12.2006.

Revision 78 - (view) (download) (annotate) - [select for diffs]
Modified Wed Nov 29 14:56:36 2006 UTC (11 years, 10 months ago) by zeileis
Original Path: trunk/textmin/R/textdoccol.R
File length: 32515 byte(s)
Diff to previous 77
removed old repos structure, now only R packages

Revision 77 - (view) (download) (annotate) - [select for diffs]
Modified Sun Nov 26 13:32:16 2006 UTC (11 years, 10 months ago) by feinerer
Original Path: trunk/R/textmin/R/textdoccol.R
File length: 32515 byte(s)
Diff to previous 76
See ChangeLog.

Revision 76 - (view) (download) (annotate) - [select for diffs]
Modified Thu Nov 23 16:29:02 2006 UTC (11 years, 11 months ago) by feinerer
Original Path: trunk/R/textmin/R/textdoccol.R
File length: 32582 byte(s)
Diff to previous 75
Various bug fixes. Data and vignette update.

Revision 75 - (view) (download) (annotate) - [select for diffs]
Modified Wed Nov 22 14:37:18 2006 UTC (11 years, 11 months ago) by feinerer
Original Path: trunk/R/textmin/R/textdoccol.R
File length: 32532 byte(s)
Diff to previous 74
Improved s_filter and prescind_meta.

Revision 74 - (view) (download) (annotate) - [select for diffs]
Modified Tue Nov 21 20:04:17 2006 UTC (11 years, 11 months ago) by feinerer
Original Path: trunk/R/textmin/R/textdoccol.R
File length: 32028 byte(s)
Diff to previous 73
Text documents' slot metadata is now accessible in s_filter.

Revision 73 - (view) (download) (annotate) - [select for diffs]
Modified Tue Nov 21 15:52:39 2006 UTC (11 years, 11 months ago) by feinerer
Original Path: trunk/R/textmin/R/textdoccol.R
File length: 31354 byte(s)
Diff to previous 72
Rewrote s_filter function.

Revision 72 - (view) (download) (annotate) - [select for diffs]
Modified Mon Nov 20 20:43:34 2006 UTC (11 years, 11 months ago) by feinerer
Original Path: trunk/R/textmin/R/textdoccol.R
File length: 28834 byte(s)
Diff to previous 71
Added update mechanism for document collections. Various fixes for metadata handling.

Revision 71 - (view) (download) (annotate) - [select for diffs]
Modified Sun Nov 19 17:30:26 2006 UTC (11 years, 11 months ago) by feinerer
Original Path: trunk/R/textmin/R/textdoccol.R
File length: 27454 byte(s)
Diff to previous 70
Added sophisticated merging for document collections.

Revision 70 - (view) (download) (annotate) - [select for diffs]
Modified Tue Nov 7 18:18:51 2006 UTC (11 years, 11 months ago) by feinerer
Original Path: trunk/R/textmin/R/textdoccol.R
File length: 24227 byte(s)
Diff to previous 69
Messages now use \code{ngettext}.

Revision 69 - (view) (download) (annotate) - [select for diffs]
Modified Fri Nov 3 10:50:39 2006 UTC (11 years, 11 months ago) by feinerer
Original Path: trunk/R/textmin/R/textdoccol.R
File length: 24061 byte(s)
Diff to previous 68
Added functions for modifying and removing metadata.

Revision 68 - (view) (download) (annotate) - [select for diffs]
Modified Thu Nov 2 14:06:42 2006 UTC (11 years, 11 months ago) by feinerer
Original Path: trunk/R/textmin/R/textdoccol.R
File length: 23404 byte(s)
Diff to previous 67
Wrote vignette.

Revision 67 - (view) (download) (annotate) - [select for diffs]
Modified Wed Nov 1 17:29:59 2006 UTC (11 years, 11 months ago) by feinerer
Original Path: trunk/R/textmin/R/textdoccol.R
File length: 23332 byte(s)
Diff to previous 66
See ChangeLog

Revision 66 - (view) (download) (annotate) - [select for diffs]
Modified Tue Oct 31 22:03:33 2006 UTC (11 years, 11 months ago) by feinerer
Original Path: trunk/R/textmin/R/textdoccol.R
File length: 21924 byte(s)
Diff to previous 65
See ChangeLog.

Revision 65 - (view) (download) (annotate) - [select for diffs]
Modified Tue Oct 31 17:10:24 2006 UTC (11 years, 11 months ago) by feinerer
Original Path: trunk/R/textmin/R/textdoccol.R
File length: 22035 byte(s)
Diff to previous 64


Revision 64 - (view) (download) (annotate) - [select for diffs]
Modified Sun Oct 29 14:29:43 2006 UTC (11 years, 11 months ago) by feinerer
Original Path: trunk/R/textmin/R/textdoccol.R
File length: 20004 byte(s)
Diff to previous 63
Corrected NAMESPACE. Some minor improvements in the code.

Revision 63 - (view) (download) (annotate) - [select for diffs]
Modified Thu Oct 26 14:59:09 2006 UTC (11 years, 11 months ago) by feinerer
Original Path: trunk/R/textmin/R/textdoccol.R
File length: 20004 byte(s)
Diff to previous 62
See ChangeLog.

Revision 62 - (view) (download) (annotate) - [select for diffs]
Modified Tue Oct 24 10:08:58 2006 UTC (12 years ago) by feinerer
Original Path: trunk/R/textmin/R/textdoccol.R
File length: 14363 byte(s)
Diff to previous 61
See ChangeLog.

Revision 61 - (view) (download) (annotate) - [select for diffs]
Modified Mon Oct 23 20:07:05 2006 UTC (12 years ago) by feinerer
Original Path: trunk/R/textmin/R/textdoccol.R
File length: 14099 byte(s)
Diff to previous 60
See ChangeLog.

Revision 60 - (view) (download) (annotate) - [select for diffs]
Modified Sun Oct 22 17:57:47 2006 UTC (12 years ago) by feinerer
Original Path: trunk/R/textmin/R/textdoccol.R
File length: 13019 byte(s)
Diff to previous 57
See ChangeLog.

Revision 57 - (view) (download) (annotate) - [select for diffs]
Modified Sun Sep 24 14:27:54 2006 UTC (12 years, 1 month ago) by feinerer
Original Path: trunk/R/textmin/R/textdoccol.R
File length: 19765 byte(s)
Diff to previous 56
Eliminated tm_filter bug.

Revision 56 - (view) (download) (annotate) - [select for diffs]
Modified Sun Sep 24 14:12:28 2006 UTC (12 years, 1 month ago) by feinerer
Original Path: trunk/R/textmin/R/textdoccol.R
File length: 19884 byte(s)
Diff to previous 55
See ChangeLog.

Revision 55 - (view) (download) (annotate) - [select for diffs]
Modified Thu Sep 14 12:31:07 2006 UTC (12 years, 1 month ago) by feinerer
Original Path: trunk/R/textmin/R/textdoccol.R
File length: 16601 byte(s)
Diff to previous 54
Minor improvements.

Revision 54 - (view) (download) (annotate) - [select for diffs]
Modified Wed Sep 13 09:08:20 2006 UTC (12 years, 1 month ago) by feinerer
Original Path: trunk/R/textmin/R/textdoccol.R
File length: 16258 byte(s)
Diff to previous 53
length, show and summary functions. Renamed transfromXXX and filterXXX to tm_transfrom and tm_filter.

Revision 53 - (view) (download) (annotate) - [select for diffs]
Modified Thu Aug 24 13:06:50 2006 UTC (12 years, 2 months ago) by feinerer
Original Path: trunk/R/textmin/R/textdoccol.R
File length: 15303 byte(s)
Diff to previous 52
See ChangeLog for changes.

Revision 52 - (view) (download) (annotate) - [select for diffs]
Modified Sat Aug 12 12:43:39 2006 UTC (12 years, 2 months ago) by feinerer
Original Path: trunk/R/textmin/R/textdoccol.R
File length: 13701 byte(s)
Diff to previous 51
Various updates. See ChangeLog and diff source code.

Revision 51 - (view) (download) (annotate) - [select for diffs]
Modified Mon Aug 7 12:14:09 2006 UTC (12 years, 2 months ago) by feinerer
Original Path: trunk/R/textmin/R/textdoccol.R
File length: 12942 byte(s)
Diff to previous 49
Various changes due to new layout.

Revision 49 - (view) (download) (annotate) - [select for diffs]
Modified Sun Aug 6 10:12:13 2006 UTC (12 years, 2 months ago) by feinerer
Original Path: trunk/R/textmin/R/textdoccol.R
File length: 13212 byte(s)
Diff to previous 48
Improved design. See ChangLog for details.

Revision 48 - (view) (download) (annotate) - [select for diffs]
Modified Thu Jul 13 13:47:31 2006 UTC (12 years, 3 months ago) by feinerer
Original Path: trunk/R/textmin/R/textdoccol.R
File length: 8671 byte(s)
Diff to previous 47
Clean up of old stuff.

Revision 47 - (view) (download) (annotate) - [select for diffs]
Modified Mon Jul 10 12:22:35 2006 UTC (12 years, 3 months ago) by feinerer
Original Path: trunk/R/textmin/R/textdoccol.R
File length: 8747 byte(s)
Diff to previous 46
Renamed tm to textmin directory.

Revision 46 - (view) (download) (annotate) - [select for diffs]
Modified Wed Jul 5 18:08:41 2006 UTC (12 years, 3 months ago) by meyer
Original Path: trunk/R/tm/R/textdoccol.R
File length: 8747 byte(s)
Diff to previous 45
move


Revision 45 - (view) (download) (annotate) - [select for diffs]
Modified Wed Jul 5 17:27:29 2006 UTC (12 years, 3 months ago) by meyer
Original Path: trunk/R/trunk/tm/R/textdoccol.R
File length: 8747 byte(s)
Diff to previous 42
move in subdir


Revision 42 - (view) (download) (annotate) - [select for diffs]
Modified Sat Jul 1 08:42:26 2006 UTC (12 years, 3 months ago) by feinerer
Original Path: trunk/R/trunk/R/textdoccol.R
File length: 8747 byte(s)
Diff to previous 41
Changed S4 method signatures.

Revision 41 - (view) (download) (annotate) - [select for diffs]
Modified Sun Mar 12 17:14:15 2006 UTC (12 years, 7 months ago) by feinerer
Original Path: trunk/R/trunk/R/textdoccol.R
File length: 8665 byte(s)
Diff to previous 40
Automatic RIS import implemented

Revision 40 - (view) (download) (annotate) - [select for diffs]
Modified Tue Feb 14 15:02:45 2006 UTC (12 years, 8 months ago) by feinerer
Original Path: trunk/R/trunk/R/textdoccol.R
File length: 8025 byte(s)
Diff to previous 39
See ChangeLog

Revision 39 - (view) (download) (annotate) - [select for diffs]
Modified Sat Jan 21 09:37:39 2006 UTC (12 years, 9 months ago) by feinerer
Original Path: trunk/R/trunk/R/textdoccol.R
File length: 6439 byte(s)
Diff to previous 37
Removed bug

Revision 37 - (view) (download) (annotate) - [select for diffs]
Modified Wed Jan 11 17:49:17 2006 UTC (12 years, 9 months ago) by feinerer
Original Path: trunk/R/trunk/R/textdoccol.R
File length: 6022 byte(s)
Diff to previous 36


Revision 36 - (view) (download) (annotate) - [select for diffs]
Modified Wed Jan 11 15:42:56 2006 UTC (12 years, 9 months ago) by feinerer
Original Path: trunk/R/trunk/R/textdoccol.R
File length: 5143 byte(s)
Diff to previous 33
See ChangeLog

Revision 33 - (view) (download) (annotate) - [select for diffs]
Modified Thu Dec 15 13:29:17 2005 UTC (12 years, 10 months ago) by feinerer
Original Path: trunk/R/trunk/R/textdoccol.R
File length: 4995 byte(s)
Diff to previous 32
See ChangeLog

Revision 32 - (view) (download) (annotate) - [select for diffs]
Modified Thu Dec 15 13:13:54 2005 UTC (12 years, 10 months ago) by feinerer
Original Path: trunk/R/trunk/R/textdoccol.R
File length: 5115 byte(s)
Diff to previous 26


Revision 26 - (view) (download) (annotate) - [select for diffs]
Modified Sat Dec 3 15:20:17 2005 UTC (12 years, 10 months ago) by feinerer
Original Path: trunk/R/trunk/R/textdoccol.R
File length: 5876 byte(s)
Diff to previous 24
See ChangeLog

Revision 24 - (view) (download) (annotate) - [select for diffs]
Modified Sun Nov 20 15:31:34 2005 UTC (12 years, 11 months ago) by feinerer
Original Path: trunk/R/trunk/R/textdoccol.R
File length: 5874 byte(s)
Diff to previous 23


Revision 23 - (view) (download) (annotate) - [select for diffs]
Modified Sat Nov 19 18:25:41 2005 UTC (12 years, 11 months ago) by feinerer
Original Path: trunk/R/trunk/R/textdoccol.R
File length: 5451 byte(s)
Diff to previous 22
Enabled import of files in Reuters-21578 XML format

Revision 22 - (view) (download) (annotate) - [select for diffs]
Modified Sat Nov 19 16:58:34 2005 UTC (12 years, 11 months ago) by feinerer
Original Path: trunk/R/trunk/R/textdoccol.R
File length: 3655 byte(s)
Diff to previous 21
See ChangeLog

Revision 21 - (view) (download) (annotate) - [select for diffs]
Modified Sat Nov 19 10:23:19 2005 UTC (12 years, 11 months ago) by feinerer
Original Path: trunk/R/trunk/R/textdoccol.R
File length: 2120 byte(s)
Diff to previous 20
See ChangeLog

Revision 20 - (view) (download) (annotate) - [select for diffs]
Modified Tue Nov 8 16:40:52 2005 UTC (12 years, 11 months ago) by feinerer
Original Path: trunk/R/trunk/R/textdoccol.R
File length: 1714 byte(s)
Diff to previous 19
See ChangeLog

Revision 19 - (view) (download) (annotate) - [select for diffs]
Modified Sun Nov 6 15:38:48 2005 UTC (12 years, 11 months ago) by feinerer
Original Path: trunk/R/trunk/R/textdoccol.R
File length: 1526 byte(s)
Diff to previous 18


Revision 18 - (view) (download) (annotate) - [select for diffs]
Modified Sat Nov 5 19:00:05 2005 UTC (12 years, 11 months ago) by feinerer
Original Path: trunk/R/trunk/R/textdoccol.R
File length: 1500 byte(s)
Diff to previous 17


Revision 17 - (view) (download) (annotate) - [select for diffs]
Added Sat Nov 5 14:47:12 2005 UTC (12 years, 11 months ago) by feinerer
Original Path: trunk/R/trunk/R/textdoccol.R
File length: 1442 byte(s)
For infos see ChangeLog.

This form allows you to request diffs between any two revisions of this file. For each of the two "sides" of the diff, enter a numeric revision.

  Diffs between and
  Type of Diff should be a

Sort log by:

R-Forge@R-project.org
ViewVC Help
Powered by ViewVC 1.0.0  
Thanks to:
Vienna University of Economics and Business University of Wisconsin - Madison Powered By FusionForge