[tm] Log of /pkg/NAMESPACE

Revision 1481
Modified Sat May 20 10:28:00 2017 UTC (16 months ago) by feinerer
File length: 8070 byte(s)
Diff to previous 1467
Support TIF for DataframeSource

See Text Interchange Formats (TIF, https://github.com/ropensci/tif) and
readtext (https://github.com/kbenoit/readtext).

Revision 1467
Modified Sun Jan 22 18:06:19 2017 UTC (19 months, 4 weeks ago) by khornik
File length: 8011 byte(s)
Diff to previous 1462
Register native routines.

Revision 1462
Modified Sun Jan 15 14:42:48 2017 UTC (20 months, 1 week ago) by khornik
File length: 7999 byte(s)
Diff to previous 1461
Provide and use new flexible parallelization framework.

Revision 1461
Modified Sat Jan 14 14:51:22 2017 UTC (20 months, 1 week ago) by feinerer
File length: 7945 byte(s)
Diff to previous 1448
Implement [ and [[ for selected sources

Neither [ nor [[ is considered part of the API; both are provided as a
convenience. Moreover, supporting them is considered good practice, as sources
typically report a length.

Revision 1448
Modified Mon Dec 12 10:52:47 2016 UTC (21 months, 1 week ago) by khornik
File length: 7707 byte(s)
Diff to previous 1447
Improve namespace.

Revision 1447
Modified Mon Dec 12 08:56:14 2016 UTC (21 months, 1 week ago) by khornik
File length: 7435 byte(s)
Diff to previous 1446
Add findMostFreqTerms().

Revision 1446
Modified Wed Nov 2 15:41:49 2016 UTC (22 months, 2 weeks ago) by feinerer
File length: 7255 byte(s)
Diff to previous 1445
Revive parallel::mclapply()

Experiments show that, with the right hardware, mclapply() gives you measurable
performance gains. So reenable it --- despite substantial drawbacks (RAM and
CPU overhead) in some scenarios.

Revision 1445
Modified Sun Oct 9 09:30:58 2016 UTC (23 months, 1 week ago) by feinerer
File length: 7220 byte(s)
Diff to previous 1438
Speed up termFreq(), general cleanup

- Avoid parallel::mclapply()
- Use custom .table()
- Use rep.int(), rep_len() and lengths()
- Fix typos
- Shorten overlong lines
- Consistent formatting

Revision 1438
Modified Sat Jul 16 18:32:59 2016 UTC (2 years, 2 months ago) by feinerer
File length: 7255 byte(s)
Diff to previous 1437
Use Rcpp for efficient term-document matrix construction from a SimpleCorpus

Revision 1437
Modified Wed Jul 13 19:23:49 2016 UTC (2 years, 2 months ago) by feinerer
File length: 7225 byte(s)
Diff to previous 1435
Add SimpleCorpus

SimpleCorpus provides a corpus which is optimized for the most common usage
scenario: importing plain texts from files in a directory or directly from a
vector in R, preprocessing and transforming the texts, and finally exporting
them to a term-document matrix. The aim is to boost performance and minimize
memory pressure. It loads all documents into memory, and is designed for
medium-sized to large data sets.

Revision 1435
Modified Wed Nov 18 09:53:21 2015 UTC (2 years, 10 months ago) by feinerer
File length: 6619 byte(s)
Diff to previous 1431
Provide inspect.TextDocument() as shorthand for writeLines(as.character())

Revision 1431
Modified Wed Jul 1 10:32:42 2015 UTC (3 years, 2 months ago) by khornik
File length: 6583 byte(s)
Diff to previous 1423
Improve namespace.

Revision 1423
Modified Mon May 4 19:37:55 2015 UTC (3 years, 4 months ago) by feinerer
File length: 6495 byte(s)
Diff to previous 1420
inspect() is not part of the TextDocument API

Use as.character() and content() to access the document instead.

Revision 1420
Modified Mon May 4 19:04:00 2015 UTC (3 years, 4 months ago) by feinerer
File length: 6536 byte(s)
Diff to previous 1419
Accept NLP::Span_Tokenizer

Revision 1419
Modified Sat May 2 17:23:47 2015 UTC (3 years, 4 months ago) by feinerer
File length: 6555 byte(s)
Diff to previous 1417
Sync format()/print() with NLP

Revision 1417
Modified Tue Apr 28 18:02:42 2015 UTC (3 years, 4 months ago) by feinerer
File length: 6300 byte(s)
Diff to previous 1415
Mark scan_tokenizer() and MC_tokenizer() as NLP::Token_Tokenizer

Revision 1415
Modified Sat Apr 4 08:54:44 2015 UTC (3 years, 5 months ago) by feinerer
File length: 6281 byte(s)
Diff to previous 1409
Replace meta.TextDocument() with implementations for subclasses

Revision 1409
Modified Fri Feb 27 16:10:18 2015 UTC (3 years, 6 months ago) by feinerer
File length: 6197 byte(s)
Diff to previous 1408
Add as.VCorpus.list()

Revision 1408
Modified Mon Feb 23 20:55:55 2015 UTC (3 years, 6 months ago) by feinerer
File length: 6165 byte(s)
Diff to previous 1406
Fix typos, extend NAMESPACE

Revision 1406
Modified Mon Feb 23 17:21:49 2015 UTC (3 years, 6 months ago) by feinerer
File length: 6016 byte(s)
Diff to previous 1397
Add readTagged(): a reader for text documents containing POS-tagged words

Revision 1397
Modified Fri Sep 12 19:30:27 2014 UTC (4 years ago) by feinerer
File length: 5995 byte(s)
Diff to previous 1379
Add open() and close() for sources

Useful for sources with complex or expensive setup, e.g., database connections
or file handles.

Revision 1379
Modified Tue May 27 17:55:29 2014 UTC (4 years, 3 months ago) by feinerer
File length: 5928 byte(s)
Diff to previous 1377
Provide names<-() for VCorpus and PCorpus

Revision 1377
Modified Wed May 21 17:15:56 2014 UTC (4 years, 4 months ago) by feinerer
File length: 5866 byte(s)
Diff to previous 1376
Provide names() for corpora

Revision 1376
Modified Wed May 21 14:36:35 2014 UTC (4 years, 4 months ago) by feinerer
File length: 5808 byte(s)
Diff to previous 1373
Remove names() from Source API

Revision 1373
Modified Thu May 15 15:33:37 2014 UTC (4 years, 4 months ago) by feinerer
File length: 5842 byte(s)
Diff to previous 1363
Keep FunctionGenerator() (suggested by Kurt)

Revision 1363
Modified Mon Apr 28 09:49:46 2014 UTC (4 years, 4 months ago) by feinerer
File length: 5814 byte(s)
Diff to previous 1358
Keep Corpus() alias

Revision 1358
Modified Thu Apr 24 07:43:38 2014 UTC (4 years, 5 months ago) by feinerer
File length: 5797 byte(s)
Diff to previous 1350
Document content_transformer()

Revision 1350
Modified Tue Apr 22 07:41:14 2014 UTC (4 years, 5 months ago) by feinerer
File length: 5784 byte(s)
Diff to previous 1349
Reorder, fix typo

Revision 1349
Modified Tue Apr 22 07:13:40 2014 UTC (4 years, 5 months ago) by feinerer
File length: 5784 byte(s)
Diff to previous 1348
Export as.VCorpus.VCorpus()

Revision 1348
Modified Tue Apr 22 07:09:41 2014 UTC (4 years, 5 months ago) by feinerer
File length: 5750 byte(s)
Diff to previous 1336
Provide as.VCorpus() generic

Revision 1336
Modified Sat Apr 19 08:59:39 2014 UTC (4 years, 5 months ago) by feinerer
File length: 5731 byte(s)
Diff to previous 1333
Implement and describe Source API

Revision 1333
Modified Fri Apr 18 10:38:46 2014 UTC (4 years, 5 months ago) by feinerer
File length: 5599 byte(s)
Diff to previous 1332
Update Corpus documentation

Revision 1332
Modified Fri Apr 18 09:00:55 2014 UTC (4 years, 5 months ago) by feinerer
File length: 5635 byte(s)
Diff to previous 1322
Update TextDocument documentation

Revision 1322
Modified Thu Apr 10 12:39:01 2014 UTC (4 years, 5 months ago) by feinerer
File length: 5520 byte(s)
Diff to previous 1320
Fix problem if no completions are found, remove PlainTextDocument method

Revision 1320
Modified Sun Apr 6 07:05:45 2014 UTC (4 years, 5 months ago) by feinerer
File length: 5608 byte(s)
Diff to previous 1319
Use words() as default tokenizer in termFreq()

Revision 1319
Modified Wed Apr 2 18:03:37 2014 UTC (4 years, 5 months ago) by feinerer
File length: 5562 byte(s)
Diff to previous 1317
Provide words.PlainTextDocument(), clean NAMESPACE

Revision 1317
Modified Mon Mar 31 14:51:31 2014 UTC (4 years, 5 months ago) by feinerer
File length: 5744 byte(s)
Diff to previous 1316
Do not export FunctionGenerator

Revision 1316
Modified Mon Mar 31 14:41:41 2014 UTC (4 years, 5 months ago) by feinerer
File length: 5772 byte(s)
Diff to previous 1315
Remove dissimilarity() (a trivial wrapper around proxy::dist())

Revision 1315
Modified Mon Mar 31 08:38:05 2014 UTC (4 years, 5 months ago) by feinerer
File length: 5796 byte(s)
Diff to previous 1313
Simplify tm_map, tm_filter, and tm_index; remove makeChunks; rework lazy maps

Revision 1313
Modified Sun Mar 30 09:28:00 2014 UTC (4 years, 5 months ago) by feinerer
File length: 5750 byte(s)
Diff to previous 1312
content() and as.list() now give the full documents

Revision 1312
Modified Sat Mar 29 09:35:44 2014 UTC (4 years, 5 months ago) by feinerer
File length: 5606 byte(s)
Diff to previous 1310
Simplify corpus metadata and PCorpus metadata storage

Revision 1310
Modified Wed Mar 26 19:23:13 2014 UTC (4 years, 5 months ago) by feinerer
File length: 5666 byte(s)
Diff to previous 1309
Remove text repository, various improvements and bug fixes

Revision 1309
Modified Wed Mar 26 09:15:04 2014 UTC (4 years, 5 months ago) by feinerer
File length: 5782 byte(s)
Diff to previous 1307
Move content and meta generics to package NLP

Revision 1307
Modified Tue Mar 25 12:15:51 2014 UTC (4 years, 5 months ago) by feinerer
File length: 5852 byte(s)
Diff to previous 1302
Redesign corpora

Revision 1302
Modified Mon Mar 24 11:55:16 2014 UTC (4 years, 6 months ago) by feinerer
File length: 6015 byte(s)
Diff to previous 1300
Improve and simplify meta data management

Revision 1300
Modified Fri Mar 21 14:30:05 2014 UTC (4 years, 6 months ago) by feinerer
File length: 6082 byte(s)
Diff to previous 1299
Redesign text documents

This is a major change and causes fallout. Soon to be fixed ...

Revision 1299
Modified Fri Mar 21 09:45:14 2014 UTC (4 years, 6 months ago) by feinerer
File length: 6371 byte(s)
Diff to previous 1297
Use setNames() instead of structure(..., names)

Revision 1297
Modified Thu Mar 20 18:43:22 2014 UTC (4 years, 6 months ago) by feinerer
File length: 6339 byte(s)
Diff to previous 1295
Redesign sources

Revision 1295
Modified Tue Feb 25 10:54:41 2014 UTC (4 years, 6 months ago) by feinerer
File length: 6341 byte(s)
Diff to previous 1277
Export pGetElem.URISource

Revision 1277
Modified Sun Jan 5 16:32:40 2014 UTC (4 years, 8 months ago) by feinerer
File length: 6307 byte(s)
Diff to previous 1274
Remove Dictionary class and functions

At the moment a Dictionary has no added value compared to a simple character
vector. We might want to reconsider dictionaries in the context of NLP but with
more functionality later on.

Revision 1274
Modified Sun Jan 5 10:51:18 2014 UTC (4 years, 8 months ago) by feinerer
File length: 6454 byte(s)
Diff to previous 1268
More sanity checks

Revision 1268
Modified Wed Dec 18 16:37:48 2013 UTC (4 years, 9 months ago) by feinerer
File length: 6434 byte(s)
Diff to previous 1261
Show label for single result item, do not export findAssocs.matrix()

Revision 1261
Modified Fri Sep 27 09:37:35 2013 UTC (4 years, 11 months ago) by feinerer
File length: 6468 byte(s)
Diff to previous 1260
Allow multiple URIs for URISource, default to vectorized sources, simplify eoi()

Revision 1260
Modified Sat Sep 21 09:10:20 2013 UTC (5 years ago) by feinerer
File length: 6596 byte(s)
Diff to previous 1258
Move preprocessReut21578XML() to package tm.corpus.Reuters21578

Revision 1258
Modified Fri Sep 20 12:15:42 2013 UTC (5 years ago) by feinerer
File length: 6629 byte(s)
Diff to previous 1257
Remove GmaneSource() and readGmane(), simplify readers, improve documentation

Revision 1257
Modified Thu Sep 19 10:48:07 2013 UTC (5 years ago) by feinerer
File length: 6671 byte(s)
Diff to previous 1255
Export Source constructor, extend documentation

Revision 1255
Modified Wed Sep 11 07:30:06 2013 UTC (5 years ago) by feinerer
File length: 6654 byte(s)
Diff to previous 1253
Rename tm_tag_score() to tm_term_score()

Revision 1253
Modified Fri Aug 30 10:03:09 2013 UTC (5 years ago) by feinerer
File length: 6649 byte(s)
Diff to previous 1249
Remove getFilters(), searchFullText(), and tm_intersect() (use grep() instead)

Revision 1249
Modified Tue Aug 20 18:42:11 2013 UTC (5 years, 1 month ago) by feinerer
File length: 6812 byte(s)
Diff to previous 1229
Export read_dtm_MC()

Revision 1229
Modified Wed Jun 19 09:05:59 2013 UTC (5 years, 3 months ago) by feinerer
File length: 6790 byte(s)
Diff to previous 1227
Import parallel

Revision 1227
Modified Sun Jun 16 08:37:10 2013 UTC (5 years, 3 months ago) by feinerer
File length: 6773 byte(s)
Diff to previous 1210
Use package parallel instead of Rmpi and snow

Revision 1210
Modified Tue Jan 22 18:40:48 2013 UTC (5 years, 8 months ago) by khornik
File length: 6824 byte(s)
Diff to previous 1207
Make nDocs()/nTerms() and Docs()/Terms() generic with methods for TDMs
and DTMs, and ensure that Docs()/Terms() returns a character vector with
length the number of documents and terms, respectively.

Revision 1207
Modified Sat Jan 12 15:27:20 2013 UTC (5 years, 8 months ago) by khornik
File length: 6503 byte(s)
Diff to previous 1206
Add as.DocumentTermMatrix() method for textcnt objects.

Revision 1206
Modified Fri Jan 11 20:15:37 2013 UTC (5 years, 8 months ago) by khornik
File length: 6406 byte(s)
Diff to previous 1202
Add TermDocumentMatrix() method for textcnt objects.

Revision 1202
Modified Fri Jan 11 19:43:35 2013 UTC (5 years, 8 months ago) by khornik
File length: 6361 byte(s)
Diff to previous 1163
Add tm_tag_score() method for DocumentTermMatrix objects.

Revision 1163
Modified Wed Dec 7 08:27:47 2011 UTC (6 years, 9 months ago) by feinerer
File length: 6314 byte(s)
Diff to previous 1162
Export stripWhitespace.character() function

Revision 1162
Modified Wed Dec 7 06:36:02 2011 UTC (6 years, 9 months ago) by feinerer
File length: 6273 byte(s)
Diff to previous 1158
Fix argument pass over

Revision 1158
Modified Thu Nov 24 06:30:26 2011 UTC (6 years, 10 months ago) by feinerer
File length: 6154 byte(s)
Diff to previous 1150
Export 'Content<-.default' in NAMESPACE

Revision 1150
Modified Tue Nov 15 15:37:17 2011 UTC (6 years, 10 months ago) by feinerer
File length: 6120 byte(s)
Diff to previous 1149
Document MC_tokenizer(), scan_tokenizer(), and getTokenizers()

Revision 1149
Modified Fri Nov 4 15:48:50 2011 UTC (6 years, 10 months ago) by feinerer
File length: 6048 byte(s)
Diff to previous 1145
Export and document c.term_frequency() and as.TermDocumentMatrix.term_frequency()

Revision 1145
Modified Tue Aug 30 18:05:19 2011 UTC (7 years ago) by feinerer
File length: 5963 byte(s)
Diff to previous 1136
Add class label for term frequencies and corresponding c() and as.TermDocumentMatrix() implementation

Revision 1136
Modified Fri May 27 11:50:39 2011 UTC (7 years, 3 months ago) by feinerer
File length: 5956 byte(s)
Diff to previous 1135
Improve SMART weighting (still buggy)

Revision 1135
Modified Fri Apr 15 06:18:54 2011 UTC (7 years, 5 months ago) by khornik
File length: 5957 byte(s)
Diff to previous 1128
Export and document Blei et al. reader.

Revision 1128
Modified Fri Apr 8 17:36:10 2011 UTC (7 years, 5 months ago) by khornik
File length: 5836 byte(s)
Diff to previous 1120
Add functionality for obtaining DTMs and TDMs from t/f matrices coercible
to simple triplet matrices.

Revision 1120
Modified Mon Feb 7 16:48:53 2011 UTC (7 years, 7 months ago) by feinerer
File length: 5420 byte(s)
Diff to previous 1119
Comment out words() for the moment.

Revision 1119
Modified Mon Feb 7 10:08:36 2011 UTC (7 years, 7 months ago) by feinerer
File length: 5418 byte(s)
Diff to previous 1113
Comment out words() for the moment.

Revision 1113
Modified Thu Nov 11 15:22:22 2010 UTC (7 years, 10 months ago) by feinerer
File length: 5417 byte(s)
Diff to previous 1087
First draft of words()

Revision 1087
Modified Sun Aug 15 09:19:35 2010 UTC (8 years, 1 month ago) by khornik
File length: 5334 byte(s)
Diff to previous 1076
Complete c1084.

Revision 1076
Modified Wed Jun 2 17:54:51 2010 UTC (8 years, 3 months ago) by feinerer
File length: 5358 byte(s)
Diff to previous 1069
Export Zipf_plot() and Heaps_plot()

Revision 1069
Modified Wed May 5 10:38:26 2010 UTC (8 years, 4 months ago) by feinerer
File length: 5317 byte(s)
Diff to previous 1068
Added documentation for content_meta()

Revision 1068
Modified Wed May 5 10:09:47 2010 UTC (8 years, 4 months ago) by feinerer
File length: 5319 byte(s)
Diff to previous 1062
Improve stem completion.

Revision 1062
Modified Wed Apr 7 17:25:20 2010 UTC (8 years, 5 months ago) by feinerer
File length: 5231 byte(s)
Diff to previous 1055
content_or_meta utility function

Revision 1055
Modified Mon Mar 15 15:55:02 2010 UTC (8 years, 6 months ago) by feinerer
File length: 5203 byte(s)
Diff to previous 1050
First attempt for weightings using SMART notation.

Revision 1050
Modified Wed Mar 3 11:26:06 2010 UTC (8 years, 6 months ago) by feinerer
File length: 5181 byte(s)
Diff to previous 1046
Add tm_tag_score method for term-document matrices.

Revision 1046
Modified Fri Feb 26 12:45:38 2010 UTC (8 years, 6 months ago) by feinerer
File length: 5134 byte(s)
Diff to previous 1039
Update documentation.

Revision 1039
Modified Fri Jan 22 13:01:33 2010 UTC (8 years, 8 months ago) by feinerer
File length: 5028 byte(s)
Diff to previous 1035
Add stemDocument.character().

Revision 1035
Modified Thu Jan 14 08:59:43 2010 UTC (8 years, 8 months ago) by feinerer
File length: 4990 byte(s)
Diff to previous 1022
Add readRCV1asPlain reader.

Revision 1022
Modified Thu Nov 19 21:33:19 2009 UTC (8 years, 10 months ago) by feinerer
File length: 4964 byte(s)
Diff to previous 1018
Added a combine method for merging multiple term-document matrices.

Revision 1018
Modified Sun Nov 15 15:53:49 2009 UTC (8 years, 10 months ago) by feinerer
File length: 4928 byte(s)
Diff to previous 988
Fix bug in removeWords(). Refactoring of term-document matrix constructor. Clean up of defunct functions.

Revision 988
Modified Fri Sep 4 12:27:12 2009 UTC (9 years ago) by feinerer
File length: 4992 byte(s)
Diff to previous 987
Update documentation.

Revision 987
Modified Wed Sep 2 17:54:45 2009 UTC (9 years ago) by feinerer
File length: 5026 byte(s)
Diff to previous 986
Update documentation.

Revision 986
Modified Tue Sep 1 15:33:30 2009 UTC (9 years ago) by feinerer
File length: 5100 byte(s)
Diff to previous 985
Further changes due to S3 class system.

Revision 985
Modified Thu Aug 27 18:09:05 2009 UTC (9 years ago) by feinerer
File length: 5165 byte(s)
Diff to previous 984
Use S3 instead of S4 class system.

Revision 984
Modified Fri Aug 14 16:32:35 2009 UTC (9 years, 1 month ago) by feinerer
File length: 4666 byte(s)
Diff to previous 982
Remove obsolete appendElem() method (use c() instead).

Revision 982
Modified Tue Aug 11 07:48:04 2009 UTC (9 years, 1 month ago) by feinerer
File length: 4694 byte(s)
Diff to previous 981
Moved readMail and MailDocument class from tm to tm.plugin.mail.

Revision 981
Modified Fri Aug 7 09:04:37 2009 UTC (9 years, 1 month ago) by feinerer
File length: 4743 byte(s)
Diff to previous 976
Factor out mail handling functionality to tm.plugin.mail package.

Revision 976
Modified Thu Jul 9 09:23:39 2009 UTC (9 years, 2 months ago) by feinerer
File length: 4776 byte(s)
Diff to previous 973
Conversion to UTF-8 encoding.

Revision 973
Modified Sat Jul 4 08:10:25 2009 UTC (9 years, 2 months ago) by feinerer
File length: 4752 byte(s)
Diff to previous 972
Rename readNewsgroup to readMail.

Revision 972
Modified Fri Jul 3 16:16:59 2009 UTC (9 years, 2 months ago) by feinerer
File length: 4762 byte(s)
Diff to previous 963
Move removeCitation, removeMultipart, and removeSignature to the tau package.

Revision 963
Modified Mon Jun 29 07:01:19 2009 UTC (9 years, 2 months ago) by feinerer
File length: 4860 byte(s)
Diff to previous 962
Rename SCorpus to VCorpus (Volatile Corpus).

Revision 962
Modified Sun Jun 28 15:52:33 2009 UTC (9 years, 2 months ago) by feinerer
File length: 4860 byte(s)
Diff to previous 961
Fix documentation.

Revision 961
Modified Sat Jun 27 08:33:14 2009 UTC (9 years, 2 months ago) by feinerer
File length: 4841 byte(s)
Diff to previous 960
Export readReut21578XMLasPlain.

Revision 960
Modified Fri Jun 26 17:43:45 2009 UTC (9 years, 2 months ago) by feinerer
File length: 4807 byte(s)
Diff to previous 952
Add slam dependency and readReut21578XMLasPlain reader.

Revision 952
Modified Mon May 18 13:43:01 2009 UTC (9 years, 4 months ago) by feinerer
File length: 5309 byte(s)
Diff to previous 950
Further work on FCorpus integration.

Revision 950
Modified Thu May 14 15:17:18 2009 UTC (9 years, 4 months ago) by feinerer
File length: 5278 byte(s)
Diff to previous 946
Experimental FCorpus (fast corpus).

Revision 946
Modified Wed May 13 18:07:35 2009 UTC (9 years, 4 months ago) by feinerer
File length: 5179 byte(s)
Diff to previous 945
A lot of major improvements (see NEWS).

Revision 945
Modified Mon May 4 10:57:01 2009 UTC (9 years, 4 months ago) by feinerer
File length: 5195 byte(s)
Diff to previous 941
Export some simple_triplet_matrix functions.

Revision 941
Modified Mon Apr 27 15:36:43 2009 UTC (9 years, 4 months ago) by feinerer
File length: 5183 byte(s)
Diff to previous 939
Create two distinct classes for term-document and document-term matrices.

Revision 939
Modified Sun Apr 26 07:04:11 2009 UTC (9 years, 4 months ago) by feinerer
File length: 4614 byte(s)
Diff to previous 938
Rename readCustom to readTabular.

Revision 938
Modified Sat Apr 25 19:05:50 2009 UTC (9 years, 5 months ago) by feinerer
File length: 4613 byte(s)
Diff to previous 926
Get rid of Matrix package dependency.

Revision 926
Modified Sat Apr 4 06:50:02 2009 UTC (9 years, 5 months ago) by feinerer
File length: 4186 byte(s)
Diff to previous 925
tmReduce() allows combining multiple maps into one transformation.

Revision 925
Modified Fri Apr 3 17:39:44 2009 UTC (9 years, 5 months ago) by feinerer
File length: 4157 byte(s)
Diff to previous 924
Removed TermDocMatrix. Use DocumentTermMatrix or TermDocumentMatrix instead.

Revision 924
Modified Fri Apr 3 15:41:48 2009 UTC (9 years, 5 months ago) by feinerer
File length: 4159 byte(s)
Diff to previous 912
Improve weighting functions.

Revision 912
Modified Mon Mar 23 22:50:36 2009 UTC (9 years, 6 months ago) by feinerer
File length: 4183 byte(s)
Diff to previous 911
New reader for arbitrary XML formats.

Revision 911
Modified Sun Mar 22 17:55:16 2009 UTC (9 years, 6 months ago) by feinerer
File length: 4165 byte(s)
Diff to previous 910
New XMLSource class for arbitrary XML files.

Revision 910
Modified Sun Mar 22 16:33:30 2009 UTC (9 years, 6 months ago) by feinerer
File length: 4199 byte(s)
Diff to previous 909
CSVSource is defunct.

Revision 909
Modified Sun Mar 22 12:45:59 2009 UTC (9 years, 6 months ago) by feinerer
File length: 4216 byte(s)
Diff to previous 908
Sources now can be vectorized.

Revision 908
Modified Sat Mar 21 18:22:48 2009 UTC (9 years, 6 months ago) by feinerer
File length: 4190 byte(s)
Diff to previous 886
Added reader which can be customized via user-defined mappings.

Revision 886
Modified Thu Jan 29 22:47:34 2009 UTC (9 years, 7 months ago) by feinerer
File length: 4169 byte(s)
Diff to previous 885
Speed up package loading (Depends -> Suggests).

Revision 885
Modified Thu Jan 29 09:34:44 2009 UTC (9 years, 7 months ago) by stefan7th
File length: 4229 byte(s)
Copied from: pkg/tm/NAMESPACE revision 884
Diff to previous 884
moved package to /pkg

Revision 884
Modified Wed Jan 28 10:24:27 2009 UTC (9 years, 7 months ago) by stefan7th
Original Path: pkg/tm/NAMESPACE
File length: 4229 byte(s)
Diff to previous 876
R-Forge transition completed

Revision 876
Modified Sat Dec 6 15:58:01 2008 UTC (9 years, 9 months ago) by feinerer
Original Path: trunk/tm/NAMESPACE
File length: 4229 byte(s)
Diff to previous 874
New DataframeSource.

Revision 874
Modified Sat Nov 29 16:24:45 2008 UTC (9 years, 9 months ago) by feinerer
Original Path: trunk/tm/NAMESPACE
File length: 4170 byte(s)
Diff to previous 864
New URISource.

Revision 864
Modified Fri Jul 25 14:21:06 2008 UTC (10 years, 2 months ago) by feinerer
Original Path: trunk/tm/NAMESPACE
File length: 4116 byte(s)
Diff to previous 857
More fine-tuning when using an MPI cluster in tm.

Revision 857
Modified Tue Jul 8 16:01:47 2008 UTC (10 years, 2 months ago) by feinerer
Original Path: trunk/tm/NAMESPACE
File length: 4062 byte(s)
Diff to previous 848
Removed tm-internal. Better (consistent) naming for dictionary functions.

Revision 848
Modified Tue Apr 29 16:51:43 2008 UTC (10 years, 4 months ago) by feinerer
Original Path: trunk/tm/NAMESPACE
File length: 4124 byte(s)
Diff to previous 846
Improved vignette.

Revision 846
Modified Sat Apr 26 08:11:14 2008 UTC (10 years, 4 months ago) by feinerer
Original Path: trunk/tm/NAMESPACE
File length: 4103 byte(s)
Diff to previous 845
Added as.matrix() method for TermDocMatrix.

Revision 845
Modified Sat Apr 26 07:46:42 2008 UTC (10 years, 4 months ago) by feinerer
Original Path: trunk/tm/NAMESPACE
File length: 4076 byte(s)
Diff to previous 842
Added function for creating chunks from corpora.

Revision 842
Modified Thu Apr 24 15:25:40 2008 UTC (10 years, 5 months ago) by feinerer
Original Path: trunk/tm/NAMESPACE
File length: 4055 byte(s)
Diff to previous 838
Added Simple Dublin Core meta data wrappers.

Revision 838
Modified Wed Apr 23 09:45:06 2008 UTC (10 years, 5 months ago) by feinerer
Original Path: trunk/tm/NAMESPACE
File length: 3997 byte(s)
Diff to previous 836
Changed replaceWords to replacePatterns. Suggested by Christian Buchta.

Revision 836 - (view) (download) (annotate) - [select for diffs]
Modified Sat Apr 19 17:08:07 2008 UTC (10 years, 5 months ago) by feinerer
Original Path: trunk/tm/NAMESPACE
File length: 3994 byte(s)
Diff to previous 832
Improved meta data handling. Added coerce method from list to corpus. Updated CITATION file.

Revision 832 - (view) (download) (annotate) - [select for diffs]
Modified Wed Mar 12 12:59:48 2008 UTC (10 years, 6 months ago) by feinerer
Original Path: trunk/tm/NAMESPACE
File length: 3958 byte(s)
Diff to previous 829
Added VectorSource.

Revision 829 - (view) (download) (annotate) - [select for diffs]
Modified Mon Mar 10 22:55:39 2008 UTC (10 years, 6 months ago) by feinerer
Original Path: trunk/tm/NAMESPACE
File length: 3898 byte(s)
Diff to previous 823
First version of working lazy mapping.

Revision 823 - (view) (download) (annotate) - [select for diffs]
Modified Wed Feb 6 13:47:59 2008 UTC (10 years, 7 months ago) by feinerer
Original Path: trunk/tm/NAMESPACE
File length: 3859 byte(s)
Diff to previous 822
Added removeNumbers transformation.

Revision 822 - (view) (download) (annotate) - [select for diffs]
Modified Wed Feb 6 13:06:15 2008 UTC (10 years, 7 months ago) by feinerer
Original Path: trunk/tm/NAMESPACE
File length: 3828 byte(s)
Diff to previous 819
Renamed completeStems to stemCompletion (suggested by David Meyer).

Revision 819 - (view) (download) (annotate) - [select for diffs]
Modified Thu Jan 31 09:09:18 2008 UTC (10 years, 7 months ago) by feinerer
Original Path: trunk/tm/NAMESPACE
File length: 3827 byte(s)
Diff to previous 816
Added writeCorpus function.

Revision 816 - (view) (download) (annotate) - [select for diffs]
Modified Thu Jan 24 14:36:41 2008 UTC (10 years, 8 months ago) by feinerer
Original Path: trunk/tm/NAMESPACE
File length: 3798 byte(s)
Diff to previous 815
Renamed TextDocCol to Corpus, and Corpus to Content.

Revision 815 - (view) (download) (annotate) - [select for diffs]
Modified Tue Jan 22 22:31:36 2008 UTC (10 years, 8 months ago) by feinerer
Original Path: trunk/tm/NAMESPACE
File length: 3808 byte(s)
Diff to previous 814
Added documentation for meta().

Revision 814 - (view) (download) (annotate) - [select for diffs]
Modified Tue Jan 22 18:47:57 2008 UTC (10 years, 8 months ago) by feinerer
Original Path: trunk/tm/NAMESPACE
File length: 3784 byte(s)
Diff to previous 806
Fix namespace.

Revision 806 - (view) (download) (annotate) - [select for diffs]
Modified Wed Jan 2 10:29:14 2008 UTC (10 years, 8 months ago) by feinerer
Original Path: trunk/tm/NAMESPACE
File length: 3788 byte(s)
Diff to previous 805
Modular TermDocMatrix constructor is now default.

Revision 805 - (view) (download) (annotate) - [select for diffs]
Modified Tue Jan 1 14:10:40 2008 UTC (10 years, 8 months ago) by feinerer
Original Path: trunk/tm/NAMESPACE
File length: 3820 byte(s)
Diff to previous 802
Added function (getReaders) returning all available reader functions.

Revision 802 - (view) (download) (annotate) - [select for diffs]
Modified Sun Dec 2 09:28:41 2007 UTC (10 years, 9 months ago) by feinerer
Original Path: trunk/tm/NAMESPACE
File length: 3799 byte(s)
Diff to previous 799
See ChangeLog.

Revision 799 - (view) (download) (annotate) - [select for diffs]
Modified Thu Nov 29 11:05:23 2007 UTC (10 years, 9 months ago) by feinerer
Original Path: trunk/tm/NAMESPACE
File length: 3772 byte(s)
Diff to previous 795
Better handling of empty arguments in TextDocCol. Exported readDOC.

Revision 795 - (view) (download) (annotate) - [select for diffs]
Modified Sat Oct 27 09:14:35 2007 UTC (10 years, 10 months ago) by feinerer
Original Path: trunk/tm/NAMESPACE
File length: 3754 byte(s)
Diff to previous 792
Updated documentation

Revision 792 - (view) (download) (annotate) - [select for diffs]
Modified Sun Oct 21 11:52:52 2007 UTC (10 years, 11 months ago) by feinerer
Original Path: trunk/tm/NAMESPACE
File length: 3730 byte(s)
Diff to previous 790
Updated NAMESPACE.

Revision 790 - (view) (download) (annotate) - [select for diffs]
Modified Sun Oct 21 08:27:13 2007 UTC (10 years, 11 months ago) by feinerer
Original Path: trunk/tm/NAMESPACE
File length: 3651 byte(s)
Diff to previous 788
Exported termFreq to NAMESPACE. New modular constructor for TermDocMatrix (called TermDocMatrix2 at the moment).

Revision 788 - (view) (download) (annotate) - [select for diffs]
Modified Sun Oct 14 12:16:26 2007 UTC (10 years, 11 months ago) by feinerer
Original Path: trunk/tm/NAMESPACE
File length: 3600 byte(s)
Diff to previous 786
Weighting functions for TermDocMatrix.

Revision 786 - (view) (download) (annotate) - [select for diffs]
Modified Sat Oct 13 16:27:24 2007 UTC (10 years, 11 months ago) by feinerer
Original Path: trunk/tm/NAMESPACE
File length: 3451 byte(s)
Diff to previous 780
Documentation for plot.TermDocMatrix.

Revision 780 - (view) (download) (annotate) - [select for diffs]
Modified Sat Sep 29 13:24:17 2007 UTC (10 years, 11 months ago) by feinerer
Original Path: trunk/tm/NAMESPACE
File length: 3338 byte(s)
Diff to previous 779
Added three transformations often used for e-mail analyses.

Revision 779 - (view) (download) (annotate) - [select for diffs]
Modified Tue Sep 11 05:52:39 2007 UTC (11 years ago) by feinerer
Original Path: trunk/tm/NAMESPACE
File length: 3240 byte(s)
Diff to previous 777
Added documentation for removePunctuation.

Revision 777 - (view) (download) (annotate) - [select for diffs]
Modified Tue Aug 28 07:19:12 2007 UTC (11 years ago) by feinerer
Original Path: trunk/tm/NAMESPACE
File length: 3205 byte(s)
Diff to previous 774
Function generators are now real S4 classes instead of S3 attributes.

Revision 774 - (view) (download) (annotate) - [select for diffs]
Modified Sat Jul 21 16:25:54 2007 UTC (11 years, 2 months ago) by feinerer
Original Path: trunk/tm/NAMESPACE
File length: 3135 byte(s)
Diff to previous 767
Added convenience methods for term-document matrices.

Revision 767 - (view) (download) (annotate) - [select for diffs]
Modified Sat Jul 14 16:50:44 2007 UTC (11 years, 2 months ago) by feinerer
Original Path: trunk/tm/NAMESPACE
File length: 3091 byte(s)
Diff to previous 766
Added simple HTML reader to produce StructuredTextDocuments.

Revision 766 - (view) (download) (annotate) - [select for diffs]
Modified Sat Jul 14 08:46:23 2007 UTC (11 years, 2 months ago) by feinerer
Original Path: trunk/tm/NAMESPACE
File length: 3072 byte(s)
Diff to previous 765
Added PDF reader based on pdftotext and pdfinfo.

Revision 765 - (view) (download) (annotate) - [select for diffs]
Modified Fri Jul 13 15:53:45 2007 UTC (11 years, 2 months ago) by feinerer
Original Path: trunk/tm/NAMESPACE
File length: 3054 byte(s)
Diff to previous 763
See ChangeLog.

Revision 763 - (view) (download) (annotate) - [select for diffs]
Modified Wed Jul 11 11:56:44 2007 UTC (11 years, 2 months ago) by feinerer
Original Path: trunk/tm/NAMESPACE
File length: 3033 byte(s)
Diff to previous 758
Changed from cba to new proxy package for computing (dis)similarities.

Revision 758 - (view) (download) (annotate) - [select for diffs]
Modified Wed Jun 13 02:25:36 2007 UTC (11 years, 3 months ago) by feinerer
Original Path: trunk/tm/NAMESPACE
File length: 3031 byte(s)
Diff to previous 757
Added dictionary support.

Revision 757 - (view) (download) (annotate) - [select for diffs]
Modified Thu Jun 7 17:41:56 2007 UTC (11 years, 3 months ago) by feinerer
Original Path: trunk/tm/NAMESPACE
File length: 2913 byte(s)
Diff to previous 755
Added classes for Reuters21578 XML and RCV1 documents.

Revision 755 - (view) (download) (annotate) - [select for diffs]
Modified Sun Jun 3 17:20:40 2007 UTC (11 years, 3 months ago) by feinerer
Original Path: trunk/tm/NAMESPACE
File length: 2845 byte(s)
Diff to previous 752
Added replaceWords function.

Revision 752 - (view) (download) (annotate) - [select for diffs]
Modified Sat May 19 22:39:04 2007 UTC (11 years, 4 months ago) by feinerer
Original Path: trunk/tm/NAMESPACE
File length: 2815 byte(s)
Diff to previous 742
Small bug fix in textvector(). Added new function removeSparseTerms().

Revision 742 - (view) (download) (annotate) - [select for diffs]
Modified Sat Apr 21 18:36:11 2007 UTC (11 years, 5 months ago) by feinerer
Original Path: trunk/tm/NAMESPACE
File length: 2780 byte(s)
Diff to previous 734
Updated NAMESPACE.

Revision 734 - (view) (download) (annotate) - [select for diffs]
Modified Wed Apr 11 19:12:12 2007 UTC (11 years, 5 months ago) by feinerer
Original Path: trunk/tm/NAMESPACE
File length: 2786 byte(s)
Diff to previous 722
Fixed some warnings reported by R CMD check.

Revision 722 - (view) (download) (annotate) - [select for diffs]
Modified Sun Apr 1 15:53:58 2007 UTC (11 years, 5 months ago) by feinerer
Original Path: trunk/tm/NAMESPACE
File length: 2780 byte(s)
Diff to previous 721
Prettyprint summary, print method for plain text docs, removePunctuation.

Revision 721 - (view) (download) (annotate) - [select for diffs]
Modified Wed Mar 21 13:54:43 2007 UTC (11 years, 6 months ago) by feinerer
Original Path: trunk/tm/NAMESPACE
File length: 2740 byte(s)
Diff to previous 718
Simplified sFilter.

Revision 718 - (view) (download) (annotate) - [select for diffs]
Modified Fri Mar 16 12:55:16 2007 UTC (11 years, 6 months ago) by feinerer
Original Path: trunk/tm/NAMESPACE
File length: 2711 byte(s)
Diff to previous 717
We now use sparse matrices.

Revision 717 - (view) (download) (annotate) - [select for diffs]
Modified Fri Mar 16 11:13:04 2007 UTC (11 years, 6 months ago) by feinerer
Original Path: trunk/tm/NAMESPACE
File length: 2665 byte(s)
Diff to previous 716
Added Language slot to text documents. Refactored TextDocCol constructor.

Revision 716 - (view) (download) (annotate) - [select for diffs]
Modified Thu Mar 15 17:22:39 2007 UTC (11 years, 6 months ago) by feinerer
Original Path: trunk/tm/NAMESPACE
File length: 2611 byte(s)
Diff to previous 713
Some improvements for TermDocMatrix.

Revision 713 - (view) (download) (annotate) - [select for diffs]
Modified Wed Mar 14 13:44:11 2007 UTC (11 years, 6 months ago) by feinerer
Original Path: trunk/tm/NAMESPACE
File length: 2596 byte(s)
Diff to previous 712
Added Snowball support. Added function returning stopwords (English, German, French).

Revision 712 - (view) (download) (annotate) - [select for diffs]
Modified Sun Mar 4 15:18:36 2007 UTC (11 years, 6 months ago) by feinerer
Original Path: trunk/tm/NAMESPACE
File length: 2550 byte(s)
Diff to previous 698
Started to implement database support to optimize RAM usage, i.e., minimize RAM demand if necessary.

Revision 698 - (view) (download) (annotate) - [select for diffs]
Modified Sat Jan 6 17:05:44 2007 UTC (11 years, 8 months ago) by feinerer
Original Path: trunk/tm/NAMESPACE
File length: 2523 byte(s)
Diff to previous 694
Changes due to Kurt's review.

Revision 694 - (view) (download) (annotate) - [select for diffs]
Modified Sun Dec 31 14:47:46 2006 UTC (11 years, 8 months ago) by feinerer
Original Path: trunk/tm/NAMESPACE
File length: 2557 byte(s)
Diff to previous 693
Implemented improvements based upon comments by David.

Revision 693 - (view) (download) (annotate) - [select for diffs]
Modified Fri Dec 22 13:21:30 2006 UTC (11 years, 9 months ago) by feinerer
Original Path: trunk/tm/NAMESPACE
File length: 2559 byte(s)
Diff to previous 690
Renamed textmin to tm directory since the package name changed.

Revision 690 - (view) (download) (annotate) - [select for diffs]
Modified Sat Dec 16 17:22:56 2006 UTC (11 years, 9 months ago) by feinerer
Original Path: trunk/textmin/NAMESPACE
File length: 2559 byte(s)
Diff to previous 689
Renamed package to 'tm'. Updated documentation (man) for CRAN release.

Revision 689 - (view) (download) (annotate) - [select for diffs]
Modified Fri Dec 8 14:21:46 2006 UTC (11 years, 9 months ago) by feinerer
Original Path: trunk/textmin/NAMESPACE
File length: 2557 byte(s)
Diff to previous 78
Implemented changes as proposed at the Forschungsklausur on 01.12.2006.

Revision 78 - (view) (download) (annotate) - [select for diffs]
Modified Wed Nov 29 14:56:36 2006 UTC (11 years, 9 months ago) by zeileis
Original Path: trunk/textmin/NAMESPACE
File length: 2472 byte(s)
Diff to previous 77
removed old repos structure, now only R packages

Revision 77 - (view) (download) (annotate) - [select for diffs]
Modified Sun Nov 26 13:32:16 2006 UTC (11 years, 9 months ago) by feinerer
Original Path: trunk/R/textmin/NAMESPACE
File length: 2472 byte(s)
Diff to previous 73
See ChangeLog.

Revision 73 - (view) (download) (annotate) - [select for diffs]
Modified Tue Nov 21 15:52:39 2006 UTC (11 years, 10 months ago) by feinerer
Original Path: trunk/R/textmin/NAMESPACE
File length: 2580 byte(s)
Diff to previous 72
Rewrote s_filter function.

Revision 72 - (view) (download) (annotate) - [select for diffs]
Modified Mon Nov 20 20:43:34 2006 UTC (11 years, 10 months ago) by feinerer
Original Path: trunk/R/textmin/NAMESPACE
File length: 2549 byte(s)
Diff to previous 71
Added update mechanism for document collections. Various fixes for metadata handling.

Revision 71 - (view) (download) (annotate) - [select for diffs]
Modified Sun Nov 19 17:30:26 2006 UTC (11 years, 10 months ago) by feinerer
Original Path: trunk/R/textmin/NAMESPACE
File length: 2522 byte(s)
Diff to previous 69
Added sophisticated merging for document collections.

Revision 69 - (view) (download) (annotate) - [select for diffs]
Modified Fri Nov 3 10:50:39 2006 UTC (11 years, 10 months ago) by feinerer
Original Path: trunk/R/textmin/NAMESPACE
File length: 2447 byte(s)
Diff to previous 67
Added functions for modifying and removing metadata.

Revision 67 - (view) (download) (annotate) - [select for diffs]
Modified Wed Nov 1 17:29:59 2006 UTC (11 years, 10 months ago) by feinerer
Original Path: trunk/R/textmin/NAMESPACE
File length: 2381 byte(s)
Diff to previous 66
See ChangeLog

Revision 66 - (view) (download) (annotate) - [select for diffs]
Modified Tue Oct 31 22:03:33 2006 UTC (11 years, 10 months ago) by feinerer
Original Path: trunk/R/textmin/NAMESPACE
File length: 2319 byte(s)
Diff to previous 65
See ChangeLog.

Revision 65 - (view) (download) (annotate) - [select for diffs]
Modified Tue Oct 31 17:10:24 2006 UTC (11 years, 10 months ago) by feinerer
Original Path: trunk/R/textmin/NAMESPACE
File length: 2354 byte(s)
Diff to previous 64


Revision 64 - (view) (download) (annotate) - [select for diffs]
Modified Sun Oct 29 14:29:43 2006 UTC (11 years, 10 months ago) by feinerer
Original Path: trunk/R/textmin/NAMESPACE
File length: 2353 byte(s)
Diff to previous 63
Corrected NAMESPACE. Some minor improvements in the code.

Revision 63 - (view) (download) (annotate) - [select for diffs]
Modified Thu Oct 26 14:59:09 2006 UTC (11 years, 10 months ago) by feinerer
Original Path: trunk/R/textmin/NAMESPACE
File length: 1704 byte(s)
Diff to previous 61
See ChangeLog.

Revision 61 - (view) (download) (annotate) - [select for diffs]
Modified Mon Oct 23 20:07:05 2006 UTC (11 years, 11 months ago) by feinerer
Original Path: trunk/R/textmin/NAMESPACE
File length: 1579 byte(s)
Diff to previous 56
See ChangeLog.

Revision 56 - (view) (download) (annotate) - [select for diffs]
Modified Sun Sep 24 14:12:28 2006 UTC (12 years ago) by feinerer
Original Path: trunk/R/textmin/NAMESPACE
File length: 1403 byte(s)
Diff to previous 55
See ChangeLog.

Revision 55 - (view) (download) (annotate) - [select for diffs]
Modified Thu Sep 14 12:31:07 2006 UTC (12 years ago) by feinerer
Original Path: trunk/R/textmin/NAMESPACE
File length: 1130 byte(s)
Diff to previous 54
Minor improvements.

Revision 54 - (view) (download) (annotate) - [select for diffs]
Modified Wed Sep 13 09:08:20 2006 UTC (12 years ago) by feinerer
Original Path: trunk/R/textmin/NAMESPACE
File length: 1112 byte(s)
Diff to previous 53
length, show and summary functions. Renamed transformXXX and filterXXX to tm_transform and tm_filter.

Revision 53 - (view) (download) (annotate) - [select for diffs]
Modified Thu Aug 24 13:06:50 2006 UTC (12 years, 1 month ago) by feinerer
Original Path: trunk/R/textmin/NAMESPACE
File length: 972 byte(s)
Diff to previous 52
See ChangeLog for changes.

Revision 52 - (view) (download) (annotate) - [select for diffs]
Modified Sat Aug 12 12:43:39 2006 UTC (12 years, 1 month ago) by feinerer
Original Path: trunk/R/textmin/NAMESPACE
File length: 936 byte(s)
Diff to previous 51
Various updates. See ChangeLog and diff source code.

Revision 51 - (view) (download) (annotate) - [select for diffs]
Modified Mon Aug 7 12:14:09 2006 UTC (12 years, 1 month ago) by feinerer
Original Path: trunk/R/textmin/NAMESPACE
File length: 721 byte(s)
Diff to previous 50
Various changes due to new layout.

Revision 50 - (view) (download) (annotate) - [select for diffs]
Modified Mon Aug 7 08:55:57 2006 UTC (12 years, 1 month ago) by feinerer
Original Path: trunk/R/textmin/NAMESPACE
File length: 638 byte(s)
Diff to previous 47
Corrected TermDocMatrix and NAMESPACE.

Revision 47 - (view) (download) (annotate) - [select for diffs]
Modified Mon Jul 10 12:22:35 2006 UTC (12 years, 2 months ago) by feinerer
Original Path: trunk/R/textmin/NAMESPACE
File length: 333 byte(s)
Copied from: trunk/R/trunk/NAMESPACE revision 44
Diff to previous 46
Renamed tm to textmin directory.

Revision 46 - (view) (download) (annotate) - [select for diffs]
Modified Wed Jul 5 18:08:41 2006 UTC (12 years, 2 months ago) by meyer
Original Path: trunk/R/tm/NAMESPACE
File length: 333 byte(s)
Copied from: trunk/R/trunk/NAMESPACE revision 44
Diff to previous 45
move


Revision 45 - (view) (download) (annotate) - [select for diffs]
Modified Wed Jul 5 17:27:29 2006 UTC (12 years, 2 months ago) by meyer
Original Path: trunk/R/trunk/tm/NAMESPACE
File length: 333 byte(s)
Copied from: trunk/R/trunk/NAMESPACE revision 44
Diff to previous 44
move in subdir


Revision 44 - (view) (download) (annotate) - [select for diffs]
Modified Sun Jul 2 11:56:55 2006 UTC (12 years, 2 months ago) by feinerer
Original Path: trunk/R/trunk/NAMESPACE
File length: 333 byte(s)
Diff to previous 32
Various consistency updates.

Revision 32 - (view) (download) (annotate) - [select for diffs]
Added Thu Dec 15 13:13:54 2005 UTC (12 years, 9 months ago) by feinerer
Original Path: trunk/R/trunk/NAMESPACE
File length: 329 byte(s)

