[tm] Log of /pkg/inst/NEWS.Rd

Revision 1481
Modified Sat May 20 10:28:00 2017 UTC (3 months ago) by feinerer
File length: 19250 byte(s)
Diff to previous 1478
Support TIF for DataframeSource

See Text Interchange Formats (TIF, https://github.com/ropensci/tif) and
readtext (https://github.com/kbenoit/readtext).

Revision 1478
Modified Sat Mar 25 18:13:48 2017 UTC (4 months, 4 weeks ago) by feinerer
File length: 18746 byte(s)
Diff to previous 1474
Mention bug reporters

Revision 1474
Modified Tue Mar 21 19:26:21 2017 UTC (5 months ago) by feinerer
File length: 18690 byte(s)
Diff to previous 1472
Fix 'dictionary' argument handling.

Revision 1472
Modified Wed Mar 1 14:48:46 2017 UTC (5 months, 3 weeks ago) by khornik
File length: 18428 byte(s)
Diff to previous 1466
Update.

Revision 1466
Modified Mon Jan 16 11:31:26 2017 UTC (7 months ago) by feinerer
File length: 18287 byte(s)
Diff to previous 1450
Document new parallelization environment.

Revision 1450
Modified Mon Dec 12 11:34:10 2016 UTC (8 months, 1 week ago) by feinerer
File length: 17875 byte(s)
Diff to previous 1446
Document findMostFreqTerms() in NEWS

Revision 1446
Modified Wed Nov 2 15:41:49 2016 UTC (9 months, 2 weeks ago) by feinerer
File length: 17725 byte(s)
Diff to previous 1445
Revive parallel::mclapply()

Experiments show that, with the right hardware, mclapply() gives you measurable
performance gains. So reenable it --- despite substantial drawbacks (RAM and
CPU overhead) in some scenarios.

Revision 1445
Modified Sun Oct 9 09:30:58 2016 UTC (10 months, 1 week ago) by feinerer
File length: 17882 byte(s)
Diff to previous 1440
Speed up termFreq(), general cleanup

- Avoid parallel::mclapply()
- Use custom .table()
- Use rep.int(), rep_len() and lengths()
- Fix typos
- Shorten overlong lines
- Consistent formatting

Revision 1440
Modified Sat Jul 30 06:34:57 2016 UTC (12 months, 3 weeks ago) by feinerer
File length: 17725 byte(s)
Diff to previous 1438
Corpus() now chooses between SimpleCorpus and VCorpus based on its arguments

Revision 1438
Modified Sat Jul 16 18:32:59 2016 UTC (13 months ago) by feinerer
File length: 17724 byte(s)
Diff to previous 1437
Use Rcpp for efficient term-document matrix construction from a SimpleCorpus

Revision 1437
Modified Wed Jul 13 19:23:49 2016 UTC (13 months, 1 week ago) by feinerer
File length: 17723 byte(s)
Diff to previous 1436
Add SimpleCorpus

SimpleCorpus provides a corpus which is optimized for the most common usage
scenario: importing plain texts from files in a directory or directly from a
vector in R, preprocessing and transforming the texts, and finally exporting
them to a term-document matrix. The aim is to boost performance and minimize
memory pressure. It loads all documents into memory, and is designed for
medium-sized to large data sets.
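
The workflow described above (texts from a vector, preprocessing, export to a term-document matrix) can be illustrated with a minimal sketch. This is Python rather than R and is not how tm implements SimpleCorpus; the function names `preprocess` and `term_document_matrix` are hypothetical, and the preprocessing is assumed to be plain lowercasing plus whitespace splitting.

```python
from collections import Counter


def preprocess(text):
    """Lowercase and split on whitespace (a stand-in for real preprocessing)."""
    return text.lower().split()


def term_document_matrix(texts):
    """Return (terms, matrix); matrix[i][j] counts term i in document j."""
    counts = [Counter(preprocess(t)) for t in texts]
    # The vocabulary is the sorted union of all terms seen in any document.
    terms = sorted(set().union(*counts)) if counts else []
    # A Counter returns 0 for missing keys, so absent terms need no special case.
    matrix = [[c[term] for c in counts] for term in terms]
    return terms, matrix


# All documents are held in memory, as the log entry describes.
docs = ["The cat sat", "the dog sat down"]
terms, tdm = term_document_matrix(docs)
```

Rows are terms and columns are documents, matching the term-document orientation named in the entry; a real implementation would likely use a sparse representation for larger collections.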

Revision 1436
Modified Wed Nov 18 11:38:50 2015 UTC (21 months ago) by feinerer
File length: 17245 byte(s)
Diff to previous 1432
inspect.TermDocumentMatrix() now displays a sample instead of the full matrix

Revision 1432
Modified Wed Jul 1 19:17:31 2015 UTC (2 years, 1 month ago) by feinerer
File length: 16788 byte(s)
Diff to previous 1425
Update NEWS

Revision 1425
Modified Tue May 5 14:36:17 2015 UTC (2 years, 3 months ago) by feinerer
File length: 16215 byte(s)
Diff to previous 1413
Describe new features

Revision 1413
Modified Sat Apr 4 08:21:38 2015 UTC (2 years, 4 months ago) by feinerer
File length: 15900 byte(s)
Diff to previous 1397
Correctly process words being truncations of others

Revision 1397
Modified Fri Sep 12 19:30:27 2014 UTC (2 years, 11 months ago) by feinerer
File length: 15688 byte(s)
Diff to previous 1396
Add open() and close() for sources

Useful for sources with complex or expensive setup, e.g., database connections
or file handles.
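
The idea of an explicit open/close lifecycle for a source can be sketched as follows. This is a Python illustration under stated assumptions, not tm's R interface: the class `LineSource` and its `opener` argument are hypothetical, and an in-memory stream stands in for an expensive resource such as a file handle.

```python
import io


class LineSource:
    """Hypothetical source yielding one document per line of a text stream."""

    def __init__(self, opener):
        self._opener = opener  # callable that returns a file-like object
        self._handle = None

    def open(self):
        # Acquire the (possibly expensive) resource only when asked to.
        self._handle = self._opener()
        return self

    def close(self):
        # Release the resource; calling close() twice is harmless.
        if self._handle is not None:
            self._handle.close()
            self._handle = None

    def __iter__(self):
        if self._handle is None:
            raise RuntimeError("source is not open")
        for line in self._handle:
            yield line.rstrip("\n")


source = LineSource(lambda: io.StringIO("first doc\nsecond doc\n")).open()
try:
    documents = list(source)
finally:
    source.close()  # always release the handle, even if reading fails
```

Separating setup and teardown from iteration lets the caller decide when the resource is held, which is the benefit the entry attributes to sources with complex or expensive setup.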

Revision 1396
Modified Sun Aug 31 13:51:02 2014 UTC (2 years, 11 months ago) by khornik
File length: 15528 byte(s)
Diff to previous 1372
Add encoding.

Revision 1372
Modified Wed May 7 15:17:06 2014 UTC (3 years, 3 months ago) by feinerer
File length: 15511 byte(s)
Diff to previous 1368
Saved objects need to be rebuilt

Revision 1368
Modified Mon Apr 28 20:47:43 2014 UTC (3 years, 3 months ago) by feinerer
File length: 15314 byte(s)
Diff to previous 1363
Romanian stopwords (suggested by Cristian Chirita)

Revision 1363
Modified Mon Apr 28 09:49:46 2014 UTC (3 years, 3 months ago) by feinerer
File length: 15251 byte(s)
Diff to previous 1345
Keep Corpus() alias

Revision 1345
Modified Sun Apr 20 16:48:32 2014 UTC (3 years, 4 months ago) by feinerer
File length: 15389 byte(s)
Diff to previous 1283
Update NEWS

Revision 1283
Modified Wed Jan 8 06:04:16 2014 UTC (3 years, 7 months ago) by feinerer
File length: 13049 byte(s)
Diff to previous 1280
Close parentheses

Revision 1280
Modified Tue Jan 7 08:35:12 2014 UTC (3 years, 7 months ago) by feinerer
File length: 13047 byte(s)
Diff to previous 1277
Improve wording

Revision 1277
Modified Sun Jan 5 16:32:40 2014 UTC (3 years, 7 months ago) by feinerer
File length: 13046 byte(s)
Diff to previous 1260
Remove Dictionary class and functions

At the moment a Dictionary has no added value compared to a simple character
vector. We might want to reconsider dictionaries in the context of NLP but with
more functionality later on.

Revision 1260
Modified Sat Sep 21 09:10:20 2013 UTC (3 years, 11 months ago) by feinerer
File length: 12876 byte(s)
Diff to previous 1258
Move preprocessReut21578XML() to package tm.corpus.Reuters21578

Revision 1258
Modified Fri Sep 20 12:15:42 2013 UTC (3 years, 11 months ago) by feinerer
File length: 12771 byte(s)
Diff to previous 1255
Remove GmaneSource() and readGmane(), simplify readers, improve documentation

Revision 1255
Modified Wed Sep 11 07:30:06 2013 UTC (3 years, 11 months ago) by feinerer
File length: 12567 byte(s)
Diff to previous 1253
Rename tm_tag_score() to tm_term_score()

Revision 1253
Modified Fri Aug 30 10:03:09 2013 UTC (3 years, 11 months ago) by feinerer
File length: 12435 byte(s)
Diff to previous 1242
Remove getFilters(), searchFullText(), and tm_intersect() (use grep() instead)

Revision 1242
Modified Mon Aug 19 05:33:57 2013 UTC (4 years ago) by feinerer
File length: 12266 byte(s)
Diff to previous 1239
Do not register VCorpus and PlainTextDocument as S4 classes anymore

Revision 1239
Modified Fri Aug 9 10:11:21 2013 UTC (4 years ago) by feinerer
File length: 12056 byte(s)
Diff to previous 1233
Document change to GPL-3

Revision 1233
Modified Thu Jul 11 08:25:14 2013 UTC (4 years, 1 month ago) by feinerer
File length: 11882 byte(s)
Diff to previous 1227
Use \pkg{} macro

Revision 1227
Modified Sun Jun 16 08:37:10 2013 UTC (4 years, 2 months ago) by feinerer
File length: 11852 byte(s)
Diff to previous 1226
Use package parallel instead of Rmpi and snow

Revision 1226
Modified Sun Jun 16 07:38:58 2013 UTC (4 years, 2 months ago) by feinerer
File length: 11445 byte(s)
Diff to previous 1224
Document SnowballC switch in NEWS

Revision 1224
Modified Sat Jun 15 12:27:17 2013 UTC (4 years, 2 months ago) by feinerer
File length: 11306 byte(s)
Diff to previous 1174
Document Snowball stopword lists

Revision 1174
Modified Mon Jan 23 09:55:47 2012 UTC (5 years, 7 months ago) by feinerer
File length: 11050 byte(s)
Diff to previous 1173
Add Catalan stopwords

Revision 1173
Modified Mon Jan 16 15:05:22 2012 UTC (5 years, 7 months ago) by feinerer
File length: 10925 byte(s)
Diff to previous 1172
Process tolower and tokenize options first in termFreq()

Revision 1172
Modified Mon Jan 16 10:46:18 2012 UTC (5 years, 7 months ago) by feinerer
File length: 10364 byte(s)
Diff to previous 1170
Remove space

Revision 1170
Added Mon Jan 16 09:22:27 2012 UTC (5 years, 7 months ago) by feinerer
File length: 10367 byte(s)
Use NEWS.Rd instead of NEWS
