I have trouble narrowing down where exactly the problem is occurring (it seems to keep moving each time I think that I am getting closer).
In the short term, and for your needs, I guess that it is good if calls to gc prevents rpy from crashing. To save on runtime, you can try a gc pooling strategy (call gc every N iterations). 2008/7/15 laurent oget <[EMAIL PROTECTED]>: > Calling r.gc() before each creation seems to have solved the problem. We ran > a whole lot of things over the night without any segmentation fault. This is > however pretty expensive timewise. > > Laurent > > 2008/7/14 laurent oget <[EMAIL PROTECTED]>: >> >> I am running things with a call to gc() before each regression, in case >> what happens is a race condition where the gc is called in the middle of the >> constrction of a new dataframe... >> >> Thanks for the prompt help! >> >> Laurent >> >> 2008/7/14 Laurent Gautier <[EMAIL PROTECTED]>: >>> >>> 2008/7/15 laurent oget <[EMAIL PROTECTED]>: >>> > can i get a quick hint on how i would go about calling --verbose >>> > through RPY >>> > ? >>> >>> I suspect that the only way is to hack >>> line 93 of rpymodule.c >>> >>> char *defaultargv[] = {"rpy", "-q", "--vanilla"}; >>> >>> and recompile/install rpy. >>> >>> > I am pretty clueless about the way R does the garbage collection. One >>> > thing >>> > I know is there are columns that are shared between different linear >>> > regressions, so it might be that the garbage collector cleans up a >>> > column >>> > from iteration n-1 and breaks column n in the process. >>> >>> >>> >>> > Is there a way to >>> > call the garbage collection explicitely before I create a new dataframe >>> > to >>> > test this hypothesis? >>> >>> gc() >>> #possibly gc(verbose = TRUE) >>> >>> >>> I have made a toy example to see if I could reproduce it here (with rpy2 >>> - >>> but there are similarities in the way R objects pointed at from Python >>> objects >>> are protected from R's garbage collection)... and it seem that there >>> is something going on with garbage collection (depending on how the >>> python implementation, it either crashes after 10-50 iterations... or can >>> go through 1000 iterations. >>> >>> >>> > Thanks, >>> > >>> > Laurent >>> > >>> > 2008/7/14 Laurent Gautier <[EMAIL PROTECTED]>: >>> >> >>> >> Someone else reported on this list a similar sounding problem not so >>> >> long >>> >> ago. >>> >> >>> >> >>> >> The problem might be caused by manipulating a stale pointer to an R >>> >> object >>> >> (that is an object that was discarded during R's garbage collection), >>> >> and troubleshooting this will likely mean running things through a C >>> >> debugger. >>> >> You could try starting up you embedded R process with '--verbose' and >>> >> see >>> >> if the problem happens right after garbage collection. >>> >> Without having further details on the exact code ran, it is difficult >>> >> to say more. >>> >> >>> >> >>> >> >>> >> >>> >> >>> >> 2008/7/14 laurent oget <[EMAIL PROTECTED]>: >>> >> > I am using rpy/R to perform linear regressions on a large number of >>> >> > datasets, in one python run, and am encountering segmentation faults >>> >> > after a >>> >> > large number of iteration, while handling cases which, taken on >>> >> > their >>> >> > own >>> >> > run without a problem. My intuition is that the previous iterations >>> >> > somehow >>> >> > corrupted the heap (or the stack). The code breaks in different >>> >> > places. >>> >> > One >>> >> > sample stack trace: >>> >> > >>> >> > >>> >> > *** caught segfault *** >>> >> > address 0x8, cause 'memory not mapped' >>> >> > >>> >> > Traceback: >>> >> > 1: unique.default(unlist(lapply(answer, length))) >>> >> > 2: unique(unlist(lapply(answer, length))) >>> >> > 3: sapply(xlev, is.null) >>> >> > 4: .getXlevels(mt, mf) >>> >> > 5: function (formula, data, subset, weights, na.action, method = >>> >> > "qr", >>> >> > model = TRUE, x = FALSE, y = FALSE, qr = TRUE, singular.ok = TRUE, >>> >> > contrasts = NULL, offset, ...) { ret.x <- x ret.y <- y cl >>> >> > <- >>> >> > match.call() mf <- match.call(expand.dots = FALSE) m <- >>> >> > match(c("formula", "data", "subset", "weights", "na.action", >>> >> > "offset"), names(mf), 0L) mf <- mf[c(1L, m)] >>> >> > mf$drop.unused.levels >>> >> > <- >>> >> > TRUE mf[[1L]] <- as.name("model.frame") mf <- eval(mf, >>> >> > parent.frame()) if (method == "model.frame") return(mf) >>> >> > else >>> >> > if (method != "qr") warning(gettextf("method = '%s' is not >>> >> > supported. Using 'qr'", method), domain = NA) mt <- >>> >> > attr(mf, >>> >> > "terms") y <- model.response(mf, "numeric") w <- >>> >> > as.vector(model.weights(mf)) if (!is.null(w) && !is.numeric(w)) >>> >> > stop("'weights' must be a numeric vector") offset <- >>> >> > as.vector(model.offset(mf)) if (!is.null(offset)) { if >>> >> > (length(offset) == 1) offset <- rep(offset, NROW(y)) >>> >> > else >>> >> > if (length(offset) != NROW(y)) stop(gettextf("number of >>> >> > offsets >>> >> > is %d, should equal %d (number of observations)", >>> >> > length(offset), NROW(y)), domain = NA) } if >>> >> > (is.empty.model(mt)) >>> >> > { x <- NULL z <- list(coefficients = if (is.matrix(y)) >>> >> > matrix(, 0, 3) else numeric(0), residuals = y, >>> >> > fitted.values >>> >> > = 0 >>> >> > * y, weights = w, rank = 0L, df.residual = if >>> >> > (is.matrix(y)) >>> >> > nrow(y) else length(y)) if (!is.null(offset)) { >>> >> > z$fitted.values <- offset z$residuals <- y - offset >>> >> > } >>> >> > } else { x <- model.matrix(mt, mf, contrasts) z <- >>> >> > if >>> >> > (is.null(w)) lm.fit(x, y, offset = offset, singular.ok = >>> >> > singular.ok, ...) else lm.wfit(x, y, w, >>> >> > offset = >>> >> > offset, singular.ok = singular.ok, ...) } class(z) >>> >> > <- >>> >> > c(if >>> >> > (is.matrix(y)) "mlm", "lm") z$na.action <- attr(mf, "na.action") >>> >> > z$offset <- offset z$contrasts <- attr(x, "contrasts") >>> >> > z$xlevels >>> >> > <- >>> >> > .getXlevels(mt, mf) z$call <- cl z$terms <- mt if (model) >>> >> > z$model <- mf if (ret.x) z$x <- x if (ret.y) >>> >> > z$y >>> >> > <- >>> >> > y if (!qr) z$qr <- NULL >>> >> > z}("sls_1133745~dp_1133745+dp_3124901", list(dp_3124901 = c(0, 0, 0, >>> >> > 0, >>> >> > 0, >>> >> > -0.328504066972036, 0, 0, 0, 0, 0, 0, 0, -0.328504066972036, 0, 0, >>> >> > 0, 0, >>> >> > 0, >>> >> > -0.328504066972036, -0.328504066972036, 0, 0, 0, 0, 0, 0, 0, 0, 0, >>> >> > 0, >>> >> > -0.693147180559945, 0, -0.693147180559945, 0, 0, 0, >>> >> > -0.328504066972036, >>> >> > 0, >>> >> > 0, 0, 0, 0, 0, 0, 0, 0, -0.328504066972036, -0.328504066972036, 0, >>> >> > 0, 0, >>> >> > 0, >>> >> > 0, -0.693147180559945, 0, 0, 0, 0, -0.693147180559945, 0, >>> >> > -0.693147180559945, -0.693147180559945, 0, 0, 0, 0, 0, 0, 0, >>> >> > -0.248461359298500, 0, -0.248461359298500, 0, 0, 0, 0, 0, 0, 0, 0, >>> >> > 0, 0, >>> >> > 0, >>> >> > -0.693147180559945, 0, 0, 0, -0.248461359298500, 0, 0, >>> >> > -0.248461359298500, >>> >> > 0, 0, -0.693147180559945, 0, 0, 0, 0, 0, 0), sls_1133745 = >>> >> > c(7.3487612610094, 7.30613534736707, 8.00046685932928, >>> >> > 7.39499697533915, >>> >> > 7.34321275228777, 7.88963522981753, 7.22973732372956, >>> >> > 8.18020882367827, >>> >> > 7.91558910409137, 7.26812538738378, 7.25818677308703, >>> >> > 7.3498801314529, >>> >> > 7.31767047076391, 7.65018297965786, 7.90898624450853, >>> >> > 8.19434808229588, >>> >> > 8.21457885218454, 7.3641669547084, 7.35953128995355, >>> >> > 7.7040815977251, >>> >> > 7.2031227430099, 7.31216649861154, 7.19473710631952, >>> >> > 7.29547045158008, >>> >> > 7.35154326140946, 7.17917838793797, 8.36984796851258, >>> >> > 7.1754285028756, >>> >> > 8.12093952739025, 7.687240693355, 7.2225733180656, 7.36405924490386, >>> >> > 7.3452615426351, 7.25804588634565, 7.23995411189234, >>> >> > 7.21269261931628, >>> >> > 7.3210695020337, 7.8386981421886, 7.373405696737, 7.17968137538913, >>> >> > 7.82070844738252, 7.43464714668485, 7.25244367037469, >>> >> > 7.23292820385588, >>> >> > 7.33945977029367, 7.2810277192681, 7.34753163260778, >>> >> > 7.94492823905035, >>> >> > 7.87310661464411, 7.14571650431343, 7.1568316491227, >>> >> > 8.04067559226017, >>> >> > 7.16502195742237, 7.27351582356697, 7.28802564845575, >>> >> > 8.77370188389706, >>> >> > 7.3224047480958, 6.4802899305943, 7.2810277192681, 7.26784638120198, >>> >> > 7.35638204246829, 7.20306320059336, 7.15358357480953, >>> >> > 8.01940553608164, >>> >> > 8.78195611526034, 7.24723014605046, 7.319056656224, >>> >> > 7.24884540813453, >>> >> > 7.25487774291456, 7.211054762329, 8.21131456124779, >>> >> > 7.19920773459773, >>> >> > 7.9786879985435, 7.28814186671376, 7.27840117311202, >>> >> > 7.2972806890663, >>> >> > 8.14356390282749, 7.34424090367824, 7.32067255061534, >>> >> > 7.34136070803461, >>> >> > 7.76719625525884, 7.20539015753155, 7.19294925869672, >>> >> > 8.12521670148863, >>> >> > 7.3678032648606, 7.32193563210652, 7.16022459567817, >>> >> > 7.34345210203388, >>> >> > 8.11895788519276, 7.81886461055585, 7.76421532379024, >>> >> > 7.93634917049156, >>> >> > 7.20526391225165, 7.2842720651344, 7.28607526956675, >>> >> > 7.29555188093676, >>> >> > 7.19784694602772, 7.18371895847972, 7.21964936074816, >>> >> > 7.87895077602558, >>> >> > 7.26955407354936), dp_1133745 = c(-0.223143551314210, 0, >>> >> > -0.287682072451781, >>> >> > 0, 0, -0.116533816255952, 0, -0.223143551314210, -0.116533816255952, >>> >> > 0, >>> >> > 0, >>> >> > 0, 0, -0.116533816255952, -0.116533816255952, -0.223143551314210, >>> >> > -0.287682072451781, 0, 0, -0.116533816255952, 0, 0, 0, 0, 0, 0, >>> >> > -0.287682072451781, 0, -0.287682072451781, -0.116533816255952, 0, 0, >>> >> > 0, >>> >> > 0, >>> >> > 0, 0, 0, -0.116533816255952, 0, 0, -0.248461359298500, 0, 0, 0, 0, >>> >> > 0, 0, >>> >> > -0.116533816255952, -0.116533816255952, 0, 0, -0.287682072451781, 0, >>> >> > 0, >>> >> > 0, >>> >> > -0.400477566597125, 0, 0, 0, 0, 0, 0, 0, -0.248461359298500, >>> >> > -0.400477566597125, 0, 0, 0, 0, 0, -0.287682072451781, 0, >>> >> > -0.287682072451781, 0, 0, 0, -0.223143551314210, 0, 0, 0, >>> >> > -0.116533816255952, 0, 0, -0.223143551314210, 0, 0, 0, 0, >>> >> > -0.287682072451781, -0.248461359298500, -0.116533816255952, >>> >> > -0.287682072451781, 0, 0, 0, 0, 0, 0, 0, -0.116533816255952, 0))) >>> >> > >>> >> > >>> >> > would anybody have a hint on how to troubleshoot this? >>> >> > >>> >> > Thanks, >>> >> > >>> >> > Laurent >>> >> > >>> >> > >>> >> > >>> >> > >>> >> > ------------------------------------------------------------------------- >>> >> > Sponsored by: SourceForge.net Community Choice Awards: VOTE NOW! >>> >> > Studies have shown that voting for your favorite open source >>> >> > project, >>> >> > along with a healthy diet, reduces your potential for chronic >>> >> > lameness >>> >> > and boredom. Vote Now at http://www.sourceforge.net/community/cca08 >>> >> > _______________________________________________ >>> >> > rpy-list mailing list >>> >> > rpy-list@lists.sourceforge.net >>> >> > https://lists.sourceforge.net/lists/listinfo/rpy-list >>> >> > >>> >> > >>> > >>> > >> > > ------------------------------------------------------------------------- This SF.Net email is sponsored by the Moblin Your Move Developer's challenge Build the coolest Linux based applications with Moblin SDK & win great prizes Grand prize is a trip for two to an Open Source event anywhere in the world http://moblin-contest.org/redirect.php?banner_id=100&url=/ _______________________________________________ rpy-list mailing list rpy-list@lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/rpy-list