Hi Vincent,

Ah that sounds really cool. I will also try it out.

Kind regards,
You Zhou

发件人: Vincent Carey <st...@channing.harvard.edu>
日期: 2021年5月27日 星期四 00:53
收件人: Stuart Lee <le...@wehi.edu.au>
抄送: "Kern, Lori" <lori.sheph...@roswellpark.org>, You Zhou 
<youzhoulearn...@gmail.com>, "bioc-devel@r-project.org" 
<bioc-devel@r-project.org>
主题: Re: [Bioc-devel] About the size limitation of the package



On Wed, May 26, 2021 at 5:55 PM Stuart Lee 
<le...@wehi.edu.au<mailto:le...@wehi.edu.au>> wrote:
Hi You and Lori,

Are fitted models in scope for ExperimentHub? I thought it was more for data. 
Maybe there should be a ModelHub for developers to include trained models from 
papers in their packages?

@You: if that model has been fitted in R take a look at 
https://github.com/tidymodels/butcher for some ways of reducing it’s size.

Thanks for these suggestions Stuart!  butcher certainly seems relevant.  I 
tried it out on
the adabag output in You's package was able to effectuate some nice reductions

> tr1 = x$trees[[1]]
> obj_size(tr1)
340,192 B
> obj_size(axe_data(axe_call(axe_fitted(tr1))))
171,896 B

So this in conjunction with xz compression could make this a moot point for 
@You Zhou.

As for the ModelHub, two thoughts.  First, I'd be more inclined at this stage 
to partner with a
system like kipoi.org<http://kipoi.org>, with fitted models archived there and 
retrieved by API as needed by bioc
packages.  I wonder if there are any good examples of this by now.

Second, although I don't feel we have capacity in core to introduce a new Hub 
at just this point,
I think we'd be able to help a motivated community-based team to produce one -- 
if kipoi suggestion isn't
viable -- utilizing some Azure resources that have been contributed by 
Microsoft Genomics.  Interested
parties should write to the list.

I don't see a bioc slack channel devoted to AI/ML and maybe there would be good 
traffic on one.
This "task area" could be added to biocchallenges, or could be a topic for a 
developer forum meeting.


Thanks
Stuart
________________________________
From: Bioc-devel 
<bioc-devel-boun...@r-project.org<mailto:bioc-devel-boun...@r-project.org>> on 
behalf of Kern, Lori <lori.sheph...@roswellpark.org>
Sent: Wednesday, 26 May 2021 10:01 PM
To: You Zhou <youzhoulearn...@gmail.com<mailto:youzhoulearn...@gmail.com>>; 
bioc-devel@r-project.org<mailto:bioc-devel@r-project.org> 
<bioc-devel@r-project.org<mailto:bioc-devel@r-project.org>>
Subject: Re: [Bioc-devel] About the size limitation of the package

Please consider using Experiment Hub to host the large data file. More 
information can be found here:
https://bioconductor.org/packages/devel/bioc/vignettes/AnnotationHub/inst/doc/CreateAHubPackage.html

Cheers,



Lori Shepherd

Bioconductor Core Team

Roswell Park Comprehensive Cancer Center

Department of Biostatistics & Bioinformatics

Elm & Carlton Streets

Buffalo, New York 14263

________________________________
From: Bioc-devel 
<bioc-devel-boun...@r-project.org<mailto:bioc-devel-boun...@r-project.org>> on 
behalf of You Zhou <youzhoulearn...@gmail.com<mailto:youzhoulearn...@gmail.com>>
Sent: Wednesday, May 26, 2021 5:09 AM
To: bioc-devel@r-project.org<mailto:bioc-devel@r-project.org> 
<bioc-devel@r-project.org<mailto:bioc-devel@r-project.org>>
Subject: [Bioc-devel] About the size limitation of the package

Dear Bioc team,


I am compiling a package �m6Aboost� and planning to submit it in the 
Bioconductor. This package using a trained machine learning model to identify 
the correct m6A signals from the miCLIP2 data set (more detail about this 
machine learning model can be found in our paper 
https://www.biorxiv.org/content/10.1101/2020.12.20.423675v1).



Now I meet a problem: the size of this machine learning model is 10 Mb, which 
is bigger than 5 Mb. Since this model is crucial for the package, I was 
wondering whether I can ignore the warning message about the size limitation. 
Thank you : )

Best regards,
You Zhou

        [[alternative HTML version deleted]]



This email message may contain legally privileged and/or confidential 
information.  If you are not the intended recipient(s), or the employee or 
agent responsible for the delivery of this message to the intended 
recipient(s), you are hereby notified that any disclosure, copying, 
distribution, or use of this email message is prohibited.  If you have received 
this message in error, please notify the sender immediately by e-mail and 
delete this email message from your computer. Thank you.
        [[alternative HTML version deleted]]

_______________________________________________
Bioc-devel@r-project.org<mailto:Bioc-devel@r-project.org> mailing list
https://stat.ethz.ch/mailman/listinfo/bioc-devel

        [[alternative HTML version deleted]]

_______________________________________________
Bioc-devel@r-project.org<mailto:Bioc-devel@r-project.org> mailing list
https://stat.ethz.ch/mailman/listinfo/bioc-devel

The information in this e-mail is intended only for the person to whom it is
addressed. If you believe this e-mail was sent to you in error and the e-mail
contains patient information, please contact the Partners Compliance HelpLine at
http://www.partners.org/complianceline . If the e-mail was sent to you in error
but does not contain patient information, please contact the sender and properly
dispose of the e-mail.

        [[alternative HTML version deleted]]

_______________________________________________
Bioc-devel@r-project.org mailing list
https://stat.ethz.ch/mailman/listinfo/bioc-devel

Reply via email to