Is Git LFS an option?
https://www.atlassian.com/git/tutorials/git-lfs#installing-git-lfs
Needs an LFS-aware host e.g. Bitbucket; I don't know what the Apache
hosting setup is like.


On Fri, Jun 3, 2022 at 9:31 AM Finan, Sean
<sean.fi...@childrens.harvard.edu.invalid> wrote:

> Hi Tim,
>
> >we ran into issues in previous attempts at migration with the large file
> sizes in our repo
>
> Indeed we did, and over the years I have had thoughts on that.
>
> Those large files are large ml models, which are (mostly) static,
> replaceable/interchangeable, not always necessary, and in separate resource
> (-res) modules separated from code modules.
>
> When I was a ctakes newby really disliked the separation of code from
> resources by entirely separate -res modules.  Since then, through working
> on projects that use ctakes code but not (huge) resources as dependencies,
> I have realized the wisdom of the modular separation.  In fact, I put a
> -huge- model in its own -res module so that I could <exclude> it from a
> ctakes-dependent project, saving compile (download) time and disk space.
> Like you, I don't like to "download the internet" with maven   ;^)
>
> Right now we have the ner dictionaries in sourceforge, not the apache
> repos.  While this is done for legal reasons it has worked pretty well.
>
> I think that we could maintain an apache SVN repo of -res modules
> containing only huge model files.   I am guessing that we would have to
> make it a "side/sub project" to maintain a separate repo (jenkins build,
> etc.).
>
> Anyway, it would give us the freedom to use a github repo for code (and
> non-model resources) without users needing to go through the github
> large-file workflow, which I see as a barrier to entry.
>
> Thoughts?
>
> ________________________________________
> From: Miller, Timothy <timothy.mil...@childrens.harvard.edu.INVALID>
> Sent: Thursday, June 2, 2022 6:21 PM
> To: dev@ctakes.apache.org
> Subject: Re: Apache cTAKES GitHub mirror is stuck in 2019 [EXTERNAL]
> [SUSPICIOUS] [SUSPICIOUS]
>
> * External Email - Caution *
>
>
> My recollection was that we ran into issues in previous attempts at
> migration with the large file sizes in our repo.
> Tim
>
>
> On Thu, 2022-06-02 at 20:55 +0000, Finan, Sean wrote:
>
> * External Email - Caution *
>
>
>
> Thank you Gandhi and Richard.
>
>
> Unless somebody else beats me to it I will perform some research and see
> what approaches can be used and which might be best.  In the end the cTAKES
> Project Management Committee will need to vote for any action as sweeping
> as moving to github.
>
>
> Sean
>
> ________________________________________
>
> From: gandhi rajan <
>
> <mailto:gandhiraja...@gmail.com>
>
> gandhiraja...@gmail.com
>
> >
>
> Sent: Thursday, June 2, 2022 9:02 AM
>
> To:
>
> <mailto:dev@ctakes.apache.org>
>
> dev@ctakes.apache.org
>
>
> Subject: Re: Apache cTAKES GitHub mirror is stuck in 2019 [EXTERNAL]
>
>
> * External Email - Caution *
>
>
>
> Hi Sean,
>
>
> If we are sure that the SVN has all the latest changes and active
>
> development is primarily on SVN, then why don't we request a fresh git
>
> repository and push all the changes over there.
>
>
> More info on
>
> <
> https://urldefense.com/v3/__https://infra.apache.org/svn-to-git-migration.html__;!!NZvER7FxgEiBAiR_!rXFMCtlZM4NpDPkgzeq-X2pj1rNwzQNTpZkMZXDoYiZKdJp0n4tDY6q9IcsGRPGrA6KhvmouV_1y_txDVok-tGy3dVLaqefQlQ$
> >
>
>
> https://urldefense.com/v3/__https://infra.apache.org/svn-to-git-migration.html__;!!NZvER7FxgEiBAiR_!rXFMCtlZM4NpDPkgzeq-X2pj1rNwzQNTpZkMZXDoYiZKdJp0n4tDY6q9IcsGRPGrA6KhvmouV_1y_txDVok-tGy3dVLaqefQlQ$
>
>
>
> On Thu, Jun 2, 2022 at 5:52 PM Finan, Sean
>
> <
>
> <mailto:sean.fi...@childrens.harvard.edu.invalid>
>
> sean.fi...@childrens.harvard.edu.invalid
>
> > wrote:
>
>
> Hi Richard, you bring up a valid concern.
>
>
> cTAKES Developers:
>
>
> The Apache Foundation has had an initiative to "move" all projects to
>
> GitHub for some time now.
>
>
> I don't know much about how this is done.  If anybody out there has
>
> knowledge or experience that they can pass on, please share.
>
>
> Thanks,
>
> Sean
>
> ________________________________________
>
> From: Richard Eckart de Castilho <
>
> <mailto:r...@apache.org>
>
> r...@apache.org
>
> >
>
> Sent: Thursday, June 2, 2022 3:39 AM
>
> To:
>
> <mailto:dev@ctakes.apache.org>
>
> dev@ctakes.apache.org
>
>
> Subject: Apache cTAKES GitHub mirror is stuck in 2019 [EXTERNAL]
>
>
> * External Email - Caution *
>
>
>
> Hi,
>
>
> it appears that the GitHub mirror of Apache cTAKES may be stuck.
>
>
> When I check the svn log of
>
> <
> https://urldefense.com/v3/__https://svn.apache.org/repos/asf/ctakes/trunk/__;!!NZvER7FxgEiBAiR_!pH7M7eePuLp7ejJW09QaoQOZsyoj1CD8QySUDx79FZmu6CUuooFcB0dk0hJQ7aI7G3Sq3Mz_GzoiL9XZi-zSEw$
> >
>
>
> https://urldefense.com/v3/__https://svn.apache.org/repos/asf/ctakes/trunk/__;!!NZvER7FxgEiBAiR_!pH7M7eePuLp7ejJW09QaoQOZsyoj1CD8QySUDx79FZmu6CUuooFcB0dk0hJQ7aI7G3Sq3Mz_GzoiL9XZi-zSEw$
>
>
> , I can
>
> see activity as recent as May 2022.
>
>
> However, on GitHub, I can only see stale branches:
>
>
>
> <
> https://urldefense.com/v3/__https://github.com/apache/ctakes/branches__;!!NZvER7FxgEiBAiR_!pH7M7eePuLp7ejJW09QaoQOZsyoj1CD8QySUDx79FZmu6CUuooFcB0dk0hJQ7aI7G3Sq3Mz_GzoiL9Uu2s-59w$
> >
>
>
> https://urldefense.com/v3/__https://github.com/apache/ctakes/branches__;!!NZvER7FxgEiBAiR_!pH7M7eePuLp7ejJW09QaoQOZsyoj1CD8QySUDx79FZmu6CUuooFcB0dk0hJQ7aI7G3Sq3Mz_GzoiL9Uu2s-59w$
>
>
>
> Wouldn't it be good if the GitHub mirror would be kept up-to-date?
>
>
> Best,
>
>
> -- Richard
>
>
>
>
> --
>
> Regards,
>
> Gandhi
>
>
> "The best way to find urself is to lose urself in the service of others
> !!!"
>

Reply via email to