[Cloud] Re: LLM services

2025-01-10 Thread Platonides
Huji wrote:
> Even free, lightweight LLMs (like LLaMa) could be helpful

LLaMa itself is not under a free license. Let's call it an "almost-free
license".
So, I'm not sure if it would be acceptable to run it, given the requisite
that
> All code in the Tools project must be published under an OSI approved
open source license

I think the debate would be whether the models (in this case LLaMa) is
"code" or "data". It might be considered both ways.

Separatedly, regarding the resources point mentioned by Siddhart:
> LLMs might also require significant memory/CPU resources and/or system
software not available to tools

LLMs are very memory-hungry, but what they would benefit most would be from
GPU memory. Of which we probably don't have any in cloud. The ideal setup
would probably be a specific host in cloud providing a LLM service and
offering that to tools and VMs.

(*) It's still possible to run LLMs without GPU and get acceptable results,
although the time required and amount of requests that can be fulfilled (as
well as number of models loaded) would be much more limited.
___
Cloud mailing list -- cloud@lists.wikimedia.org
List information: 
https://lists.wikimedia.org/postorius/lists/cloud.lists.wikimedia.org/


[Cloud] Re: LLM services

2025-01-10 Thread Arturo Borrero Gonzalez

On 1/10/25 05:21, Huji Lee wrote:

Hi all,

Are there any LLMs available on Cloud services, or are there any plans for them? 
I think there are many possible use cases. Even free, lightweight LLMs (like 
LLaMa) could be helpful, e.g. in bots that review edits, categorize pages, etc.




Hi Huji,

No, there is none at this point in time.

But this has been in my radar for some time now, and it would be useful to know 
particular use cases and needs before we can work on any implementation.


This email is already helpful (to know there is at least one person interested). 
A phabricator ticket would also be useful, to help inform future actions. Would 
you mind creating one?


regards.
___
Cloud mailing list -- cloud@lists.wikimedia.org
List information: 
https://lists.wikimedia.org/postorius/lists/cloud.lists.wikimedia.org/


[Cloud] Re: LLM services

2025-01-10 Thread Siddharth VP
Since it's now possible in Toolforge to expose services at custom ports[0],
I think it should already be possible for someone to host an LLM service
for other tools to use.

I could be wrong though, as LLMs might also require significant memory/CPU
resources and/or system software not available to tools.

[0]:
https://wikitech.wikimedia.org/wiki/Help:Toolforge/Jobs_framework#Configuring_internal_domain_names_for_your_jobs

On Fri, 10 Jan 2025 at 16:31, Francesco Negri  wrote:

> > A phabricator ticket would also be useful, to help inform future
> actions. Would
> > you mind creating one?
>
> We have https://phabricator.wikimedia.org/T336905 that seems to cover
> the same topic.
>
> --
> Francesco Negri (he/him) -- IRC: dhinus
> Site Reliability Engineer, Cloud Services team
> Wikimedia Foundation
> ___
> Cloud mailing list -- cloud@lists.wikimedia.org
> List information:
> https://lists.wikimedia.org/postorius/lists/cloud.lists.wikimedia.org/
>
___
Cloud mailing list -- cloud@lists.wikimedia.org
List information: 
https://lists.wikimedia.org/postorius/lists/cloud.lists.wikimedia.org/


[Cloud] [Mediawiki-api-announce] [Breaking Change] Fix of 'sub' field format in the OAuth identity endpoint

2025-01-10 Thread Gergo Tisza
The OAuth identity endpoint (the Special:OAuth/identify special page for
OAuth 1, the oauth2/resource/profile REST API endpoint for OAuth 2) used to
return an incorrectly formatted JSON web token, where value of the 'sub' field
(the user's CentralAuth central user ID) was an integer, rather than a
string as required by the JWT spec.

Due to the latest release of the pyJWT library getting more strict about
format validation, this started causing errors in various tools recently.
As of this week, this behavior has been fixed for Wikimedia sites, and it
has been fixed in all maintained versions (MediaWiki 1.39 and upwards) of
the OAuth MediaWiki extension which provides this API. Because the old
behavior was a spec violation and caused errors, and it's unlikely the
correct behavior would break clients, we are making this fix as a breaking
change rather than following the usual API deprecation policies.

For more details and discussion see
https://phabricator.wikimedia.org/T382139
___
Mediawiki-api-announce mailing list -- 
mediawiki-api-annou...@lists.wikimedia.org
To unsubscribe send an email to mediawiki-api-announce-le...@lists.wikimedia.org
___
Cloud mailing list -- cloud@lists.wikimedia.org
List information: 
https://lists.wikimedia.org/postorius/lists/cloud.lists.wikimedia.org/


[Cloud] [Cloud-announce] Re: mild dns changes in cloud-vps on Wednesday

2025-01-10 Thread Andrew Bogott
This didn't happen! Further testing suggests that my original assertion 
("now reaching public IPs works as expected") is not as true as I 
thought it was.



On 1/6/25 9:26 AM, Andrew Bogott wrote:

Tl;dr:

Please let me know if you encounter changes in network connectivity 
between cloud-vps hosts, specifically when routing to floating IPs.



Complicated explanation:

A previous version of our network setup prevented routing between 
private and floating IPs within cloud-vps. To work around this we run 
an agent called 'labs-ip-aliaser' which insertes corresponding private 
IPs for any dns lookup initiating from within cloud-vps that would 
otherwise return a floating IP.


So, for instance, if you did something like

$ ping hostname.wmcloud.org

you would actually be pinging vmname.project.eqiad.wikimedia.cloud 
instead, because that worked and pinging the wmcloud.org address did not.


We have since refactored the network so that now reaching public IPs 
works as expected. So, later in the week I'm going to remove the 
labs-ip-aliaser hack so that floating-ip-associated hostnames can 
resolve to the actual floating IPs as expected. This change should be 
almost entirely invisible to cloud-vps services, but please notify me 
if you find an interesting edge-case!



https://phabricator.wikimedia.org/T374129

https://gerrit.wikimedia.org/r/c/operations/puppet/+/1105443/7




___
Cloud-announce mailing list -- cloud-annou...@lists.wikimedia.org
List information: 
https://lists.wikimedia.org/postorius/lists/cloud-announce.lists.wikimedia.org/
___
Cloud mailing list -- cloud@lists.wikimedia.org
List information: 
https://lists.wikimedia.org/postorius/lists/cloud.lists.wikimedia.org/


[Cloud] Re: LLM services

2025-01-10 Thread Francesco Negri
> A phabricator ticket would also be useful, to help inform future actions. 
> Would
> you mind creating one?

We have https://phabricator.wikimedia.org/T336905 that seems to cover
the same topic.

-- 
Francesco Negri (he/him) -- IRC: dhinus
Site Reliability Engineer, Cloud Services team
Wikimedia Foundation
___
Cloud mailing list -- cloud@lists.wikimedia.org
List information: 
https://lists.wikimedia.org/postorius/lists/cloud.lists.wikimedia.org/