Would anyone be interested if I actually implemented support regex by
calling into the C library?

I'm envisioning something like this:

*      "^([a-z]+):([0-9]+)$" regex∆scan "foo:123"*
┏→━━━━━━━━━━┓
┃"foo" "123"┃
┗∊━━━━━━━━━━┛

Regards,
Elias

On 22 June 2016 at 10:35, Xiao-Yong Jin <jinxiaoy...@gmail.com> wrote:

> At some point I really wanted to implement the simplest editor, namely ed
> (it would be much better than any APL system has offered), in APL, but I
> never finished the regex part.  Perhaps it would be a nice exercise for
> someone who’s interested in learning APL to actually implement some of the
> most used Unix command line utilities.
>
> > On Jun 21, 2016, at 8:53 PM, Christian Robert <
> christian.rob...@polymtl.ca> wrote:
> >
> > Hi,
> >
> > A more general purpose search&replace (some bug fixed), take (2), was
> "snr".
> >
> >
> >
> >      )sic
> >      )erase replace
> >      ∇z←s replace p;fr;to;P;i;⎕io
> > ⍝ -------------
> > →(2≤≡s)/ForEach
> > →(2≤⍴⍴s)/Matrix
> > →Vector
> > ⍝ -------------
> > ForEach: z←{⍵ replace p}¨s ◊ →0
> > Matrix: z←⊃ {⍵ replace p}¨ ⊂[2]s ◊ →0
> > ⍝ -------------
> > Vector: p←,¨p ◊ z←s←,s ⋄ ⎕io←1
> > Loop:→((⍴p)<2)/0
> > (fr to)←2↑p ◊ p←2↓p
> > P←¯1↓∧/[1](¯1+⍳⍴fr)⌽fr∘.=z,⎕ucs 1
> > i←0
> > Sanitize: i←i+1
> > →(i>(⍴P)-⍴fr)/Substitute
> > →(0=P[i])/Sanitize
> > ((⊂ i+¯1+⍳⍴fr)⌷P)←(⍴fr)↑1
> > →Sanitize
> > Substitute: P←⌽(P/⍳⍴P)
> > ⊣ {z←((¯1+⍵)↑z),to,(¯1+⍵+⍴fr)↓z}¨P
> > →Loop
> > ∇
> >
> > It should be vector, matrix and embedded objects "aware" (as long as it
> is all text)
> >
> > Syntax:
> >
> >  source replace "this" "by_that" "this" "by_that" [...]  (eg: pairs of
> "from" and "to")
> >
> > Examples:
> >
> >      (⎕cr 'replace') replace 'replace' 'snr' 'ForEach' "Each"
> "Substitute" "Subst" "Sanitize" "Sane"
> > z←s snr p;fr;to;P;i;⎕io
> > ⍝ -------------
> > →(2≤≡s)/Each
> > →(2≤⍴⍴s)/Matrix
> > →Vector
> > ⍝ -------------
> > Each: z←{⍵ snr p}¨s ◊ →0
> > Matrix: z←⊃ {⍵ snr p}¨ ⊂[2]s ◊ →0
> > ⍝ -------------
> > Vector: p←,¨p ◊ z←s←,s ⋄ ⎕io←1
> > Loop:→((⍴p)<2)/0
> > (fr to)←2↑p ◊ p←2↓p
> > P←¯1↓∧/[1](¯1+⍳⍴fr)⌽fr∘.=z,⎕ucs 1
> > i←0
> > Sane: i←i+1
> > →(i>(⍴P)-⍴fr)/Subst
> > →(0=P[i])/Sane
> > ((⊂ i+¯1+⍳⍴fr)⌷P)←(⍴fr)↑1
> > →Sane
> > Subst: P←⌽(P/⍳⍴P)
> > ⊣ {z←((¯1+⍵)↑z),to,(¯1+⍵+⍴fr)↓z}¨P
> > →Loop
> >
> >
> > You can even try "⎕fx" the result if it looks like Ok. (ps: always check
> the result before committing)
> >
> > Substitutions are done in pair sequence for each /vector/matrix/embedded
> obj.
> >
> > So replace order is *very* important.
> >
> > It must still have some bugs, report them to me, with examples if so.
> Will do my best but no promises.
> >
> > Purpose: well, not sure, it was an exercise. I used it on itself to
> finally rename the function and change some variables/labels names.
> >
> >
> > Features: would be fun to pass parameters to each of the /from/to/ like
> maximum number of substitutions to do per line ...
> >          whole words only (taking apl characters as blanks) ... so
> variable "s" won't match last character of variable "pairs"
> >          ... no limits
> >
> >
> > Xtian.
> >
> >
> > On 2016-06-21 12:27, Juergen Sauermann wrote:
> >> Hi Xtian,
> >>
> >> how about this:
> >>
> >> *
> **s←"444This4is4a4444test44with44444lot4of4blanks4444at4beginning4and444end444444"**
> >>      from←"444"**
> >> **      to←"5"**
> >> **
> >> **      ↑⍎')HOST echo "',s,'" | sed s/',from,'/',to,'/g'**
> >> **5This4is4a54test44with544lot4of4blanks54at4beginning4and5end55**
> >> **  **
> >> *I have replaced the *^* in your example because it (and some other
> characters) have a special meaning in
> >> regular expressions. But the special characters can be easily quoted in
> APL if needed, for example:
> >>
> >> *
> s←"^^^This^is^a^^^^test^^with^^^^^lot^of^blanks^^^^at^beginning^and^^^end^^^^^^"
> **
> >> **      ((s='^')/s)←⊂'\^'**
> >> **      ⊃,/s**
> >>
> **\^\^\^This\^is\^a\^\^\^\^test\^\^with\^\^\^\^\^lot\^of\^blanks\^\^\^\^at\^beginning\^and\^\^\^end\^\^\^\^\^\^**
> >> *
> >> /// Jürgen
> >>
> >>
> >> On 06/21/2016 05:12 AM, Christian Robert wrote:
> >>> Hi, it's not a bug but a request for help,
> >>>
> >>>
> >>> suppose
> s="^^^This^is^a^^^^test^^with^^^^^lot^of^blanks^^^^at^beginning^and^^^end^^^^^^"
> >>> suppose from="^^^"
> >>> suppose to="^"
> >>>
> >>> I need a function who can replace ("^^^" to "^"), or ("^" to
> "^^^^^^^^") or ("" to "blabla" with limits of source size)  ie: without
> being lost in infinite loop.
> >>>
> >>>
> >>>
> >>> "s" is a character vector, "from" is a character vector, and "to" is a
> character vector,
> >>> both "s", "from" and "to" can be "".
> >>>
> >>>
> >>>
> >>> I'm pretty sure this idiom already exist.
> >>>
> >>> if you can help me, please do.
> >>>
> >>> Xtian.
> >>>
> >>>
> >>>
> >>>
> >>>
> >>>
> >>
>
>
>

Reply via email to