[ 
https://issues.apache.org/jira/browse/FLINK-9990?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=16569043#comment-16569043
 ] 

ASF GitHub Bot commented on FLINK-9990:
---------------------------------------

walterddr commented on a change in pull request #6448: [FLINK-9990] [table] Add 
regex_extract supported in TableAPI and SQL
URL: https://github.com/apache/flink/pull/6448#discussion_r207699504
 
 

 ##########
 File path: docs/dev/table/sql.md
 ##########
 @@ -1842,6 +1842,16 @@ RPAD(text string, len integer, pad string)
     <tr>
       <td>
         {% highlight text %}
+REGEX_EXTRACT(str string, regex string, extractIndex integer)
+{% endhighlight %}
+      </td>
+      <td>
+        <p>Returns the string str extracted using specified regex pattern and 
index. If str or regex is null, returns null. E.g. 
<code>REGEX_EXTRACT('foothebar', 'foo(.*?)(bar)', 2)</code> returns 
<code>bar</code>.</p>
 
 Review comment:
   Good point. Another question: does `REGEX_EXTRACT` return an array of 
Strings, similar to how Pattern/Matcher in Java exposes all capturing groups, 
or is the result concatenated? If it is concatenated, what is the delimiter? 
(In the code it seems only a `String` type is returned.)
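
For reference, here is a minimal sketch of how plain `java.util.regex` handles the example from the docs. It is not the implementation in this PR: the class `RegexGroupsDemo` and the helpers `extract` and `allGroups` are hypothetical names, and returning null on a non-match or an out-of-range index is an assumption, since the docs only specify the null-input case.

```java
import java.util.ArrayList;
import java.util.List;
import java.util.regex.Matcher;
import java.util.regex.Pattern;

class RegexGroupsDemo {

    // Hypothetical helper: returns the text captured by group `index` of the
    // first match. Returns null for null inputs (as the docs state) and, by
    // assumption, also when nothing matches or the index is out of range.
    static String extract(String str, String regex, int index) {
        if (str == null || regex == null) {
            return null;
        }
        Matcher m = Pattern.compile(regex).matcher(str);
        if (!m.find() || index < 0 || index > m.groupCount()) {
            return null;
        }
        return m.group(index);
    }

    // Hypothetical helper: collects every group of the first match,
    // where index 0 is the whole match.
    static List<String> allGroups(String str, String regex) {
        List<String> groups = new ArrayList<>();
        if (str == null || regex == null) {
            return groups;
        }
        Matcher m = Pattern.compile(regex).matcher(str);
        if (m.find()) {
            for (int i = 0; i <= m.groupCount(); i++) {
                groups.add(m.group(i));
            }
        }
        return groups;
    }

    public static void main(String[] args) {
        System.out.println(extract("foothebar", "foo(.*?)(bar)", 2)); // bar
        System.out.println(allGroups("foothebar", "foo(.*?)(bar)"));  // [foothebar, the, bar]
    }
}
```

With an explicit index the result is a single `String`, so no delimiter is involved; an array would only come into play if all groups were returned at once, as in `allGroups`.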

----------------------------------------------------------------
This is an automated message from the Apache Git Service.
To respond to the message, please log on GitHub and use the
URL above to go to the specific comment.
 
For queries about this service, please contact Infrastructure at:
us...@infra.apache.org


> Add regex_extract supported in TableAPI and SQL
> -----------------------------------------------
>
>                 Key: FLINK-9990
>                 URL: https://issues.apache.org/jira/browse/FLINK-9990
>             Project: Flink
>          Issue Type: Sub-task
>          Components: Table API & SQL
>            Reporter: vinoyang
>            Assignee: vinoyang
>            Priority: Minor
>              Labels: pull-request-available
>
> regex_extract is a very useful function: it returns a string based on a regex 
> pattern and an index.
> For example: 
> {code:java}
> regexp_extract('foothebar', 'foo(.*?)(bar)', 2) // returns 'bar'
> {code}
> It is provided as a UDF in Hive; for more details, see [1].
> [1]: https://cwiki.apache.org/confluence/display/Hive/LanguageManual+UDF



--
This message was sent by Atlassian JIRA
(v7.6.3#76005)
