What is in a name? :)

"Which SQL feature you are talking about here, that forces single reducer
and hence should not be supported?"

Joining on anything besides = comes to mind

Pretty sure the query mentioned here will not work (without being
re-written)
http://en.wikipedia.org/wiki/SQL

SELECT isbn, title, price
 FROM Book
 WHERE price < (SELECT AVG(price) FROM Book)
 ORDER BY title;


"Using the term HiveQL conveys the wrong idea"

I disagree. HiveQL is HiveQL. The only idea it can convey is itself. Our
wiki describes this perfectly
https://cwiki.apache.org/confluence/display/Hive/Home

"Hive defines a simple SQL-like query language, called QL, that enables
users familiar with SQL to query the data. "

Hive-SQL looks like it is trying to convey the idea that hive supports
extensions like T-SQL http://en.wikipedia.org/wiki/Transact-SQL or PL/SQL.
http://www.oracle.com/technetwork/database/features/plsql/index.html.
T-SQL and PLSQL are both adding extensions on top of a "mostly" SQL
compliant database. Instead we would be using Hive-SQL to sugar coat/hide
the fact that Hive is not SQL complaint (and not even close IMHO).

Lessons from my mother.
"So I don't expect hive to be 100% sql complaint, but when we add new
features, we always aim to be as compliant with SQL as possible."
You can't be half a saint.

"considering how much other databases deviate from the standard -
http://troels.arvin.dk/db/rdbms/ . See how much deviation is there for
example in  'limit clause' or the data types supported (and details of
data type support) -"
If all your friends jumped off a bridge would you do it?

If I had to suggest another name for HiveQL. I might call it Almost-SQL++.



On Tue, Jul 2, 2013 at 10:19 PM, Thejas Nair <the...@hortonworks.com> wrote:

> On Tue, Jul 2, 2013 at 6:52 PM, Edward Capriolo <edlinuxg...@gmail.com>
> wrote:
> > "I see that the hive community intends to make hive SQL compliant and is
> on
> > that path." . I was not under the general impression that we are on that
> > path.
>
> Well, whenever we try to decide what the behavior should be, we refer
> to what SQL standard says. So I think we are certainly on that path,
> and not going the other way.
>
> > Also some SQL operations can not be executed efficiently in map-reduce
> and
> > should IMHO not be added to the language. IE if doing the operation means
> > all data must be shuffled to a single reducer its not something hive
> should
> > do.
>
> Which SQL feature you are talking about here, that forces single
> reducer and hence should not be supported?
> But I agree in general that it does not make sense for hive to support
> ALL sql features. For example, supporting all the transaction
> semantics or updating a single row in a table does not make sense.
>
> So I don't expect hive to be 100% sql complaint, but when we add new
> features, we always aim to be as compliant with SQL as possible. I
> think we should convey this to our users.
>
> Using the term HiveQL conveys the wrong idea. But calling it SQL not a
> lie, considering how much other databases deviate from the standard -
> http://troels.arvin.dk/db/rdbms/ . See how much deviation is there for
> example in  'limit clause' or the data types supported (and details of
> data type support) -
> http://troels.arvin.dk/db/rdbms/#select-limit-simple .
>
> > I do not think hive should be mis-representing itself. Simply put, hive
> is
> > NOT SQL complaint we should NOT call the language SQL.
>
> I think calling it HIVE-SQL conveys the idea very well.
>
>
> > Doing so would only only lead to confusion for people new to the project.
> > Imagine you are new to hive:
> > You read "SQL"in the documentation.
> > You run an "SQL" standard create table query, it fails.
>
> See the above page. Any given SQL "standard" create table query is not
> guaranteed to work in most of the widely used commercial "SQL"
> databases.
>

Reply via email to