Re: [RESULT][VOTE] Release Apache Arrow 6.0.1 - RC1

2021-11-18 Thread Paul Taylor
v6.0.1 JS packages have been uploaded. On 11/18/21 2:28 AM, Sutou Kouhei wrote: Hi, Could someone help post release tasks? Especially, JavaScripts, Conda, R, vcpkg and document. 2. [done] upload source 3. [done] upload binaries 4. [done] update website 5. [kou] upload ruby gems 6. [ ] upl

Re: [VOTE] Remove compute from Arrow JS

2021-11-02 Thread Paul Taylor
+1 from me as well > On Oct 27, 2021, at 6:58 PM, Brian Hulette wrote: > >  > +1 > > I don't think there's much reason to keep the compute code around when > there's a more performant, easier to use alternative. I think the only unique > feature of the arrow compute code was the ability to o

Re: [VOTE] Release Apache Arrow 4.0.0 - RC3

2021-04-27 Thread Paul Taylor
ies 6. [Paul] upload js packages 10. [Uwe] update conda recipes 12. [todo] update homebrew packages 14. [Kou] update msys2 15. [Neal] update R packages 16. [in-pr|Krisztian] update docs On Tue, Apr 27, 2021 at 2:42 PM Krisztián Szűcs wrote: On Tue, Apr 27, 2021 at 2:21 PM Paul Taylor wrote:

Re: [VOTE] Release Apache Arrow 4.0.0 - RC3

2021-04-27 Thread Paul Taylor
ERNA `lerna.json` does not exist, have you run `lerna init`? I assume the lerna configuration should be updated including the version number. @Paul Taylor could you please handle the JS release? On Mon, Apr 26, 2021 at 9:01 PM Krisztián Szűcs wrote: The current status of the post-release tasks:

Re: [JS] Exploring usage of apache arrow at my company for complex table rendering

2021-02-26 Thread Paul Taylor
Hi Michael, The answer to your question about metadata will likely be application-specific. For small amounts of metadata (i.e. communicating a bounding box of included geometry), there isn't much room for optimization, so a string could be fine. For larger amounts of metadata (or other constrai

Re: [javascript] streaming IPC examples?

2021-01-26 Thread Paul Taylor
e on GitHub if you have more questions about the Grafana integration. Best, Paul On 1/24/21 4:21 PM, Brian Hulette wrote: +Paul Taylor <mailto:ptay...@apache.org> would your work with whatwg streams be relevant here? Are there any examples that would be useful for Ryan? Brian On

Re: [JS] BigNum toJSON returns a string with quotations in it

2020-10-28 Thread Paul Taylor
emoved. BigNum is a decimal derivative and decimals in > JSON have quotes. > > This is needed to distinguish values like "2.1" vs "2.100", etc. which are > decimal(2,1) vs decimal(4,3) datatypes. > > -Original Message- > From: Paul Tayl

Re: [JS] BigNum toJSON returns a string with quotations in it

2020-10-28 Thread Paul Taylor
Yes, the quotes should be removed. I'd recommend using the binary IPC format to send data to the browser though. Deserializing to JSON is expensive, loses the benefits of features like dictionary encoding, and reconstructing types on the receiving end is error-prone (as illustrated here). Paul

Re: MEAN Stack Use-case understanding

2020-09-29 Thread Paul Taylor
Hi Thomas, You can read CSVs in the browser using the browser's File input APis and an appropriate CSV library. The CSV library should be able to parse rows into JS objects, which can then be passed to the Arrow Struct Builder for serialization. In this example[1] I'm parsing the first row o

Re: JavaScript/TypeScript lib - officially supported?

2020-09-14 Thread Paul Taylor
Hi Tim, I've started working on it in a branch on my fork[1] in the evenings, but it's slow going as I've been busy with work/moving. I'll try to get it finished up and PR'd this week, there's not much left to do. Thanks, Paul 1. https://github.com/trxcllnt/arrow/tree/typescript-3.9 On 9

Re: Upcoming JS fixes and release timeline

2020-07-10 Thread Paul Taylor
On Wed, Jul 1, 2020 at 9:23 AM Paul Taylor <mailto:ptaylor.apa...@gmail.com>> wrote: The TypeScript compiler has made breaking changes in recent releases, meaning we can't easily upgrade past 3.5 and projects on 3.6+ can't compile our types. I'm working

Upcoming JS fixes and release timeline

2020-07-01 Thread Paul Taylor
The TypeScript compiler has made breaking changes in recent releases, meaning we can't easily upgrade past 3.5 and projects on 3.6+ can't compile our types. I'm working on upgrading our tsc dependency to 3.9. The fixes could include a few backwards-incompatible API changes, and might not be do

Re: [Format][C++] Offering limited support for unsigned dictionary indices

2020-06-26 Thread Paul Taylor
ch language supports them. The changes needed to support unsigned integers in C++ are probably not that invasive but I haven't taken a close look at it yet On Fri, Jun 26, 2020 at 3:23 PM Paul Taylor wrote: If positive integers are expected, I'm in favor of supporting unsigned inde

Re: [Format][C++] Offering limited support for unsigned dictionary indices

2020-06-26 Thread Paul Taylor
If positive integers are expected, I'm in favor of supporting unsigned index types. I was surprised at Arrow C++ restriction on signed indices in the RAPIDS thread, perhaps it's newer than when I ported the logic in JS. Based on the flatbuffer schemas, dictionary indices could technically be a

Re: [JavaScript] how to set column name after creation?

2020-06-26 Thread Paul Taylor
You can also use the `Field.prototype.clone()` method[1] like this to further reduce the boilerplate: function renameColumn(col, new_name) {   return Column.new(col.field.clone(new_name), col.chunks); } 1. https://github.com/apache/arrow/blob/master/js/src/schema.ts#L139-L146 On 6/26/20 7:54

Re: [DISCUSS] Need for Arrow 0.17.1 patch release (binary only?)

2020-05-05 Thread Paul Taylor
Would it be possible to include the variant.hpp update for nvcc in 0.17.1? Thanks, Paul On 5/4/20 4:17 PM, Wes McKinney wrote: hi folks, We have accumulated a few regressions ARROW-8657 https://github.com/apache/arrow/pull/7089 ARROW-8694 https://

[jira] [Created] (ARROW-6886) [C++] arrow::io header nvcc compiler warnings

2019-10-14 Thread Paul Taylor (Jira)
Paul Taylor created ARROW-6886: -- Summary: [C++] arrow::io header nvcc compiler warnings Key: ARROW-6886 URL: https://issues.apache.org/jira/browse/ARROW-6886 Project: Apache Arrow Issue Type

Re: [RESULT] [VOTE] Alter Arrow binary protocol to address 8-byte Flatbuffer alignment requirements (2nd vote)

2019-08-23 Thread Paul Taylor
I'll do the JS updates. Is it safe to validate against the Arrow C++ integration tests? On 8/22/19 7:28 PM, Micah Kornfield wrote: I created https://issues.apache.org/jira/browse/ARROW-6313 as a tracking issue with sub-issues on the development work. So far no-one has claimed Java and Javascr

[jira] [Created] (ARROW-6053) [Python] RecordBatchStreamReader::Open2 cdef type signature doesn't match C++

2019-07-26 Thread Paul Taylor (JIRA)
Paul Taylor created ARROW-6053: -- Summary: [Python] RecordBatchStreamReader::Open2 cdef type signature doesn't match C++ Key: ARROW-6053 URL: https://issues.apache.org/jira/browse/ARROW-6053 Pr

Re: [Discuss] Do a 0.15.0 release before 1.0.0?

2019-07-24 Thread Paul Taylor
ssing a buffer to the reader). I'm not sure I understand this suggestion: 1. Wouldn't this cause old readers to miss the last 4 bytes of the buffer (and provide meaningless bytes at the beginning). 2. The current proposal on the other thread is to have the pattern be <0xffff> Than

Building on Arrow CUDA

2019-07-24 Thread Paul Taylor
I'm looking at options to replace the custom Arrow logic in cuDF with Arrow library calls. What's the recommended way to declare a dependency on pyarrow/arrowcpp with CUDA support? I see in the docs it says to build from source, but that's only an option for an (advanced) end-user. And buildin

Re: [Discuss] Do a 0.15.0 release before 1.0.0?

2019-07-23 Thread Paul Taylor
+1 for a 0.15.0 before 1.0 if we go ahead with this. I'm curious to hear other's thoughts about compatibility. I think we should avoid breaking backwards compatibility if possible. It's common for apps/libs to be pinned on specific Arrow versions, and I worry it'd cause a lot of work for downs

Re: Error building cuDF on new Arrow with std::variant backport

2019-07-19 Thread Paul Taylor
/rapidsai/cudf/blob/branch-0.9/cpp/src/io/convert/csr/cudf_to_csr.cu#L30 On 7/19/19 8:41 PM, Micah Kornfield wrote: Hi Paul, This actually looks like it might be a problem with arrow-4800.   Did the build of arrow use c++14 or c++11? Thanks, Micah On Friday, July 19, 2019, Paul Taylor

Error building cuDF on new Arrow with std::variant backport

2019-07-19 Thread Paul Taylor
We're updating cuDF to Arrow 0.14 but encountering errors building that look related to PR #4259 . We can build Arrow itself, but we can't build cuDF when we include Arrow headers. Using C++ 14 and have tried gcc/g++ 5, 7, and clang. Has anyone seen t

Re: [Discuss] Format additions to Arrow for sparse data and data integrity

2019-07-06 Thread Paul Taylor
Hi Micah, Similar to Jacques I'm not disagreeing, but wondering if they belong in Arrow vs. can be done externally. I'm mostly interested in changes that might impact SIMD processing, considering Arrow's already made conscious design decisions to trade memory for speed. Apologies in advance if

Re: JS - proper way to instantiate a RecordBatch with nulls

2019-06-21 Thread Paul Taylor
Hi Jenny, In the 0.13 you have to pre-allocate the null bitmap and data buffers ahead of time, then use `vec.set(idx, val)` to write each value in. In 0.14 you can instead use the new Builders from PR #4476 . These will create and resize the underly

[jira] [Created] (ARROW-5537) [JS] Support delta dictionaries in RecordBatchWriter and DictionaryBuilder

2019-06-09 Thread Paul Taylor (JIRA)
Paul Taylor created ARROW-5537: -- Summary: [JS] Support delta dictionaries in RecordBatchWriter and DictionaryBuilder Key: ARROW-5537 URL: https://issues.apache.org/jira/browse/ARROW-5537 Project: Apache

[jira] [Created] (ARROW-5396) [JS] Ensure reader and writer support files and streams with no RecordBatches

2019-05-22 Thread Paul Taylor (JIRA)
Paul Taylor created ARROW-5396: -- Summary: [JS] Ensure reader and writer support files and streams with no RecordBatches Key: ARROW-5396 URL: https://issues.apache.org/jira/browse/ARROW-5396 Project

Re: [Discuss][Format] Zero size record batches

2019-05-21 Thread Paul Taylor
I'd be happy to PR a fix for JS today if someone can link me to Wes's PR. On 5/21/19 11:02 AM, Wes McKinney wrote: I agree also. As a practical use case, the results of a request made with Arrow Flight might yield an empty result set. I'm not sure if this needs to be formally noted in the specif

[jira] [Created] (ARROW-5115) [JS] Implement the Vector Builders

2019-04-03 Thread Paul Taylor (JIRA)
Paul Taylor created ARROW-5115: -- Summary: [JS] Implement the Vector Builders Key: ARROW-5115 URL: https://issues.apache.org/jira/browse/ARROW-5115 Project: Apache Arrow Issue Type: New Feature

[jira] [Created] (ARROW-5100) [JS] Writer swaps byte order if buffers share the same underlying ArrayBuffer

2019-04-02 Thread Paul Taylor (JIRA)
Paul Taylor created ARROW-5100: -- Summary: [JS] Writer swaps byte order if buffers share the same underlying ArrayBuffer Key: ARROW-5100 URL: https://issues.apache.org/jira/browse/ARROW-5100 Project

Re: [VOTE] Release Apache Arrow JS 0.4.1 - RC1

2019-03-24 Thread Paul Taylor
Yes, the solution here is to publish all the packages again, and ignore errors from ones that have already been published via: npx lerna exec --no-bail -- npm publish Best, Paul > On Mar 24, 2019, at 2:10 PM, Krisztián Szűcs > wrote: > > Hi Kou, > > Paul has already added me, and I was tryi

Re: [VOTE] Release Apache Arrow JS 0.4.1 - RC0

2019-03-20 Thread Paul Taylor
+1 non-binding Ran `dev/release/js-verify-release-candidate.sh 0.4.1 0` on MacOS High Sierra w/ node v11.6.0 On Wed, Mar 20, 2019 at 5:21 PM Kouhei Sutou wrote: > +1 (binding) > > I ran the followings on Debian GNU/Linux sid: > > * dev/release/js-verify-release-candidate.sh 0.4.1 0 > > with:

[jira] [Created] (ARROW-4976) [JS] RecordBatchReader should reset its Node/DOM streams

2019-03-20 Thread Paul Taylor (JIRA)
Paul Taylor created ARROW-4976: -- Summary: [JS] RecordBatchReader should reset its Node/DOM streams Key: ARROW-4976 URL: https://issues.apache.org/jira/browse/ARROW-4976 Project: Apache Arrow

Union typeIds width

2019-03-20 Thread Paul Taylor
I noticed the the DenseUnion docs[1] says the typeIds buffer is 8-bit signed integers, but in the flatbuffer schema[2] it's typed as int (and flatc generates a function that returns an Int32Array). How are the other implementations treating this buffer, and should we update the docs or the fla

Re: Timeline for 0.13 Arrow release

2019-03-19 Thread Paul Taylor
I agree, the JS has matured a lot in the last few months. I think it's ready to join the regular Arrow releases. Let me know if I can help integrate the publish scripts :-) The two main things in progress are docs + Vector Builders, neither of which should block this release. We're going to

Re: [DISCUSS] Format changes: process and requirements

2019-03-17 Thread Paul Taylor
Hi Jacques, I think we should have two complete implementations. I don't think having one feature in C# and Go and another in JavaScript and Rust does justice to the project goals. Agree 100%. We may already be in this situation with the DictionaryBatch "isDelta" flag. I haven't checked the C

[jira] [Created] (ARROW-4781) [JS] Ensure empty data initializes empty typed arrays

2019-03-05 Thread Paul Taylor (JIRA)
Paul Taylor created ARROW-4781: -- Summary: [JS] Ensure empty data initializes empty typed arrays Key: ARROW-4781 URL: https://issues.apache.org/jira/browse/ARROW-4781 Project: Apache Arrow Issue

[jira] [Created] (ARROW-4780) [JS] Package sourcemap files, update default package JS version

2019-03-05 Thread Paul Taylor (JIRA)
Paul Taylor created ARROW-4780: -- Summary: [JS] Package sourcemap files, update default package JS version Key: ARROW-4780 URL: https://issues.apache.org/jira/browse/ARROW-4780 Project: Apache Arrow

[jira] [Created] (ARROW-4738) [JS] NullVector should include a null data buffer

2019-03-01 Thread Paul Taylor (JIRA)
Paul Taylor created ARROW-4738: -- Summary: [JS] NullVector should include a null data buffer Key: ARROW-4738 URL: https://issues.apache.org/jira/browse/ARROW-4738 Project: Apache Arrow Issue

Re: Passing user-defined "extension" types in the Arrow protocol

2019-02-26 Thread Paul Taylor
An alternative that's worked for us is (ab)using single-child SparseUnions to represent custom types. We have an enum of "well-known" typeIds (UUID, vec2's, IP addresses, etc), whose data is stored in one of the known Arrow types, as you've done. Pros are the typeIds buffer is tiny, and doesn'

[jira] [Created] (ARROW-4682) [JS] Writer should be able to write empty tables

2019-02-26 Thread Paul Taylor (JIRA)
Paul Taylor created ARROW-4682: -- Summary: [JS] Writer should be able to write empty tables Key: ARROW-4682 URL: https://issues.apache.org/jira/browse/ARROW-4682 Project: Apache Arrow Issue Type

[jira] [Created] (ARROW-4674) [JS] Update arrow2csv to new Row API

2019-02-25 Thread Paul Taylor (JIRA)
Paul Taylor created ARROW-4674: -- Summary: [JS] Update arrow2csv to new Row API Key: ARROW-4674 URL: https://issues.apache.org/jira/browse/ARROW-4674 Project: Apache Arrow Issue Type: Bug

[jira] [Created] (ARROW-4652) [JS] RecordBatchReader throughNode should respect autoDestroy

2019-02-21 Thread Paul Taylor (JIRA)
Paul Taylor created ARROW-4652: -- Summary: [JS] RecordBatchReader throughNode should respect autoDestroy Key: ARROW-4652 URL: https://issues.apache.org/jira/browse/ARROW-4652 Project: Apache Arrow

[jira] [Created] (ARROW-4579) [JS] Add more interop with BigInt/BigInt64Array/BigUint64Array

2019-02-14 Thread Paul Taylor (JIRA)
Paul Taylor created ARROW-4579: -- Summary: [JS] Add more interop with BigInt/BigInt64Array/BigUint64Array Key: ARROW-4579 URL: https://issues.apache.org/jira/browse/ARROW-4579 Project: Apache Arrow

[jira] [Created] (ARROW-4580) [JS] Accept Iterables in IntVector/FloatVector from() signatures

2019-02-14 Thread Paul Taylor (JIRA)
Paul Taylor created ARROW-4580: -- Summary: [JS] Accept Iterables in IntVector/FloatVector from() signatures Key: ARROW-4580 URL: https://issues.apache.org/jira/browse/ARROW-4580 Project: Apache Arrow

[jira] [Created] (ARROW-4578) [JS] Float16Vector toArray should be zero-copy

2019-02-14 Thread Paul Taylor (JIRA)
Paul Taylor created ARROW-4578: -- Summary: [JS] Float16Vector toArray should be zero-copy Key: ARROW-4578 URL: https://issues.apache.org/jira/browse/ARROW-4578 Project: Apache Arrow Issue Type

[jira] [Created] (ARROW-4557) [JS] Add Table/Schema/RecordBatch `selectAt(...indices)` method

2019-02-12 Thread Paul Taylor (JIRA)
Paul Taylor created ARROW-4557: -- Summary: [JS] Add Table/Schema/RecordBatch `selectAt(...indices)` method Key: ARROW-4557 URL: https://issues.apache.org/jira/browse/ARROW-4557 Project: Apache Arrow

[jira] [Created] (ARROW-4555) [JS] Add high-level Table and Column creation methods

2019-02-12 Thread Paul Taylor (JIRA)
Paul Taylor created ARROW-4555: -- Summary: [JS] Add high-level Table and Column creation methods Key: ARROW-4555 URL: https://issues.apache.org/jira/browse/ARROW-4555 Project: Apache Arrow Issue

[jira] [Created] (ARROW-4554) [JS] Implement logic for combining Vectors with different lengths/chunksizes

2019-02-12 Thread Paul Taylor (JIRA)
Paul Taylor created ARROW-4554: -- Summary: [JS] Implement logic for combining Vectors with different lengths/chunksizes Key: ARROW-4554 URL: https://issues.apache.org/jira/browse/ARROW-4554 Project

[jira] [Created] (ARROW-4553) [JS] Implement Schema/Field/DataType comparators

2019-02-12 Thread Paul Taylor (JIRA)
Paul Taylor created ARROW-4553: -- Summary: [JS] Implement Schema/Field/DataType comparators Key: ARROW-4553 URL: https://issues.apache.org/jira/browse/ARROW-4553 Project: Apache Arrow Issue Type

[jira] [Created] (ARROW-4552) [JS] Table and Schema assign implementations

2019-02-12 Thread Paul Taylor (JIRA)
Paul Taylor created ARROW-4552: -- Summary: [JS] Table and Schema assign implementations Key: ARROW-4552 URL: https://issues.apache.org/jira/browse/ARROW-4552 Project: Apache Arrow Issue Type

Re: [RESULT][VOTE] Release Apache Arrow JS 0.4.0 - RC1

2019-02-05 Thread Paul Taylor
Arrow blog? Uwe On Mon, Feb 4, 2019, at 10:45 PM, Paul Taylor wrote: +1 verified on 18.04.1-Ubuntu and node v11.6.0 On 1/31/19 9:38 AM, Krisztián Szűcs wrote: +1 (binding), verified on OSX Mojave and Node v11.4.0 On Thu, Jan 31, 2019 at 6:06 PM Brian Hulette wrote: +1 verified on Archlinux

[jira] [Created] (ARROW-4477) [JS] Bn shouldn't override constructor of the resulting typed array

2019-02-04 Thread Paul Taylor (JIRA)
Paul Taylor created ARROW-4477: -- Summary: [JS] Bn shouldn't override constructor of the resulting typed array Key: ARROW-4477 URL: https://issues.apache.org/jira/browse/ARROW-4477 Project: Apache

Re: [VOTE] Release Apache Arrow JS 0.4.0 - RC1

2019-02-04 Thread Paul Taylor
+1 verified on 18.04.1-Ubuntu and node v11.6.0 > On 1/31/19 9:38 AM, Krisztián Szűcs wrote: > +1 (binding), > > verified on OSX Mojave and Node v11.4.0 > >> On Thu, Jan 31, 2019 at 6:06 PM Brian Hulette wrote: >> >> +1 >> >> verified on Archlinux with Node v11.9.0 >> >> Thanks a lot for put

[jira] [Created] (ARROW-4442) [JS] Overly broad type annotation for Chunked typeId leading to type mismatches in generated typing

2019-01-31 Thread Paul Taylor (JIRA)
Paul Taylor created ARROW-4442: -- Summary: [JS] Overly broad type annotation for Chunked typeId leading to type mismatches in generated typing Key: ARROW-4442 URL: https://issues.apache.org/jira/browse/ARROW-4442

Re: [Format] Passing selection masks with Arrow record batches

2019-01-27 Thread Paul Taylor
We’ve been doing this in a few different ways at Graphistry, mostly guided by use case and device characteristics. For temporary/in-memory/microservice CPU workloads, we’ll compute a set of valid row indices as one side of a DictionaryVector, with the original table/column as the dictionary sid

[jira] [Created] (ARROW-4396) Update Typedoc to support TypeScript 3.2

2019-01-27 Thread Paul Taylor (JIRA)
Paul Taylor created ARROW-4396: -- Summary: Update Typedoc to support TypeScript 3.2 Key: ARROW-4396 URL: https://issues.apache.org/jira/browse/ARROW-4396 Project: Apache Arrow Issue Type

[jira] [Created] (ARROW-4395) ts-node throws type error running `bin/arrow2csv.js`

2019-01-27 Thread Paul Taylor (JIRA)
Paul Taylor created ARROW-4395: -- Summary: ts-node throws type error running `bin/arrow2csv.js` Key: ARROW-4395 URL: https://issues.apache.org/jira/browse/ARROW-4395 Project: Apache Arrow Issue

[jira] [Created] (ARROW-4283) Should RecordBatchStreamReader/Writer be AsyncIteraable?

2019-01-17 Thread Paul Taylor (JIRA)
Paul Taylor created ARROW-4283: -- Summary: Should RecordBatchStreamReader/Writer be AsyncIteraable? Key: ARROW-4283 URL: https://issues.apache.org/jira/browse/ARROW-4283 Project: Apache Arrow

Re: Problems building Arrow Java

2018-12-31 Thread Paul Taylor
On 12/30/18 5:11 PM, Wes McKinney wrote: Hi Paul -- Java development has all been happening on Java 8 and 9. You might want to try an older JDK since newer ones aren't being formally supported yet Wes On Sun, Dec 30, 2018, 12:39 PM Paul Taylor Ah, just realized I was missing javac. I

Re: Problems building Arrow Java

2018-12-30 Thread Paul Taylor
projectarrow-vector:*Dependency problems found* ->*[Help 1]* On 12/30/18 10:21 AM, Paul Taylor wrote: Is anyone else having issues building Arrow Java? I'm trying to run the integration tests locally, but can't figure out why `mvn install` is failing. I see a number of warnin

Problems building Arrow Java

2018-12-30 Thread Paul Taylor
Is anyone else having issues building Arrow Java? I'm trying to run the integration tests locally, but can't figure out why `mvn install` is failing. I see a number of warnings, and a few checkstyle errors, but nothing besides that stands out. Thanks, Paul *~/dev/arrow/java*$ java --version P

Re: npmjs.com account to release Apache Arrow JavaScript

2018-12-14 Thread Paul Taylor
Hi Kouhei, I've added you as a maintainer of the apache-arrow top level package, as well as an owner on the @apache-arrow organization on npm. Paul On 12/14/18 1:59 PM, Kouhei Sutou wrote: Hi Brian, I read this change: https://cwiki.apache.org/confluence/pages/diffpagesbyversion.action

Re: Arrow JS 0.4.0 Release

2018-12-14 Thread Paul Taylor
again, now just fixing the last few issues in the build scripts. I'm especially pleased that `cat ./some-gigantic-table.arrow | npx arrow2csv | less` doesn't stream the entire table to less and terminate with a broken-pipe error anymore :-) Paul On 12/14/18 10:31 AM, Wes McKinney

Re: Arrow JS 0.4.0 Release

2018-12-13 Thread Paul Taylor
on point. And finally selfishly - if we had been on the main release cycle, the contributions I made in the summer would have been released in either 0.10 or 0.11 by now. Brian On Thu, Dec 13, 2018 at 3:29 AM Paul Taylor wrote: The ongoing JS refactor/upgrade branch <https://github.com/trxcl

Re: Arrow JS 0.4.0 Release

2018-12-13 Thread Paul Taylor
The ongoing JS refactor/upgrade branch is just about done. It's passing all the integration tests, as well as a hundred or so new unit tests. I have to update existing tests where the APIs changed, battle with closure-compiler a bit,

Re: Assign/update : NA bitmap vs sentinel

2018-11-10 Thread Paul Taylor
While I'm not qualified to debate the merits of various physical representations inside databases, I would like to chime in from the perspective of both an Arrow contributor and architect of perhaps one of the more exotic applications of Arrow in the wild (client/server + JavaScript + GPUs + gr

[jira] [Created] (ARROW-3337) JS writer doesn't serialize the dictionary of nested Vectors

2018-09-26 Thread Paul Taylor (JIRA)
Paul Taylor created ARROW-3337: -- Summary: JS writer doesn't serialize the dictionary of nested Vectors Key: ARROW-3337 URL: https://issues.apache.org/jira/browse/ARROW-3337 Project: Apache

[jira] [Created] (ARROW-3336) JS writer doesn't serialize sliced Vectors correctly

2018-09-26 Thread Paul Taylor (JIRA)
Paul Taylor created ARROW-3336: -- Summary: JS writer doesn't serialize sliced Vectors correctly Key: ARROW-3336 URL: https://issues.apache.org/jira/browse/ARROW-3336 Project: Apache Arrow

[jira] [Created] (ARROW-3304) JS stream reader should yield all messages

2018-09-23 Thread Paul Taylor (JIRA)
Paul Taylor created ARROW-3304: -- Summary: JS stream reader should yield all messages Key: ARROW-3304 URL: https://issues.apache.org/jira/browse/ARROW-3304 Project: Apache Arrow Issue Type

Re: [RESULT][VOTE] Release Apache Arrow 0.10.0 (RC1)

2018-08-06 Thread Paul Taylor
Looks like typedoc is using the wrong version of typescript. I can take a look at updating it later this afternoon if it’s blocking the release. On Mon, Aug 6, 2018 at 12:55 PM Li Jin wrote: > I was trying to build the documentation but hit a JavaScript error. I am > not familiar with JavaScript

Re: Help understanding IPC Message/Buffer structure

2018-07-12 Thread Paul Taylor
Hi Randy, The first four bytes are the int32 length of the flatbuffers Message metadata plus 4 bytes of padding between the length and the Message metadata itself. The Message metadata start

[jira] [Created] (ARROW-2839) [JS] Support whatwg/streams in IPC reader/writer

2018-07-12 Thread Paul Taylor (JIRA)
Paul Taylor created ARROW-2839: -- Summary: [JS] Support whatwg/streams in IPC reader/writer Key: ARROW-2839 URL: https://issues.apache.org/jira/browse/ARROW-2839 Project: Apache Arrow Issue Type

[jira] [Created] (ARROW-2828) [JS] Refactor Vector Data classes

2018-07-10 Thread Paul Taylor (JIRA)
Paul Taylor created ARROW-2828: -- Summary: [JS] Refactor Vector Data classes Key: ARROW-2828 URL: https://issues.apache.org/jira/browse/ARROW-2828 Project: Apache Arrow Issue Type: Task

Re: [DISCUSS] Concerns about the Arrow Slack channel

2018-07-09 Thread Paul Taylor
Brian, Leo, and I yesterday discussed creating an ArrowJS channel in the Graphistry Slack as an alternative. Graphistry doesn't mind being the channel admins, we just don't want to run afoul of any ASF guidelines for project organization. On 07/09/2018 11:47 AM, Uwe L. Korn wrote: Bumping th

Re: [VOTE] Close down Arrow Slack channel

2018-07-09 Thread Paul Taylor
-1 I use it to coordinate with folks on the JS side. As I already have to participate in a number other Slack workspaces on a daily basis, Arrow slack is generally less intrusive than even opening and responding to email. On 07/09/2018 10:30 AM, Bryan Cutler wrote: +1 On Mon, Jul 9, 2018 a

[jira] [Created] (ARROW-2779) [JS] Fix node stream reader/writer compatibility

2018-07-01 Thread Paul Taylor (JIRA)
Paul Taylor created ARROW-2779: -- Summary: [JS] Fix node stream reader/writer compatibility Key: ARROW-2779 URL: https://issues.apache.org/jira/browse/ARROW-2779 Project: Apache Arrow Issue Type

[jira] [Created] (ARROW-2650) [JS] Finish implementing Unions

2018-05-31 Thread Paul Taylor (JIRA)
Paul Taylor created ARROW-2650: -- Summary: [JS] Finish implementing Unions Key: ARROW-2650 URL: https://issues.apache.org/jira/browse/ARROW-2650 Project: Apache Arrow Issue Type: Bug

[jira] [Created] (ARROW-2640) JS Writer should serialize schema metadata

2018-05-27 Thread Paul Taylor (JIRA)
Paul Taylor created ARROW-2640: -- Summary: JS Writer should serialize schema metadata Key: ARROW-2640 URL: https://issues.apache.org/jira/browse/ARROW-2640 Project: Apache Arrow Issue Type: Bug

Proposed Arrow Graph representations

2018-05-19 Thread Paul Taylor
At GTC San Jose last month, NVidia's Joe Eaton (cc'd) presented on the nvGraph team's goals for accelerating in-memory graph processing and analytics. A major component of that is advancing and standardizing a common, efficient representation for graphs that c

Re: [JS] Arrow output from JS library?

2018-05-10 Thread Paul Taylor
ata to ArrayBuffers. Next steps are to get more tests in place, finish the RecordBatchFileWriter, do the JSON writers, validate against Arrow cpp/java, implement builders, fixes/optimizations, and get a PR ready. Best, Paul On Tue, Apr 17, 2018 at 7:09 PM, Paul Taylor wrote: > Hi Naveen, > &

Re: [JS] Arrow output from JS library?

2018-04-17 Thread Paul Taylor
Hi Naveen, I have some work in a branch on my fork, and perhaps a bit more locally, but it's not finished. Feel free to reach out if you want to collaborate. Otherwise Graphistry intends to staff it full time in the next few weeks. Best, Paul On Tue, Apr 17, 2018 at 3:33 PM, Naveen Michau

Re: Next Arrow sync call

2018-03-29 Thread Paul Taylor
I'd like to join the gcal invite as well. Thanks! > On Mar 29, 2018, at 11:10 AM, Wes McKinney wrote: > > Looks good. > > The next Arrow sync will be Wednesday April 4 at 12:00 US Eastern time > > On Thu, Mar 29, 2018 at 7:53 AM, Uwe L. Korn wrote: >> Hi, >> >> I've added all who have reques

[jira] [Created] (ARROW-2356) [JS] JSON reader fails on FixedSizeBinary data buffer

2018-03-26 Thread Paul Taylor (JIRA)
Paul Taylor created ARROW-2356: -- Summary: [JS] JSON reader fails on FixedSizeBinary data buffer Key: ARROW-2356 URL: https://issues.apache.org/jira/browse/ARROW-2356 Project: Apache Arrow Issue

Re: [DISCUSS] Arrow 1.0 Compatibility Issues: Union and Interval

2018-03-20 Thread Paul Taylor
Jumping in b/c I did the JS Union implementations. I inferred the behavior from what I understood the C++ and Java to be doing, so I may have misunderstood how they should work. > To that end, we talked about > introducing a "single-primitive" (a.k.a. "javascript") union behavior that > would op

Re: Working towards getting 0.9.0 release candidate up next week

2018-03-14 Thread Paul Taylor
This should fix it: https://github.com/apache/arrow/pull/1751 > On Mar 14, 2018, at 6:42 PM, Wes McKinney wrote: > > Last item https://issues.apache.org/jira/browse/ARROW-2312 > > I can start the release vote as soon as we have the release > verifica

Re: [VOTE] Apache Arrow JavaScript 0.3.1 - RC1

2018-03-14 Thread Paul Taylor
+1 (non-binding) > On Mar 14, 2018, at 5:10 PM, Wes McKinney wrote: > > Hello all, > > I\'d like to propose the following release candidate (rc1) of Apache Arrow > JavaScript version 0.3.1. > > The source release rc1 is hosted at [1]. > > This release candidate is based on commit > 077bd53df5

Re: [VOTE] Release Apache Arrow JavaScript 0.3.1 - RC0

2018-03-14 Thread Paul Taylor
This issue has been resolved. I'm available this week to help with anything else blocking this release. Thx > On Mar 12, 2018, at 9:10 AM, Wes McKinney wrote: > > OK, thanks Brian. I will cancel this release and we can cut a new RC > after this issue is resolved. > > On Mon, Mar 12, 2018 at 10

[jira] [Created] (ARROW-2226) [JS] DictionaryData should use indices' offset in constructor

2018-02-26 Thread Paul Taylor (JIRA)
Paul Taylor created ARROW-2226: -- Summary: [JS] DictionaryData should use indices' offset in constructor Key: ARROW-2226 URL: https://issues.apache.org/jira/browse/ARROW-2226 Project: Apache

[jira] [Created] (ARROW-2225) [JS] Vector reader should support reading tables split across buffers

2018-02-26 Thread Paul Taylor (JIRA)
Paul Taylor created ARROW-2225: -- Summary: [JS] Vector reader should support reading tables split across buffers Key: ARROW-2225 URL: https://issues.apache.org/jira/browse/ARROW-2225 Project: Apache

[jira] [Created] (ARROW-2214) [JS] proxy DictionaryVector's nullBitmap to its indices' nullBitmap

2018-02-25 Thread Paul Taylor (JIRA)
Paul Taylor created ARROW-2214: -- Summary: [JS] proxy DictionaryVector's nullBitmap to its indices' nullBitmap Key: ARROW-2214 URL: https://issues.apache.org/jira/browse/ARROW-2214 Project: Ap

[jira] [Created] (ARROW-2213) [JS] Fix npm-release.sh

2018-02-25 Thread Paul Taylor (JIRA)
Paul Taylor created ARROW-2213: -- Summary: [JS] Fix npm-release.sh Key: ARROW-2213 URL: https://issues.apache.org/jira/browse/ARROW-2213 Project: Apache Arrow Issue Type: Bug

[jira] [Created] (ARROW-1903) [JS] Fix typings consuming apache-arrow module when noImplicitAny is false

2017-12-07 Thread Paul Taylor (JIRA)
Paul Taylor created ARROW-1903: -- Summary: [JS] Fix typings consuming apache-arrow module when noImplicitAny is false Key: ARROW-1903 URL: https://issues.apache.org/jira/browse/ARROW-1903 Project: Apache

Arrow JS tasks and roadmap

2017-10-19 Thread Paul Taylor
Brian Hulette and I have outlined this list of tasks/improvements for the expanded Arrow JS implementation: https://docs.google.com/document/d/142dek89oM2TVI2Yql106Zo8IB1Ff_9zDg_EG6jPWS0M/edit?usp=sharing

[jira] [Created] (ARROW-1590) Flow TS Table method generics

2017-09-21 Thread Paul Taylor (JIRA)
Paul Taylor created ARROW-1590: -- Summary: Flow TS Table method generics Key: ARROW-1590 URL: https://issues.apache.org/jira/browse/ARROW-1590 Project: Apache Arrow Issue Type: Bug

[jira] [Created] (ARROW-1549) [JS] Integrate auto-generated Arrow test files

2017-09-17 Thread Paul Taylor (JIRA)
Paul Taylor created ARROW-1549: -- Summary: [JS] Integrate auto-generated Arrow test files Key: ARROW-1549 URL: https://issues.apache.org/jira/browse/ARROW-1549 Project: Apache Arrow Issue Type

[jira] [Created] (ARROW-1544) [JS] Export Vector type definitions

2017-09-15 Thread Paul Taylor (JIRA)
Paul Taylor created ARROW-1544: -- Summary: [JS] Export Vector type definitions Key: ARROW-1544 URL: https://issues.apache.org/jira/browse/ARROW-1544 Project: Apache Arrow Issue Type: Bug