GitHub user prateekm opened a pull request:
https://github.com/apache/samza/pull/25
SAMZA-1054: Refactor Operator APIs
Some suggestions for an Operator API refactor and misc. cleanup. It does
contain some implementation changes, mostly due to deleted, extracted or merged
classes
GitHub user prateekm opened a pull request:
https://github.com/apache/samza/pull/35
SAMZA-1065: Change the commit order to support at least once processing
when using local state store for deduping.
You can merge this pull request into a Git repository by running:
$ git pull
GitHub user prateekm opened a pull request:
https://github.com/apache/samza/pull/46
Upgraded RocksDB version to 5.0.1 and added configuration for managing
RocksDB logging
You can merge this pull request into a Git repository by running:
$ git pull https://github.com/prateekm
GitHub user prateekm opened a pull request:
https://github.com/apache/samza/pull/60
SAMZA-1091: Implement key-based inner join operator with no time constraints
... and no retention
You can merge this pull request into a Git repository by running:
$ git pull https://github.com
GitHub user prateekm opened a pull request:
https://github.com/apache/samza/pull/78
SAMZA-1121: StreamAppender should not propagate exceptions to the caller
StreamAppender#append currently propagates any exceptions while sending
messages to the underlying logging system to the
GitHub user prateekm opened a pull request:
https://github.com/apache/samza/pull/92
SAMZA-1094, SAMZA-1101: Remove MessageEnvelope from public operator APIs.
Delay the creation of SinkFunction for output streams.
Additionally,
1. Removed references to StreamGraph in
GitHub user prateekm reopened a pull request:
https://github.com/apache/samza/pull/92
SAMZA-1094: Remove MessageEnvelope from public operator APIs. SAMZA-1101:
Delay the creation of SinkFunction for output streams. SAMZA-1159: Move
StreamSpec from a public API to an internal class
Github user prateekm closed the pull request at:
https://github.com/apache/samza/pull/92
---
If your project is set up for it, you can reply to this email and have your
reply appear on GitHub as well. If your project does not have this feature
enabled and wishes so, or if the feature is
GitHub user prateekm opened a pull request:
https://github.com/apache/samza/pull/105
SAMZA-1176: Make TestJoinOperator unit tests safe for concurrent execution
There are occasional failures like the following:
```joinRetainsMatchedMessagesReverse FAILED
GitHub user prateekm opened a pull request:
https://github.com/apache/samza/pull/113
SAMZA-1191: Fixed flaky test: TestExponentialSleepStrategy
testThreadInterruptInRetryLoop
It's possible that the interruptee thread (see `#interruptedThread`) gets
pre-empted before it
GitHub user prateekm opened a pull request:
https://github.com/apache/samza/pull/115
SAMZA-1192: Fixed TestJoinOperator test failure on JDK 1.8.0_05
You can merge this pull request into a Git repository by running:
$ git pull https://github.com/prateekm/samza join-test-fix
GitHub user prateekm opened a pull request:
https://github.com/apache/samza/pull/123
SAMZA-1211: Remove Thread.sleep() from TestJoinOperator tests
You can merge this pull request into a Git repository by running:
$ git pull https://github.com/prateekm/samza join-test-no-sleep
GitHub user prateekm opened a pull request:
https://github.com/apache/samza/pull/131
SAMZA-1176: Intermittent TestJoinOperator unit test failure
Join TTL was set too low (10 ms). `joinRetainsMatchedMessagesReverse` will
fail if the execution time between line 176 and 186 is longer
GitHub user prateekm opened a pull request:
https://github.com/apache/samza/pull/134
SAMZA-1229: Disk space monitor should only count data in use by the
container
You can merge this pull request into a Git repository by running:
$ git pull https://github.com/prateekm/samza
GitHub user prateekm opened a pull request:
https://github.com/apache/samza/pull/142
SAMZA-1219 Add metrics for operator message received and execution times
You can merge this pull request into a Git repository by running:
$ git pull https://github.com/prateekm/samza
GitHub user prateekm opened a pull request:
https://github.com/apache/samza/pull/143
SAMZA-1200: Scala compile for samza-core fails with ambiguous reference
error...
... for some compiler versions.
You can merge this pull request into a Git repository by running:
$ git pull
GitHub user prateekm opened a pull request:
https://github.com/apache/samza/pull/144
SAMZA-1026: HDFS System Producer should not have Kafka dependency
You can merge this pull request into a Git repository by running:
$ git pull https://github.com/prateekm/samza hdfs-kafka
GitHub user prateekm opened a pull request:
https://github.com/apache/samza/pull/159
SAMZA-1247: MessageStreamImpl#merge shouldn't mutate input collection
Also fixes
SAMZA-1253: MessageStream.merge operator broken for nested types
You can merge this pull request into
GitHub user prateekm opened a pull request:
https://github.com/apache/samza/pull/169
SAMZA-1268: Javadoc cleanup for public APIs for 0.13 release
You can merge this pull request into a Git repository by running:
$ git pull https://github.com/prateekm/samza api-docs-cleanup
GitHub user prateekm opened a pull request:
https://github.com/apache/samza/pull/170
Disabled flaky test TestProcessJob testProcessJobKillShouldWork
To be fixed in SAMZA-1261
You can merge this pull request into a Git repository by running:
$ git pull https://github.com
Github user prateekm closed the pull request at:
https://github.com/apache/samza/pull/170
---
If your project is set up for it, you can reply to this email and have your
reply appear on GitHub as well. If your project does not have this feature
enabled and wishes so, or if the feature is
GitHub user prateekm opened a pull request:
https://github.com/apache/samza/pull/171
Disabled a few flaky tests and added corresponding tickets to fix.
You can merge this pull request into a Git repository by running:
$ git pull https://github.com/prateekm/samza disable-flaky
GitHub user prateekm opened a pull request:
https://github.com/apache/samza/pull/174
SAMZA-1274: Update kafka-python and kafka broker version for integration
tests
You can merge this pull request into a Git repository by running:
$ git pull https://github.com/prateekm/samza
GitHub user prateekm opened a pull request:
https://github.com/apache/samza/pull/175
SAMZA-1268: More javadoc cleanup for public APIs for 0.13 release
You can merge this pull request into a Git repository by running:
$ git pull https://github.com/prateekm/samza documentation
GitHub user prateekm opened a pull request:
https://github.com/apache/samza/pull/182
SAMZA-1277: Add a static merge() operator that takes all streams to merge
as input
Also updated documentation for join and partitionBy.
You can merge this pull request into a Git repository by
GitHub user prateekm opened a pull request:
https://github.com/apache/samza/pull/183
Disabled flaky test for TestExponentialSleepStrategy
testThreadInterruptInOperationSleep
To be fixed in SAMZA-1269
You can merge this pull request into a Git repository by running:
$ git pull
GitHub user prateekm opened a pull request:
https://github.com/apache/samza/pull/187
SAMZA-1287: Updated instructions for running integration and stability
tests.
You can merge this pull request into a Git repository by running:
$ git pull https://github.com/prateekm/samza
GitHub user prateekm opened a pull request:
https://github.com/apache/samza/pull/194
SAMZA-1221, SAMZA-1101: Internal cleanup for High-Level API implementation.
SAMZA-1221: Separated the OperatorSpec and MessageStream DAGs so that
they're now duals of each other. Users int
GitHub user prateekm opened a pull request:
https://github.com/apache/samza/pull/224
Updated test build versions to future SNAPSHOT version
You can merge this pull request into a Git repository by running:
$ git pull https://github.com/prateekm/samza version-update
GitHub user prateekm opened a pull request:
https://github.com/apache/samza/pull/227
Updated links, versions and feature information in README.
You can merge this pull request into a Git repository by running:
$ git pull https://github.com/prateekm/samza readme-update
Github user prateekm closed the pull request at:
https://github.com/apache/samza/pull/183
---
If your project is set up for it, you can reply to this email and have your
reply appear on GitHub as well. If your project does not have this feature
enabled and wishes so, or if the feature is
GitHub user prateekm opened a pull request:
https://github.com/apache/samza/pull/248
SAMZA-1361: OperatorImplGraph is using wrong keys to store/retrieve
OperatorImpl in the map
You can merge this pull request into a Git repository by running:
$ git pull https://github.com
GitHub user prateekm opened a pull request:
https://github.com/apache/samza/pull/251
SAMZA-1370: Memory leak in CachedStore when using ByteBufferSerde as key
Serde
You can merge this pull request into a Git repository by running:
$ git pull https://github.com/prateekm/samza
GitHub user prateekm opened a pull request:
https://github.com/apache/samza/pull/293
Support for declaring serdes in high level API code for
input/output/intermediate streams.
@nickpan47 @vjagadish1989 @jmakes Please take a look.
Some notes/considerations:
* Serde
GitHub user prateekm opened a pull request:
https://github.com/apache/samza/pull/309
Added wiring for High Level API state stores, their serdes and changelogs.
Provided join operator access to durable state stores.
cc @vjagadish1989 for review.
You can merge this pull
GitHub user prateekm opened a pull request:
https://github.com/apache/samza/pull/310
SAMZA-1109: Updated High Level API serde impl with Yi's feedback
@nickpan47 for review.
You can merge this pull request into a Git repository by running:
$ git pull https://github.com/pra
GitHub user prateekm opened a pull request:
https://github.com/apache/samza/pull/311
Added self to committers list.
@vjagadish1989
You can merge this pull request into a Git repository by running:
$ git pull https://github.com/prateekm/samza commiters-update
Alternatively
GitHub user prateekm opened a pull request:
https://github.com/apache/samza/pull/312
Added my code signing key to KEYS
@vjagadish1989 for review
You can merge this pull request into a Git repository by running:
$ git pull https://github.com/prateekm/samza commiters-update
Github user prateekm closed the pull request at:
https://github.com/apache/samza/pull/227
---
GitHub user prateekm opened a pull request:
https://github.com/apache/samza/pull/315
Removed circular dependency b/w samza-core test and samza-kv-rocksdb test
Also added an implementation for KVSerde.
@vjagadish1989, thanks for the TestInMemoryStore implementation!
The
GitHub user prateekm opened a pull request:
https://github.com/apache/samza/pull/316
SAMZA-1435: Changed samza-api Serde implementations from Scala to Java.
You can merge this pull request into a Git repository by running:
$ git pull https://github.com/prateekm/samza java
GitHub user prateekm opened a pull request:
https://github.com/apache/samza/pull/318
SAMZA-272, SAMZA-1440, SAMZA-1269: Fixed thread interrupt tests in
TestExponentialSleepStrategy.
You can merge this pull request into a Git repository by running:
$ git pull https
GitHub user prateekm opened a pull request:
https://github.com/apache/samza/pull/320
SAMZA-1292: Merge operator can be no-op when there are no streams to merge
You can merge this pull request into a Git repository by running:
$ git pull https://github.com/prateekm/samza small
GitHub user prateekm opened a pull request:
https://github.com/apache/samza/pull/324
SAMZA-1454: Globally unique and user settable IDs for stateful operators
You can merge this pull request into a Git repository by running:
$ git pull https://github.com/prateekm/samza
GitHub user prateekm opened a pull request:
https://github.com/apache/samza/pull/332
Minor fixes to KeyValueStore and RocksDBKeyValueStore
1. Replaced extension class in KeyValueStore with default methods.
2. Fixed formatting in RocksDBKeyValueStore#openDB.
3. Now logs
GitHub user prateekm opened a pull request:
https://github.com/apache/samza/pull/333
SAMZA-1466: Flaky test: TestRocksDbKeyValueStore suite
Test failures happen intermittently because:
1. RocksDB throws an exception if open is called on a store that's already
open.
2.
GitHub user prateekm opened a pull request:
https://github.com/apache/samza/pull/334
SAMZA-1464: Flushing a closed RocksDB store causes SIGSEGVs
Made RocksDB operations check if DB is still open to avoid segfaults.
You can merge this pull request into a Git repository by running
GitHub user prateekm opened a pull request:
https://github.com/apache/samza/pull/354
Ignore java fatal error log files from git and rat
You can merge this pull request into a Git repository by running:
$ git pull https://github.com/prateekm/samza rat-exclude-hserr
GitHub user prateekm opened a pull request:
https://github.com/apache/samza/pull/358
SAMZA-1495: Set intermediate streams as higher priority by default
Most changes in StreamConfig are formatting fixes.
You can merge this pull request into a Git repository by running:
$ git
GitHub user prateekm opened a pull request:
https://github.com/apache/samza/pull/359
SAMZA-1501: Validate operator IDs so that they don't contain special
characters and spaces
You can merge this pull request into a Git repository by running:
$ git pull https://githu
GitHub user prateekm opened a pull request:
https://github.com/apache/samza/pull/376
Updated Serde related documentation and error messages for High Level API
Updated and clarified the documentation and error messages related to
Serdes for Input/Output/PartitionBy streams.
You can
GitHub user prateekm opened a pull request:
https://github.com/apache/samza/pull/389
SAMZA-1538: Disabled Flaky Tests in TestStreamProcessor
You can merge this pull request into a Git repository by running:
$ git pull https://github.com/prateekm/samza disable-flaky-test
GitHub user prateekm opened a pull request:
https://github.com/apache/samza/pull/404
SAMZA-1500: Added metrics for RocksDB state store memory usage
You can merge this pull request into a Git repository by running:
$ git pull https://github.com/prateekm/samza rocksdb-memory
GitHub user prateekm opened a pull request:
https://github.com/apache/samza/pull/414
Added some logging to stdout for easier parsing by tools.
You can merge this pull request into a Git repository by running:
$ git pull https://github.com/prateekm/samza print-container-info
GitHub user prateekm opened a pull request:
https://github.com/apache/samza/pull/433
Misc. minor cleanup.
1. Added a meaningful name for the container thread pool threads.
2. Made the thread names for framework threads consistent.
3. Made a couple of monitoring/metrics
GitHub user prateekm opened a pull request:
https://github.com/apache/samza/pull/455
Misc. Util cleanup
Major changes:
1. Broke up 'Util' class into multiple classes: 'FileUtil', 'HttpUtil',
'CoordinatorStreamUtil'.
2. Consolidated so
GitHub user prateekm opened a pull request:
https://github.com/apache/samza/pull/460
SAMZA-1630: Log a thread dump on timeouts
Would be useful to get a thread dump on timeouts, e.g. for AsyncStreamTask
callback timeout, container shutdown timeout, heartbeat monitor graceful
GitHub user prateekm opened a pull request:
https://github.com/apache/samza/pull/461
SAMZA-1631: Improve logging on Task callback timeout
You can merge this pull request into a Git repository by running:
$ git pull https://github.com/prateekm/samza task-callback-logging
GitHub user prateekm opened a pull request:
https://github.com/apache/samza/pull/462
SAMZA-1630: Move thread dump from stdout to logs
You can merge this pull request into a Git repository by running:
$ git pull https://github.com/prateekm/samza thread-dump-on-timeout
GitHub user prateekm opened a pull request:
https://github.com/apache/samza/pull/468
SAMZA-1643: StreamPartitionCountMonitor should only restart/shut down the
job if partition count increases
As an aside, also update the gauge to report current number of partitions
instead of the
GitHub user prateekm opened a pull request:
https://github.com/apache/samza/pull/513
Fixed test failure for TestRocksDbKeyValueStoreJava#testIterate
RocksDB Snapshots and any iterators obtained from them need to be closed
before the store is closed.
Otherwise the process
GitHub user prateekm opened a pull request:
https://github.com/apache/samza/pull/514
Fixed test failure for TestRocksDbKeyValueStoreJava#testPerf
Iterators (incl. those obtained from snapshots) must be closed before store
close.
You can merge this pull request into a Git
GitHub user prateekm opened a pull request:
https://github.com/apache/samza/pull/538
SAMZA-1724: Guarantee exit from ApplicationRunnerMain during deploys
You can merge this pull request into a Git repository by running:
$ git pull https://github.com/prateekm/samza process
GitHub user prateekm opened a pull request:
https://github.com/apache/samza/pull/539
SAMZA-1736: Add counters and timers for batch get/put/delete operations in
KeyValueStorageEngine
You can merge this pull request into a Git repository by running:
$ git pull https
GitHub user prateekm opened a pull request:
https://github.com/apache/samza/pull/546
Minor: KafkaConfig should treat empty changelog name as no changelog.
If a store changelog stream name is empty, treat is as a non-changelogged
store instead of throwing an exception.
You can
GitHub user prateekm opened a pull request:
https://github.com/apache/samza/pull/552
SAMZA-1745: Remove all usages of StreamSpec and ApplicationRunner from the
operator spec and impl layers.
This PR is a pre-requisite for adding support for user-provided
SystemDescriptors and
GitHub user prateekm opened a pull request:
https://github.com/apache/samza/pull/586
SAMZA-1790: LocalContainerRunner should not extend
AbstractApplicationRunner.
LocalContainerRunner is the launcher for the process running SamzaContainer
in YARN. It extends the
GitHub user prateekm opened a pull request:
https://github.com/apache/samza/pull/603
SAMZA-1804: System and Stream Descriptors
Design details:
https://cwiki.apache.org/confluence/display/SAMZA/SEP-14%3A+System+and+Stream+Descriptors
You can merge this pull request into a Git
GitHub user prateekm opened a pull request:
https://github.com/apache/samza/pull/612
SAMZA-1813: ApplicationRunner should use Planner generated configs for
StreamManager
You can merge this pull request into a Git repository by running:
$ git pull https://github.com/prateekm
GitHub user prateekm opened a pull request:
https://github.com/apache/samza/pull/627
SAMZA-1841: Added support for marking an input stream as broadcast using
InputDescriptors
You can merge this pull request into a Git repository by running:
$ git pull https://github.com
GitHub user prateekm opened a pull request:
https://github.com/apache/samza/pull/628
SAMZA-1845: Require providing serde for default system in
ApplicationDescriptor#setDefaultSystem
You can merge this pull request into a Git repository by running:
$ git pull https
Github user prateekm closed the pull request at:
https://github.com/apache/samza/pull/628
---
GitHub user prateekm opened a pull request:
https://github.com/apache/samza/pull/693
SAMZA-1845: Require setting serde for intermediate streams in High Level API
You can merge this pull request into a Git repository by running:
$ git pull https://github.com/prateekm/samza
GitHub user prateekm opened a pull request:
https://github.com/apache/samza/pull/702
Application Descriptor implementation cleanup.
Major changes:
1. Move withDefaultSystem to the base ApplicationDescriptor interface so
that it can be used for TaskApplication as well
GitHub user prateekm opened a pull request:
https://github.com/apache/samza/pull/708
System and Stream Descriptor API cleanup.
Major changes:
1. Made withPhysicalName(String) a protected method in StreamDescriptor.
The primary reason to set the physical name is to use
GitHub user prateekm opened a pull request:
https://github.com/apache/samza/pull/720
Consolidating package names for System, Stream, Application and Table
descriptors.
Everything in this PR is either:
1. A package name change and a corresponding file move.
2. Javadoc
GitHub user prateekm opened a pull request:
https://github.com/apache/samza/pull/722
Fix broken tests on master.
You can merge this pull request into a Git repository by running:
$ git pull https://github.com/prateekm/samza test-fixes
Alternatively you can review and apply
GitHub user prateekm opened a pull request:
https://github.com/apache/samza/pull/725
Minor fixes for issues found during testing.
1. Made public methods in ApplicationDescriptorImpl non-final for mocking.
2. Updated CachedTableDescriptor public method return types to include
GitHub user prateekm opened a pull request:
https://github.com/apache/samza/pull/728
User provided configuration should have higher precedence than Samza
generated configuration.
You can merge this pull request into a Git repository by running:
$ git pull https://github.com
GitHub user prateekm opened a pull request:
https://github.com/apache/samza/pull/737
Javadoc cleanup for new Application, Descriptor, Context and Table APIs.
You can merge this pull request into a Git repository by running:
$ git pull https://github.com/prateekm/samza javadoc
GitHub user prateekm opened a pull request:
https://github.com/apache/samza/pull/741
Changed TaskApplicationDescriptor APIs to be fluent and consistent with
StreamApplicationDescriptor API.
You can merge this pull request into a Git repository by running:
$ git pull https
GitHub user prateekm opened a pull request:
https://github.com/apache/samza/pull/743
LegacyTaskApplication should set app.mode to batch by default.
You can merge this pull request into a Git repository by running:
$ git pull https://github.com/prateekm/samza app-mode-fix
GitHub user prateekm opened a pull request:
https://github.com/apache/samza/pull/754
Update hardcoded versions for the 1.0.0 release.
You can merge this pull request into a Git repository by running:
$ git pull https://github.com/prateekm/samza version-updates
Alternatively
GitHub user prateekm opened a pull request:
https://github.com/apache/samza/pull/760
Javadoc cleanup for new Application, Descriptor, Context and Table APIs -
Part 2
Currently, we don't allow imports for use only in javadocs. This requires
using FQNs in link tags, which i
GitHub user prateekm opened a pull request:
https://github.com/apache/samza/pull/783
Updated RELEASE instructions.
You can merge this pull request into a Git repository by running:
$ git pull https://github.com/prateekm/samza release-docs
Alternatively you can review and
GitHub user prateekm opened a pull request:
https://github.com/apache/samza/pull/840
Minor fix to some config variable names and accessor methods.
You can merge this pull request into a Git repository by running:
$ git pull https://github.com/prateekm/samza fix-config-names
GitHub user prateekm opened a pull request:
https://github.com/apache/samza/pull/855
Minor: Log full thread stacks in thread dumps.
You can merge this pull request into a Git repository by running:
$ git pull https://github.com/prateekm/samza detailed-thread-dump
GitHub user prateekm opened a pull request:
https://github.com/apache/samza/pull/859
SAMZA-2042: Add support for inspecting/modifying checkpoints before they're
written.
You can merge this pull request into a Git repository by running:
$ git pull https://github.com/pra
Github user prateekm closed the pull request at:
https://github.com/apache/samza/pull/627
---
89 matches
Mail list logo