This is an automated email from the ASF dual-hosted git repository.
git-site-role pushed a commit to branch asf-site
in repository https://gitbox.apache.org/repos/asf/datasketches-website.git
The following commit(s) were added to refs/heads/asf-site by this push:
new 9eea4c04 Automatic Site Publish by Buildbot
9eea4c04 is described below
commit 9eea4c04be06d84aa88311178345a5c978a9c436
Author: buildbot <[email protected]>
AuthorDate: Mon Sep 9 16:45:42 2024 +0000
Automatic Site Publish by Buildbot
---
output/css/fonts.css | 2 +-
output/docs/Architecture/MajorSketchFamilies.html | 2 +-
output/docs/Background/Presentations.html | 2 +-
output/docs/Community/ReleaseProcessForJavaComponents.html | 2 +-
output/docs/Frequency/FrequentItemsPigUDFs.html | 2 +-
output/docs/HLL/HllSketchVsDruidHyperLogLogCollector.html | 4 ++--
output/docs/HLL/Hll_vs_CS_Hllpp.html | 2 +-
output/docs/Quantiles/ClassicQuantilesSketch.html | 2 +-
output/docs/QuantilesStudies/DruidApproxHistogramStudy.html | 2 +-
output/docs/QuantilesStudies/MomentsSketchStudy.html | 2 +-
output/docs/QuantilesStudies/QuantilesStreamAStudy.html | 2 +-
output/docs/Theta/ThetaAccuracy.html | 2 +-
output/docs/Theta/ThetaSketchFramework.html | 4 +---
output/docs/Theta/ThetaSketchSetOpsAccuracy.html | 4 ++--
14 files changed, 16 insertions(+), 18 deletions(-)
diff --git a/output/css/fonts.css b/output/css/fonts.css
index 5c8833ca..59d4e4b1 100644
--- a/output/css/fonts.css
+++ b/output/css/fonts.css
@@ -56,5 +56,5 @@
font-family: 'Open Sans Condensed';
font-style: normal;
font-weight: 700;
- src: url(/fonts/open_sans_700.ttf) format('truetype');
+ src: url(/fonts/open_sans_condensed_700.ttf) format('truetype');
}
diff --git a/output/docs/Architecture/MajorSketchFamilies.html
b/output/docs/Architecture/MajorSketchFamilies.html
index 820d07ab..1aa1b736 100644
--- a/output/docs/Architecture/MajorSketchFamilies.html
+++ b/output/docs/Architecture/MajorSketchFamilies.html
@@ -553,7 +553,7 @@ However, if an approximate answer to these problems is
acceptable, <a href="/doc
<li><a
href="https://github.com/apache/datasketches-java/blob/master/src/main/java/org/apache/datasketches/tuple/adouble/DoubleSketch.java">tuple/adouble/DoubleSketch</a>
with a single column of <em>double</em> values as the <em>summary</em>.</li>
<li><a
href="https://github.com/apache/datasketches-java/blob/master/src/main/java/org/apache/datasketches/tuple/aninteger/IntegerSketch.java">tuple/aninteger/IntegerSketch</a>
with a single column of <em>int</em> values as the <em>summary</em>.</li>
<li><a
href="https://github.com/apache/datasketches-java/blob/master/src/main/java/org/apache/datasketches/tuple/strings/ArrayOfStringsSketch.java">tuple/strings/ArrayOfStringsSketch</a>,
which is effectively a variable number of columns of strings as the
<em>summary</em>.</li>
- <li><a
href="https://github.com/apache/datasketches-java/blob/master/src/main/java/org/apache/datasketches/tuple/ArrayOfDoublesSketch.java">tuple/ArrayOfDoublesSketch</a>,
which enables the user to specify the number of columns of double values as
the <em>summary</em>. This variant also provides both on-heap and off-heap
operation.</li>
+ <li><a
href="https://github.com/apache/datasketches-java/blob/master/src/main/java/org/apache/datasketches/tuple/arrayofdoubles/ArrayOfDoublesSketch.java">tuple/ArrayOfDoublesSketch</a>,
which enables the user to specify the number of columns of double values as
the <em>summary</em>. This variant also provides both on-heap and off-heap
operation.</li>
</ul>
<h3 id="hyperloglog-sketches-estimating-stream-cardinalities"><a
href="/docs/HLL/HLL.html">HyperLogLog Sketches</a>: Estimating Stream
Cardinalities</h3>
diff --git a/output/docs/Background/Presentations.html
b/output/docs/Background/Presentations.html
index e0c18014..bebdd0a6 100644
--- a/output/docs/Background/Presentations.html
+++ b/output/docs/Background/Presentations.html
@@ -571,7 +571,7 @@
</tr>
<tr>
<td>Presenter</td>
- <td><a
href="https://warwick.ac.uk/fac/sci/dcs/people/pavel_vesely/">Pavel Vesely</a>
from <a href="https://www.mff.cuni.cz/en/iuuk">CSI Charles Univ, Prague</a></td>
+ <td><a href="https://iuuk.mff.cuni.cz/~vesely/">Pavel Vesely</a> from <a
href="https://www.mff.cuni.cz/en/iuuk">CSI Charles Univ, Prague</a></td>
</tr>
<tr>
<td>Date</td>
diff --git a/output/docs/Community/ReleaseProcessForJavaComponents.html
b/output/docs/Community/ReleaseProcessForJavaComponents.html
index 12f416fc..3f58b69c 100644
--- a/output/docs/Community/ReleaseProcessForJavaComponents.html
+++ b/output/docs/Community/ReleaseProcessForJavaComponents.html
@@ -837,7 +837,7 @@ Also specifies line separator characters for text
files.</li>
<h3 id="create-copy-of-external-artifact-distributions">Create Copy of
External Artifact Distributions</h3>
<h4 id="java-only">JAVA ONLY</h4>
<ul>
- <li>Place copies of the artifact jars deployed to Nexus under a “maven”
directory. For example see <a
href="https://dist.apache.org/repos/dist/dev/datasketches/memory/1.3.0-RC1/">https://dist.apache.org/repos/dist/dev/datasketches/memory/1.3.0-RC1/</a></li>
+ <li>Place copies of the artifact jars deployed to Nexus under a “maven”
directory. For example see <a
href="https://dist.apache.org/repos/dist/dev/datasketches/memory/3.0.0-RC1/">https://dist.apache.org/repos/dist/dev/datasketches/memory/3.0.0-RC1/</a></li>
<li>Note that the <code class="highlighter-rouge">jar</code> files with
their <code class="highlighter-rouge">asc</code>, (optional <code
class="highlighter-rouge">md5</code> and <code
class="highlighter-rouge">sha1</code>) signature are all together in the .md2
archive</li>
<li>Add a <code class="highlighter-rouge">maven</code> directory under the
<code
class="highlighter-rouge">dist/dev/datasketches/<component>/A.B.0/</code></li>
<li>Bulk copy the <code class="highlighter-rouge">jar, asc, (optional
md5</code> and <code class="highlighter-rouge">sha1</code>) files into the
<code class="highlighter-rouge">maven</code> directory.</li>
diff --git a/output/docs/Frequency/FrequentItemsPigUDFs.html
b/output/docs/Frequency/FrequentItemsPigUDFs.html
index da2d3aa4..5564564c 100644
--- a/output/docs/Frequency/FrequentItemsPigUDFs.html
+++ b/output/docs/Frequency/FrequentItemsPigUDFs.html
@@ -589,7 +589,7 @@ dump g;
<p>In this example the results are exact due to small input.</p>
-<h3 id="datatxt-tab-separated"><a
href="/docs/FrequentItems/data.txt">data.txt</a> (tab separated)</h3>
+<h3 id="datatxt-tab-separated"><a href="/docs/Frequency/data.txt">data.txt</a>
(tab separated)</h3>
<div class="highlighter-rouge"><div class="highlight"><pre
class="highlight"><code>a c1
a c1
a c1
diff --git a/output/docs/HLL/HllSketchVsDruidHyperLogLogCollector.html
b/output/docs/HLL/HllSketchVsDruidHyperLogLogCollector.html
index 164cccbd..6cc1fb18 100644
--- a/output/docs/HLL/HllSketchVsDruidHyperLogLogCollector.html
+++ b/output/docs/HLL/HllSketchVsDruidHyperLogLogCollector.html
@@ -523,9 +523,9 @@
specific language governing permissions and limitations
under the License.
-->
-<h1 id="hll-sketch-vs-druid-hyperloglogcollector">HLL sketch vs Druid
HyperLogLogCollector</h1>
+<h1 id="hll-sketch-vs-druids-hyperunique-aggregator">HLL sketch vs Druid’s
HyperUnique Aggregator</h1>
-<p>The goal of this article is to compare the HLL sketch implemented in this
library to the <a href="https://github.com/apache/druid/tree/master/hll">Druid
HyperLogLogCollector</a>.</p>
+<p>The goal of this article is to compare the HLL sketch implemented in this
library to <a
href="https://druid.apache.org/docs/latest/querying/aggregations/#count-distinct">Druid’s
original HyperUnique aggregator sketch</a>.</p>
<h2 id="versions">Versions</h2>
diff --git a/output/docs/HLL/Hll_vs_CS_Hllpp.html
b/output/docs/HLL/Hll_vs_CS_Hllpp.html
index cbd78661..367f4251 100644
--- a/output/docs/HLL/Hll_vs_CS_Hllpp.html
+++ b/output/docs/HLL/Hll_vs_CS_Hllpp.html
@@ -761,7 +761,7 @@ Note that the Y-axis scale is now 100 nanoseconds. Some of
the peaks in these pl
<ul>
<li>[1] <a
href="https://github.com/apache/datasketches-java/tree/master/src/main/java/org/apache/datasketches/hll">DataSketches
HllSketch GitHub</a></li>
- <li>[2] <a href="/api/java/snapshot/apidocs/index.html">DataSketches
HllSketch JavaDocs</a></li>
+ <li>[2] <a
href="https://datasketches.apache.org/docs/HLL/HLL.html">DataSketches HllSketch
JavaDocs (top of page)</a></li>
<li>[3] <a
href="https://github.com/addthis/stream-lib/blob/master/src/main/java/com/clearspring/analytics/stream/cardinality/HyperLogLogPlus.java">HyperLogLogPlus
GitHub</a></li>
<li>[4] <a
href="https://static.googleusercontent.com/media/research.google.com/en//pubs/archive/40671.pdf">Google:
HyperLogLog in Practice: Algorithmic Engineering of a State of The Art
Cardinality Estimation Algorithm</a></li>
<li>[5] The Root-Mean-Square of the Relative Error (RMS-RE) is sensitive to
bias of the mean if there is any. However, if the bias is zero RMS-RE will
produce the same results as the theoretical Relative Standard Error (RSE) of
the stochastic process.</li>
diff --git a/output/docs/Quantiles/ClassicQuantilesSketch.html
b/output/docs/Quantiles/ClassicQuantilesSketch.html
index 452e48c8..25933b10 100644
--- a/output/docs/Quantiles/ClassicQuantilesSketch.html
+++ b/output/docs/Quantiles/ClassicQuantilesSketch.html
@@ -526,7 +526,7 @@
<h1 id="classic-quantiles-sketch">Classic Quantiles Sketch</h1>
<h2 id="quantiles-sketches-accuracy-and-size">Quantiles Sketches Accuracy and
Size</h2>
-<p>Please review the Quantiles <a
href="/docs/Quantiles/Definitions.html">Definitions</a>.</p>
+<p>Please review the Quantiles <a
href="/docs/Quantiles/SketchingQuantilesAndRanksTutorial.html">Tutorial</a>.</p>
<p>The accuracy of a quantile sketch is a function of the configured value
<i>k</i>, which also affects
the overall size of the sketch.</p>
diff --git a/output/docs/QuantilesStudies/DruidApproxHistogramStudy.html
b/output/docs/QuantilesStudies/DruidApproxHistogramStudy.html
index f4ce729f..0965f33a 100644
--- a/output/docs/QuantilesStudies/DruidApproxHistogramStudy.html
+++ b/output/docs/QuantilesStudies/DruidApproxHistogramStudy.html
@@ -527,7 +527,7 @@
<p>The goal of this article is to compare the accuracy performance of the
Druid built-in Approximate Histogram to an exact, brute-force computation using
actual data extracted from one of our back-end servers.</p>
-<p>Please get familiar with the <a
href="/docs/Quantiles/Definitions.html">Definitions</a> for quantiles.</p>
+<p>Please get familiar with the <a
href="/docs/Quantiles/SketchingQuantilesAndRanksTutorial.html">Tutorial</a> for
quantiles.</p>
<p>Compare this study with the DataSketches <a
href="/docs/QuantilesStudies/QuantilesStreamAStudy.html">Quantiles StreamA
Study</a> with the same input data.</p>
diff --git a/output/docs/QuantilesStudies/MomentsSketchStudy.html
b/output/docs/QuantilesStudies/MomentsSketchStudy.html
index 2bed0897..bb45c36f 100644
--- a/output/docs/QuantilesStudies/MomentsSketchStudy.html
+++ b/output/docs/QuantilesStudies/MomentsSketchStudy.html
@@ -527,7 +527,7 @@
<p>The goal of this article is to compare the accuracy performance of the
Moments Sketch to an exact, brute-force computation using actual data extracted
from one of our back-end servers.</p>
-<p>Please get familiar with the <a
href="/docs/Quantiles/Definitions.html">Definitions</a> for quantiles.</p>
+<p>Please get familiar with the <a
href="/docs/Quantiles/SketchingQuantilesAndRanksTutorial.html">Tutorial</a> for
quantiles.</p>
<p>Compare this study with the DataSketches <a
href="/docs/QuantilesStudies/QuantilesStreamAStudy.html">Quantiles StreamA
Study</a> with the same input data.</p>
diff --git a/output/docs/QuantilesStudies/QuantilesStreamAStudy.html
b/output/docs/QuantilesStudies/QuantilesStreamAStudy.html
index cb0e4294..cc5e4e86 100644
--- a/output/docs/QuantilesStudies/QuantilesStreamAStudy.html
+++ b/output/docs/QuantilesStudies/QuantilesStreamAStudy.html
@@ -527,7 +527,7 @@
<p>The goal of this article is to compare the accuracy performance of the
DataSketches Quantiles Sketch to an exact, brute-force computation using actual
data extracted from one of our back-end servers.</p>
-<p>Please get familiar with the <a
href="/docs/Quantiles/Definitions.html">Definitions</a> for quantiles.</p>
+<p>Please get familiar with the <a
href="/docs/Quantiles/SketchingQuantilesAndRanksTutorial.html">Tutorial</a> for
quantiles.</p>
<h2 id="versions">Versions</h2>
diff --git a/output/docs/Theta/ThetaAccuracy.html
b/output/docs/Theta/ThetaAccuracy.html
index 59c75049..f6526d7f 100644
--- a/output/docs/Theta/ThetaAccuracy.html
+++ b/output/docs/Theta/ThetaAccuracy.html
@@ -539,7 +539,7 @@ It is important to understand that the sketch has no idea
what the true value is
it only knows the internal state of the sketch.</p>
<p>From the mathematical theory of these sketches
-(see <a
href="https://github.com/apache/datasketches-website/tree/master/docs/pdf/SketchEquations.pdf">Sketch
Equations</a> and
+(see <a
href="https://github.com/apache/datasketches-website/tree/master/docs/pdf/ThetaSketchEquations.pdf">Theta
Sketch Equations</a> and
<a
href="https://github.com/apache/datasketches-website/tree/master/docs/pdf/ThetaSketchFramework.pdf">Theta
Sketch Framework</a>) we know:</p>
<ul>
diff --git a/output/docs/Theta/ThetaSketchFramework.html
b/output/docs/Theta/ThetaSketchFramework.html
index 4bde8a7e..5115ad74 100644
--- a/output/docs/Theta/ThetaSketchFramework.html
+++ b/output/docs/Theta/ThetaSketchFramework.html
@@ -580,9 +580,7 @@ We will discuss the RSE in a later section.</p>
<p>[1] Z. Bar-Yossef, T. Jayram, R. Kumar, D. Sivakumar, and L. Trevisan.
Counting distinct elements in a data stream. In <i>Randomization and
Approximation Techniques in Computer Science</i>, pages 1–10. Springer,
2002.</p>
-<p>[2] See <a href="/docs/Theta/KMVempty.html">KMV Tutorial</a> and
-<a
href="https://research.neustar.biz/2012/07/09/sketch-of-the-day-k-minimum-values/">Sketch
of the Day: K-Minimum Values</a>
-for a brief tutorials on KMV Sketches.</p>
+<p>[2] See <a href="/docs/Theta/KMVempty.html">KMV Tutorial</a> for a brief
tutorial on KMV Sketches.</p>
<p>[3] This is a limited “KMV perspective” on how <i>θ</i> gets assigned. The
attached paper
<a
href="https://github.com/apache/datasketches-website/tree/master/docs/pdf/ThetaSketchFramework.pdf">Theta
Sketch Framework</a>
diff --git a/output/docs/Theta/ThetaSketchSetOpsAccuracy.html
b/output/docs/Theta/ThetaSketchSetOpsAccuracy.html
index ebc43a4b..cf87ac0a 100644
--- a/output/docs/Theta/ThetaSketchSetOpsAccuracy.html
+++ b/output/docs/Theta/ThetaSketchSetOpsAccuracy.html
@@ -577,7 +577,7 @@ sketches) and the qualifying subset of values.</p>
zero, all, or some number in between of the retained values of the Union
sketch.
Mixed set expressions can produce an error distribution that is larger that of
a standard sketch
of a given <i>Nominal Entries</i> or <i>k</i> and is mathematically described
in
-<a
href="https://github.com/apache/datasketches-website/tree/master/docs/pdf/SketchEquations.pdf">Sketch
Equations / Subsets of Fixed <i>k</i> Sampling</a>.</p>
+<a
href="https://github.com/apache/datasketches-website/tree/master/docs/pdf/ThetaSketchEquations.pdf">Theta
Sketch Equations / 2.3 Subsets of Fixed <i>k</i> Sampling</a>.</p>
<h4
id="source-sketches-and-target-with-the-same-nominal-entries-or-k-1">Source
sketches and target with the same <i>Nominal Entries</i> or <i>k</i></h4>
@@ -634,7 +634,7 @@ or approximately 4 of the bottom values will remain.</li>
<p>The mean estimate from the intersection sketch will be 4/.001 = 4K.
This happens to be correct using this hand-wavy analysis but in general is a
random result with a variance.
The proof that the estimate will be unbiased is in the attached
-<a
href="https://github.com/apache/datasketches-website/tree/master/docs/pdf/SketchEquations.pdf">Sketch
Equations</a>.</p>
+<a
href="https://github.com/apache/datasketches-website/tree/master/docs/pdf/ThetaSketchEquations.pdf">Theta
Sketch Equations</a>.</p>
<p>The RSE of a sketch with only 4 values is ~ 1/sqrt(4) = .5 or 50% error.
This is considerably larger than the RSE of either <i>S<sub>A</sub></i> or
<i>S<sub>B</sub></i>,
---------------------------------------------------------------------
To unsubscribe, e-mail: [email protected]
For additional commands, e-mail: [email protected]