[ 
https://issues.apache.org/jira/browse/PIG-5468?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=17943956#comment-17943956
 ] 

Rohini Palaniswamy commented on PIG-5468:
-----------------------------------------

[~niallp],
  I did remove the external svn link and copied locally. That is committed now.
{code:java}
svn propdel svn:externals author/src/documentation
rm -rf author/src/documentation/skins
svn copy 
https://svn.apache.org/repos/asf/hadoop/common/site/main/author/src/documentation/skins
 author/src/documentation/skins
svn commit -m "PIG-5468: Remove svn external link to hadoop skins and copy 
locally"
{code}

Tried to apply the pig-ga.patch. But it was generated assuming the skins 
directory was nonexistent. So applied separately and tried to diff and 
reconcile the changes or site-to-xhtml.xsl. But I could not find anything 
related to removing Google Analytics in site-to-xhtml.xsl. Found two other xml 
files with references to Google, but they were not part of the patch. Can you 
take a look ?

{code}
find . -type f -not -path '*/\.*' | grep -v "./publish" | xargs grep -i google
./author/src/documentation/skins/hadoop-pelt/xslt/html/site-to-xhtml.xsl:    
ga.src = ('https:' == document.location.protocol ? 'https://ssl' : 
'http://www') + '.google-analytics.com/ga.js';
./author/src/documentation/skins/hadoop-pelt/xslt/html/site-to-xhtml.xsl:       
             <form class="roundtopsmall" method="get" 
action="http://www.google.com/search";>
./author/src/documentation/skins/hadoop-pelt/xslt/html/site-to-xhtml.xsl:       
       <form method="get" action="http://www.google.com/search";>
./author/src/documentation/skinconf.xml:  <!-- To enable lucene search add 
provider="lucene" (default is google).
./author/src/documentation/skinconf.xml:    no search box. @domain will enable 
sitesearch for the specific domain with google.
./author/src/documentation/skinconf.xml:    In other words google will search 
the @domain for the query string.
./author/src/documentation/skinconf.xml:  <search domain="pig.apache.org" 
provider="google"/>
./author/src/documentation/content/xdocs/privacypolicy.xml:<a 
href="http://www.google.com/analytics/";>Google Analytics</a>
./author/src/documentation/content/xdocs/privacypolicy.xml:service and handled 
by Google as
./author/src/documentation/content/xdocs/privacypolicy.xml:described in their 
<a href="http://www.google.com/privacy.html";>privacy policy</a>.
./author/src/documentation/content/xdocs/privacypolicy.xml:cookie if you prefer 
not to share this data with Google.</p>
{code}




> Remove Google Analytics from the Pig Website
> --------------------------------------------
>
>                 Key: PIG-5468
>                 URL: https://issues.apache.org/jira/browse/PIG-5468
>             Project: Pig
>          Issue Type: Task
>            Reporter: Niall Pemberton
>            Assignee: Niall Pemberton
>            Priority: Major
>         Attachments: pig-ga.patch
>
>
> Hi Pig Team
> The ASF {_}*Privacy Policy*{_}[1][2] does not permit the use of _*Google 
> Analytics*_ on any ASF websites and the ASF Infra team will soon enforce a 
> {_}*Content Security Policy*{_}(CSP) that will block access to external 
> trackers:
>  * [https://lists.apache.org/thread/w34sd92v4rz3j28hyddmt5tbprbdq6lc]
> Please could you remove the use of the Google Analytics from the Pig website?
>  * [https://lists.apache.org/thread/4I 
> 17v5034773lhoytoqs7h343vpjnwn1c|https://lists.apache.org/thread/417v5034773lhoytoqs7h343vpjnwn1c]
> I would have submitted a patch to remove Google Analytics, but it seems that 
> Pig is pulling in file _*site-to-xhtml.xsl*_ from the old Hadoop Subversion 
> repository, via an svn:external link
>  * 
> [https://svn.apache.org/viewvc/hadoop/common/site/main/author/src/documentation/skins/hadoop-pelt/xslt/html/site-to-xhtml.xsl?view=markup]
>  * [https://svn.apache.org/viewvc/pig/site/author/src/documentation/]
> Since Hadoop has now moved to GitHub, then it would probably be a good idea 
> to copy that folder (as its not longer maintained) into your site and 
> maintain it within the Pig project - or ask the hadoop project to modify it
>  
> The ASF hosts its own _*Matomo*_ instance to provide projects with analytics 
> and you can request a tracking id for your project by sending a mail to 
> *privacy AT apache.org.*
>  * 
> [https://privacy.apache.org/faq/committers.html#can-i-use-web-analytics-matomo]
> Additionally I would recommend reviewing any external resources loaded by 
> your website. The Content Security Policy will prevent any resources being 
> loaded from 3rd Party providers that the ASF does not have a Data Processing 
> Agreement (DPA) with. On the 1st February Infra will begin a temporary 
> "brownout" when the CSP will be turned on for a short period. This will allow 
> projects to check which parts, if any, of their websites will stop working. 
> The Privacy FAQ answers a number of questions about which external providers 
> are permitted or not:
>  * [https://privacy.apache.org/faq/committers.html]
> Thanks
> Niall
> [1] [https://privacy.apache.org/policies/website-policy.html]
> [2] 
> [https://privacy.apache.org/faq/committers.html#can-i-use-google-analytics]



--
This message was sent by Atlassian Jira
(v8.20.10#820010)

Reply via email to