Re: [BUGS] BUG #1987: UTF8 encoding differences hamper upgrades

John Hansen Sun, 23 Oct 2005 11:23:36 -0700

Paul,

To fix your dump so it will import, run:


iconv -c -f UTF8 -t UTF8 -o fixed.sql dump.sql 

Kind Regards,

John Hansen

> -----Original Message-----
> From: [EMAIL PROTECTED] 
> [mailto:[EMAIL PROTECTED] On Behalf Of Paul Lindner
> Sent: Sunday, October 23, 2005 1:34 AM
> To: pgsql-bugs@postgresql.org
> Subject: [BUGS] BUG #1987: UTF8 encoding differences hamper upgrades
> 
> 
> The following bug has been logged online:
> 
> Bug reference:      1987
> Logged by:          Paul Lindner
> Email address:      [EMAIL PROTECTED]
> PostgreSQL version: 8.1beta3
> Operating system:   Fedora Core 4 x86_64
> Description:        UTF8 encoding differences hamper upgrades
> Details: 
> 
> I've been doing some test imports of UNICODE databases into 
> Postgres        
>                                                               
>               
>            
> 8.1beta3.  The only problem I've seen is that some data from 
> 8.0            
>                                                               
>               
>            
> databases will not import.                                    
>               
>                                                               
>               
>            
>                                                               
>               
>                                                               
>               
>            
> I've generated dumps using pg_dump from 8.0 and 8.1.  
> Attempting to         
>                                                               
>               
>            
> restore these results in                                      
>               
>                                                               
>               
>            
>                                                               
>               
>                                                               
>               
>            
>  Invalid UNICODE byte sequence detected near byte ...         
>               
>                                                               
>               
>            
>                                                               
>               
>                                                               
>               
>            
> Question:                                                     
>               
>                                                               
>               
>            
>                                                               
>               
>                                                               
>               
>            
> Does the 8.1 Unicode sanity code accept the full set of 
> characters          
>                                                               
>               
>            
> accepted by the 8.0 Unicode sanity code?                      
>               
>                                                               
>               
>            
>                                                               
>               
>                                                               
>               
>            
> If not we'll see a lot of problems like the one above.        
>               
>                                                               
>               
>            
>                                                               
>               
>                                                               
>               
>            
>                                                               
>               
>                                                               
>               
>            
> I believe this patch is the one causing the problem I see:    
>               
>                                                               
>               
>            
>                                                               
>               
>                                                               
>               
>            
>  
> http://www.mail-archive.com/pgsql-patches@postgresql.org/msg08
198/unicode.di
> ff                                                            
>               
>           
>                                                               
>               
>                                                               
>               
>            
>                                                               
>               
>                                                               
>               
>            
> Is there any solution other than scrubbing the entire dataset 
> to            
>                                                               
>               
>            
> conform to the new (8.1) encoding rules?
> 
> ---------------------------(end of 
> broadcast)---------------------------
> TIP 5: don't forget to increase your free space map settings
> 
> 

---------------------------(end of broadcast)---------------------------
TIP 4: Have you searched our list archives?

               http://archives.postgresql.org

Re: [BUGS] BUG #1987: UTF8 encoding differences hamper upgrades

Reply via email to