I like your examples, Richard, of strings that should not be considered as
numbers. There are times you want to recognize something like "221B Baker
Street" and extract the 221 or perhaps the 221B as a part of a full address and
ignore the rest. But one goal of a function like as.numeric() is to
This is not for the R inferno.
This is for the Microsoft interno, or perhaps the Unicode inferno.
The Byte Order Mark is supposed to appear at the beginning of UTF-32
or UTF-16 *external* data, like a file
or data coming over a socket.
In the Microsoft world, it also tends to appear at the beginnin
The zero-width no-break space character is used as the Byte Order
Mark. That is, an official function for it at the beginning of a
character sequence
is to indicate whether you have 2-byte or 4-byte big-endian or
little-endian encoding. It was not intended for use in UTF-8, where
there is nothing
This issue looks like grist for the R Inferno.
cheers,
Rolf
On Mon, 3 Mar 2025 12:19:02 -0500
wrote:
> The second solution Ivan offers looks good, and a bit more general
> than his first that simply removes one non-visible character.
>
> It begs the question of why the data has that anomal
The second solution Ivan offers looks good, and a bit more general than his
first that simply removes one non-visible character.
It begs the question of why the data has that anomaly at all. Did the data come
from a text-processing environment where it was going to wrap there and was
protected?
Thanks all for your responses. My problem is solved.
Sent from my iPhone
On Mar 3, 2025, at 1:08 AM, Deepayan Sarkar wrote:
This is possible but a little cumbersome. Bert is on the right track with
strip.custom:
xyplot(ts(cbind(x1 = cs, x2 = cs + w)),
strip = strip.custom(factor.lev
В Mon, 3 Mar 2025 12:08:43 +0530
Christofer Bogaso пишет:
> dat2 = c("-24.43728533300", "4.8506950",
> "-1.91849566670",
>
> "2.6418180", "6.77752766670", "3.2080840",
>
> "4.19328766670", "0.3782577", "4.6589550",
>
> "-9.881474
?tools::showNonASCII
On March 3, 2025 12:09:22 AM PST, Ivan Krylov via R-help
wrote:
>В Mon, 3 Mar 2025 13:21:31 +0530
>Christofer Bogaso пишет:
>
>> Is there any way to remove all possible "Unicode character" that may
>> be present in the array at once?
>
>Define a range of characters you cons
В Mon, 3 Mar 2025 13:21:31 +0530
Christofer Bogaso пишет:
> Is there any way to remove all possible "Unicode character" that may
> be present in the array at once?
Define a range of characters you consider acceptable, and you'll be
able to use regular expressions to remove everything else. For e
9 matches
Mail list logo