man-pages/SOURCES/man-pages-4.15-nl_langinfo.3-charsets.7-codesets.patch
2021-09-09 21:49:09 +00:00

46 lines
1.6 KiB
Diff

From 260d71520ec0c96ef20670eca17d7b08cc9601ad Mon Sep 17 00:00:00 2001
From: =?UTF-8?q?Nikola=20Forr=C3=B3?= <nforro@redhat.com>
Date: Fri, 27 Jul 2018 17:10:43 +0200
Subject: [PATCH 1/5] nl_langinfo.3: note that Latin-1 is the default codeset
for en_US locale charsets.7: note that UTF-8 is the recommended encoding for
all locales
---
man3/nl_langinfo.3 | 6 ++++++
man7/charsets.7 | 2 ++
2 files changed, 8 insertions(+)
diff --git a/man3/nl_langinfo.3 b/man3/nl_langinfo.3
index e37f07b..63de5c2 100644
--- a/man3/nl_langinfo.3
+++ b/man3/nl_langinfo.3
@@ -167,6 +167,12 @@ or
.PP
POSIX specifies that the application may not modify
the string returned by these functions.
+.PP
+Codeset for en_US defaults to ISO-8859-1 (Latin-1).
+The Latin-1 default has historical reasons,
+since all Unix systems originally used only 8-bit character encoding.
+For more information about ISO-8859-1 see
+.BR charsets (7).
.SH ATTRIBUTES
For an explanation of the terms used in this section, see
.BR attributes (7).
diff --git a/man7/charsets.7 b/man7/charsets.7
index 5f91784..9de88fe 100644
--- a/man7/charsets.7
+++ b/man7/charsets.7
@@ -29,6 +29,8 @@ ASCII, GB 2312, ISO 8859, JIS, KOI8-R, KS, and Unicode.
The primary emphasis is on character sets that were actually used by
locale character sets, not the myriad others that could be found in data
from other systems.
+.LP
+The recommended encoding in all settings and locales is UTF-8.
.SS ASCII
ASCII (American Standard Code For Information Interchange) is the original
7-bit character set, originally designed for American English.
--
2.17.1