Merge pull request #2437 from corob-msft/cr-524

TedA-M · web-flow · commit 916f376b4ab8 · 2019-10-22T16:58:49.000-07:00
Windows 18671203 c16rtomb, mbrtoc16 issue
diff --git a/docs/c-runtime-library/interpretation-of-multibyte-character-sequences.md b/docs/c-runtime-library/interpretation-of-multibyte-character-sequences.md
@@ -1,15 +1,15 @@
 ---
-title: "Interpretation of Multibyte-Character Sequences"
-ms.date: "04/11/2018"
+title: "Interpretation of multibyte-character sequences"
+ms.date: "10/22/2019"
 f1_keywords: ["c.character.multibyte"]
 helpviewer_keywords: ["MBCS [C++], locale code page"]
 ms.assetid: da9150de-70ea-4d2f-90e6-ddb9202dd80b
 ---
-# Interpretation of Multibyte-Character Sequences
+# Interpretation of multibyte-character sequences
 
-Most multibyte-character routines in the Microsoft run-time library recognize multibyte-character sequences relating to a multibyte code page. The output value is affected by the setting of the **LC_CTYPE** category setting of the locale; see [setlocale](../c-runtime-library/reference/setlocale-wsetlocale.md) for more information. The versions of these functions without the **_l** suffix use the current locale for this locale-dependent behavior; the versions with the **_l** suffix are identical except that they use the locale parameter passed in instead.
+Most multibyte-character routines in the Microsoft run-time library recognize multibyte-character sequences relating to a multibyte code page. The output value is affected by the **LC_CTYPE** category setting of the locale. For more information, see [setlocale](../c-runtime-library/reference/setlocale-wsetlocale.md). The versions of these functions without the **_l** suffix use the current locale for this locale-dependent behavior. The versions with the **_l** suffix are identical, except they use the locale parameter instead of the current locale.
 
-## Locale-Dependent Multibyte Routines
+## Locale-dependent multibyte routines
 
 |Routine|Use|
 |-------------|---------|
@@ -19,10 +19,15 @@ Most multibyte-character routines in the Microsoft run-time library recognize mu
 |[mbtowc, _mbtowc_l](../c-runtime-library/reference/mbtowc-mbtowc-l.md)|Convert multibyte character to corresponding wide character|
 |[wcstombs, _wcstombs_l](../c-runtime-library/reference/wcstombs-wcstombs-l.md), [wcstombs_s, _wcstombs_s_l](../c-runtime-library/reference/wcstombs-s-wcstombs-s-l.md)|Convert sequence of wide characters to corresponding sequence of multibyte characters|
 |[wctomb, _wctomb_l](../c-runtime-library/reference/wctomb-wctomb-l.md), [wctomb_s, _wctomb_s_l](../c-runtime-library/reference/wctomb-s-wctomb-s-l.md)|Convert wide character to corresponding multibyte character|
-|[mbrtoc16, mbrtoc32](../c-runtime-library/reference/mbrtoc16-mbrtoc323.md)|Convert multibyte character to equivalent UTF-16 or UTF-32 character|
-|[c16rtomb, c32rtomb](../c-runtime-library/reference/c16rtomb-c32rtomb1.md)|Convert UTF-16 or UTF-32 character to equivalent multibyte character|
+
+## Locale-independent multibyte routines
+
+|Routine|Use|
+|-------------|---------|
+|[mbrtoc16, mbrtoc32](../c-runtime-library/reference/mbrtoc16-mbrtoc323.md)|Convert multibyte UTF-8 character to equivalent UTF-16 or UTF-32 character|
+|[c16rtomb, c32rtomb](../c-runtime-library/reference/c16rtomb-c32rtomb1.md)|Convert UTF-16 or UTF-32 character to equivalent UTF-8 multibyte character|
 
 ## See also
 
-[Internationalization](../c-runtime-library/internationalization.md)<br/>
-[Universal C runtime routines by category](../c-runtime-library/run-time-routines-by-category.md)<br/>
+[Internationalization](../c-runtime-library/internationalization.md)\
+[Universal C runtime routines by category](../c-runtime-library/run-time-routines-by-category.md)
diff --git a/docs/c-runtime-library/reference/c16rtomb-c32rtomb1.md b/docs/c-runtime-library/reference/c16rtomb-c32rtomb1.md
@@ -1,6 +1,6 @@
 ---
 title: "c16rtomb, c32rtomb"
-ms.date: "01/22/2018"
+ms.date: "10/22/2019"
 api_name: ["c16rtomb", "c32rtomb"]
 api_location: ["msvcrt.dll", "msvcr80.dll", "msvcr90.dll", "msvcr100.dll", "msvcr100_clr0400.dll", "msvcr110.dll", "msvcr110_clr0400.dll", "msvcr120.dll", "msvcr120_clr0400.dll", "ucrtbase.dll", "api-ms-win-crt-convert-l1-1-0.dll"]
 api_type: ["DLLExport"]
@@ -11,7 +11,7 @@ ms.assetid: 7f5743ca-a90e-4e3f-a310-c73e16f4e14d
 ---
 # c16rtomb, c32rtomb
 
-Convert a UTF-16 or UTF-32 wide character into a multibyte character in the current locale.
+Convert a UTF-16 or UTF-32 wide character into a UTF-8 multibyte character.
 
 ## Syntax
 
@@ -30,40 +30,44 @@ size_t c32rtomb(
 
 ### Parameters
 
-*mbchar*<br/>
-Pointer to an array to store the multibyte converted character.
+*mbchar*\
+Pointer to an array to store the converted UTF-8 multibyte character.
 
-*wchar*<br/>
+*wchar*\
 A wide character to convert.
 
-*state*<br/>
+*state*\
 A pointer to an **mbstate_t** object.
 
-## Return Value
+## Return value
 
-The number of bytes stored in array object *mbchar*, including any shift sequences. If *wchar* is not a valid wide character, the value (**size_t**)(-1) is returned, **errno** is set to **EILSEQ**, and the value of *state* is unspecified.
+The number of bytes stored in array object *mbchar*, including any shift sequences. If *wchar* isn't a valid wide character, the value (**size_t**)(-1) is returned, **errno** is set to **EILSEQ**, and the value of *state* is unspecified.
 
 ## Remarks
 
-The **c16rtomb** function converts the UTF-16 character *wchar* to the equivalent multibyte narrow character sequence in the current locale. If *mbchar* is not a null pointer, the function stores the converted sequence in the array object pointed to by *mbchar*. Up to **MB_CUR_MAX** bytes are stored in *mbchar*, and *state* is set to the resulting multibyte shift state.    If *wchar* is a null wide character, a sequence required to restore the initial shift state is stored, if needed, followed by the null character, and *state* is set to the initial conversion state. The **c32rtomb** function is identical, but converts a UTF-32 character.
+The **c16rtomb** function converts the UTF-16 LE character *wchar* to the equivalent UTF-8 multibyte narrow character sequence. If *mbchar* isn't a null pointer, the function stores the converted sequence in the array object pointed to by *mbchar*. Up to **MB_CUR_MAX** bytes are stored in *mbchar*, and *state* is set to the resulting multibyte shift state.
+
+If *wchar* is a null wide character, a sequence required to restore the initial shift state is stored, if needed, followed by the null character. *state* is set to the initial conversion state. The **c32rtomb** function is identical, but converts a UTF-32 character.
 
 If *mbchar* is a null pointer, the behavior is equivalent to a call to the function that substitutes an internal buffer for *mbchar* and a wide null character for *wchar*.
 
-The *state* conversion state object allows you to make subsequent calls to this function and other restartable functions that maintain the shift state of the multibyte output characters. Results are undefined when you mix the use of restartable and non-restartable functions, or if a call to **setlocale** is made between restartable function calls.
+The *state* conversion state object allows you to make subsequent calls to this function and other restartable functions that maintain the shift state of the multibyte output characters. Results are undefined when you mix the use of restartable and non-restartable functions.
+
+To convert UTF-16 characters into non-UTF-8 multibyte characters, use the [wcstombs, _wcstombs_l](wcstombs-wcstombs-l.md) or [wcstombs_s, _wcstombs_s_l](wcstombs-s-wcstombs-s-l.md) functions.
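
The conversion described in these remarks can be sketched as follows. This is a minimal illustration, not part of the change itself: `encode_code_point` is a hypothetical helper name, and the sketch assumes a zero-filled **mbstate_t** represents the initial conversion state. On the Microsoft CRT described here the output is UTF-8 regardless of locale; the C standard ties the output encoding to the current locale, so other implementations may need a UTF-8 locale selected first.

```c
#include <assert.h>
#include <limits.h>   /* MB_LEN_MAX */
#include <string.h>   /* memset */
#include <uchar.h>    /* char32_t, c32rtomb, mbstate_t */

/* Hypothetical helper (not a CRT function): encodes one UTF-32 code point
   into out, which must have room for at least MB_LEN_MAX bytes.
   Returns the number of bytes stored, or (size_t)-1 if the code point
   can't be converted (errno is then set to EILSEQ by c32rtomb). */
size_t encode_code_point(char32_t code_point, char *out)
{
    mbstate_t state;

    assert(out != NULL);

    /* Assumption: an all-zero mbstate_t is the initial conversion state. */
    memset(&state, 0, sizeof state);

    return c32rtomb(out, code_point, &state);
}
```

A caller would typically pass a `char` buffer of size `MB_LEN_MAX`, check the result against `(size_t)-1`, and then append the returned number of bytes to its output.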
 
 ## Requirements
 
 |Routine|Required header|
 |-------------|---------------------|
 |**c16rtomb**, **c32rtomb**|C, C++: \<uchar.h>|
 
-For compatibility information, see [Compatibility](../../c-runtime-library/compatibility.md).
+For compatibility information, see [Compatibility](../compatibility.md).
 
 ## See also
 
-[Data Conversion](../../c-runtime-library/data-conversion.md)<br/>
-[Locale](../../c-runtime-library/locale.md)<br/>
-[Interpretation of Multibyte-Character Sequences](../../c-runtime-library/interpretation-of-multibyte-character-sequences.md)<br/>
-[mbrtoc16, mbrtoc32](mbrtoc16-mbrtoc323.md)<br/>
-[wcrtomb](wcrtomb.md)<br/>
-[wcrtomb_s](wcrtomb-s.md)<br/>
+[Data conversion](../data-conversion.md)\
+[Locale](../locale.md)\
+[Interpretation of multibyte-character sequences](../interpretation-of-multibyte-character-sequences.md)\
+[mbrtoc16, mbrtoc32](mbrtoc16-mbrtoc323.md)\
+[wcrtomb](wcrtomb.md)\
+[wcrtomb_s](wcrtomb-s.md)
diff --git a/docs/c-runtime-library/reference/mbrtoc16-mbrtoc323.md b/docs/c-runtime-library/reference/mbrtoc16-mbrtoc323.md
@@ -1,6 +1,6 @@
 ---
 title: "mbrtoc16, mbrtoc323"
-ms.date: "11/04/2016"
+ms.date: "10/22/2019"
 api_name: ["mbrtoc16", "mbrtoc32"]
 api_location: ["msvcrt.dll", "msvcr80.dll", "msvcr90.dll", "msvcr100.dll", "msvcr100_clr0400.dll", "msvcr110.dll", "msvcr110_clr0400.dll", "msvcr120.dll", "msvcr120_clr0400.dll", "ucrtbase.dll", "api-ms-win-crt-convert-l1-1-0.dll"]
 api_type: ["DLLExport"]
@@ -11,7 +11,7 @@ ms.assetid: 099ade4d-56f7-4e61-8b45-493f1d7a64bd
 ---
 # mbrtoc16, mbrtoc32
 
-Translates the first multibyte character in a narrow string into the equivalent UTF-16 or UTF-32 character.
+Translates the first UTF-8 multibyte character in a string into the equivalent UTF-16 or UTF-32 character.
 
 ## Syntax
 
@@ -33,52 +33,54 @@ size_t mbrtoc32(
 
 ### Parameters
 
-*destination*<br/>
-Pointer to the **char16_t** or **char32_t** equivalent of the multibyte character to convert. If null, the function does not store a value.
+*destination*\
+Pointer to the **char16_t** or **char32_t** equivalent of the UTF-8 multibyte character to convert. If null, the function doesn't store a value.
 
-*source*<br/>
-Pointer to the multibyte character string to convert.
+*source*\
+Pointer to the UTF-8 multibyte character string to convert.
 
-*max_bytes*<br/>
-The maximum number of bytes in *source* to examine for a character to convert. This should be a value between one and the number of bytes, including any null terminator, remaining in *source*.
+*max_bytes*\
+The maximum number of bytes in *source* to examine for a character to convert. This argument should be a value between one and the number of bytes, including any null terminator, remaining in *source*.
 
-*state*<br/>
-Pointer to a **mbstate_t** conversion state object used to interpret the multibyte string to one or more output characters.
+*state*\
+Pointer to an **mbstate_t** conversion state object used to interpret the UTF-8 multibyte string to one or more output characters.
 
-## Return Value
+## Return value
 
 On success, returns the value of the first of these conditions that applies, given the current *state* value:
 
 |Value|Condition|
 |-----------|---------------|
-|0|The next *max_bytes* or fewer characters converted from *source* correspond to the null wide character, which is the value stored if *destination* is not null.<br /><br /> *state* contains the initial shift state.|
-|Between 1 and *max_bytes*, inclusive|The value returned is the number of bytes of *source* that complete a valid multibyte character. The converted wide character is stored  if *destination* is not null.|
-|-3|The next wide character resulting from a previous call to the function has been stored in *destination* if *destination* is not null. No bytes from *source* are consumed by this call to the function.<br /><br /> When  *source* points to a multibyte character that requires more than one wide character to represent (for example, a surrogate pair), then the *state* value is updated so that the next function call writes  out the additional character.|
-|-2|The next *max_bytes* bytes represent an incomplete, but potentially valid, multibyte character. No value is stored in *destination*. This result can occur if *max_bytes* is zero.|
-|-1|An encoding error has occurred. The next *max_bytes* or fewer bytes do not contribute to a complete and valid multibyte character. No value is stored in *destination*.<br /><br /> **EILSEQ** is stored in **errno** and the conversion state *state* is unspecified.|
+|0|The next *max_bytes* or fewer characters converted from *source* correspond to the null wide character, which is the value stored if *destination* isn't null.<br /><br /> *state* contains the initial shift state.|
+|Between 1 and *max_bytes*, inclusive|The value returned is the number of bytes of *source* that complete a valid multibyte character. The converted wide character is stored if *destination* isn't null.|
+|-3|The next wide character resulting from a previous call to the function has been stored in *destination* if *destination* isn't null. No bytes from *source* are consumed by this call to the function.<br /><br /> When *source* points to a UTF-8 multibyte character that requires more than one wide character to represent (for example, a surrogate pair), then the *state* value is updated so that the next function call writes out the additional character.|
+|-2|The next *max_bytes* bytes represent an incomplete, but potentially valid, UTF-8 multibyte character. No value is stored in *destination*. This result can occur if *max_bytes* is zero.|
+|-1|An encoding error has occurred. The next *max_bytes* or fewer bytes don't contribute to a complete and valid UTF-8 multibyte character. No value is stored in *destination*.<br /><br /> **EILSEQ** is stored in **errno** and the conversion state value *state* is unspecified.|
 
 ## Remarks
 
-The **mbrtoc16** function reads up to *max_bytes* bytes from *source* to find  the first complete, valid multibyte character, and then stores the equivalent UTF-16 character in *destination*. The source bytes are interpreted according to the current thread multibyte locale. If the multibyte character requires more than one UTF-16 output character, such as a surrogate pair, then the *state* value is set to store the next UTF-16 character in *destination* on the next call to **mbrtoc16**. The **mbrtoc32** function is identical, but output is stored as a UTF-32 character.
+The **mbrtoc16** function reads up to *max_bytes* bytes from *source* to find the first complete, valid UTF-8 multibyte character, and then stores the equivalent UTF-16 character in *destination*. If the character requires more than one UTF-16 output character, such as a surrogate pair, then the *state* value is set to store the next UTF-16 character in *destination* on the next call to **mbrtoc16**. The **mbrtoc32** function is identical, but output is stored as a UTF-32 character.
 
-If *source* is null, these functions return the equivalent of a call made using arguments of **NULL** for *destination*, **""** for *source*,  and 1 for *max_bytes*. The passed values of *destination* and *max_bytes* are ignored.
+If *source* is null, these functions return the equivalent of a call made using arguments of **NULL** for *destination*, `""` (an empty, null-terminated string) for *source*, and 1 for *max_bytes*. The passed values of *destination* and *max_bytes* are ignored.
 
-If *source* is not null, the function starts at the beginning of the string and inspects up to *max_bytes* bytes to determine the number of bytes required to complete the next multibyte character, including any shift sequences. If the examined bytes contain a valid and complete multibyte character, the function converts the character into the equivalent 16-bit or 32-bit wide character or characters. If *destination* is not null, the function stores the first (and possibly only) result character in destination. If additional output characters are required, a value is set in *state*, so that subsequent calls to the function output the additional characters and return the value -3. If no more output characters are required, then *state* is set to the initial shift state.
+If *source* isn't null, the function starts at the beginning of the string and inspects up to *max_bytes* bytes to determine the number of bytes required to complete the next UTF-8 multibyte character, including any shift sequences. If the examined bytes contain a valid and complete UTF-8 multibyte character, the function converts the character into the equivalent 16-bit or 32-bit wide character or characters. If *destination* isn't null, the function stores the first (and possibly only) result character in destination. If additional output characters are required, a value is set in *state*, so that subsequent calls to the function output the additional characters and return the value -3. If no more output characters are required, then *state* is set to the initial shift state.
+
+To convert non-UTF-8 multibyte characters to UTF-16 LE characters, use the [mbrtowc](mbrtowc.md) or [mbtowc, _mbtowc_l](mbtowc-mbtowc-l.md) functions.
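
The return values in the table above can be handled in a loop along these lines. This is a sketch under stated assumptions rather than a reference implementation: `to_utf16` is a hypothetical helper name, the input is assumed to be a complete null-terminated string, and a zero-filled **mbstate_t** is assumed to represent the initial conversion state. On implementations other than the Microsoft CRT, the C standard interprets the source bytes according to the current locale, so a UTF-8 locale may need to be selected first.

```c
#include <assert.h>
#include <string.h>   /* memset, strlen */
#include <uchar.h>    /* char16_t, mbrtoc16, mbstate_t */

/* Hypothetical helper (not a CRT function): converts the null-terminated
   string src into UTF-16 code units in dst, which has room for dst_cap
   units including the terminator. Returns the number of units stored,
   not counting the terminator, or (size_t)-1 on an encoding error or
   if dst is too small. */
size_t to_utf16(const char *src, char16_t *dst, size_t dst_cap)
{
    mbstate_t state;
    size_t remaining;
    size_t written = 0;

    assert(src != NULL && dst != NULL);

    /* Assumption: an all-zero mbstate_t is the initial conversion state. */
    memset(&state, 0, sizeof state);
    remaining = strlen(src) + 1;        /* include the null terminator */

    while (written < dst_cap) {
        char16_t unit;
        size_t rc = mbrtoc16(&unit, src, remaining, &state);

        if (rc == (size_t)-1 || rc == (size_t)-2)
            return (size_t)-1;          /* invalid or incomplete sequence */

        dst[written] = unit;
        if (rc == 0)
            return written;             /* terminator stored; conversion done */
        written++;

        if (rc != (size_t)-3) {         /* -3 means no source bytes were consumed */
            src += rc;
            remaining -= rc;
        }
    }
    return (size_t)-1;                  /* dst_cap exhausted before the terminator */
}
```

Because a character outside the Basic Multilingual Plane produces two calls' worth of output (the byte count, then -3), the loop advances *source* only when the return value is a byte count.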
 
 ## Requirements
 
 |Function|C header|C++ header|
 |--------------|--------------|------------------|
 |**mbrtoc16**, **mbrtoc32**|\<uchar.h>|\<cuchar>|
 
-For additional compatibility information, see [Compatibility](../../c-runtime-library/compatibility.md).
+For additional compatibility information, see [Compatibility](../compatibility.md).
 
 ## See also
 
-[Data Conversion](../../c-runtime-library/data-conversion.md)<br/>
-[Locale](../../c-runtime-library/locale.md)<br/>
-[Interpretation of Multibyte-Character Sequences](../../c-runtime-library/interpretation-of-multibyte-character-sequences.md)<br/>
-[c16rtomb, c32rtomb](c16rtomb-c32rtomb1.md)<br/>
-[mbrtowc](mbrtowc.md)<br/>
-[mbsrtowcs](mbsrtowcs.md)<br/>
-[mbsrtowcs_s](mbsrtowcs-s.md)<br/>
+[Data conversion](../data-conversion.md)\
+[Locale](../locale.md)\
+[Interpretation of multibyte-character sequences](../interpretation-of-multibyte-character-sequences.md)\
+[c16rtomb, c32rtomb](c16rtomb-c32rtomb1.md)\
+[mbrtowc](mbrtowc.md)\
+[mbsrtowcs](mbsrtowcs.md)\
+[mbsrtowcs_s](mbsrtowcs-s.md)