unicode-strings

About

The unicode-strings module was written to make transforming strings from Unicode to ASCII simpler. This transformation may come in useful when trying to use strings with systems that aren't compatible with the extended Unicode character set.

1. Installation

npm install git+https://github.com/cbroad/unicode-strings.git

2. Importing

ES6 Syntax

import * as UnicodeStrings from "unicode-strings"
// Alternatively
import { escapeString, unescapeString } from "unicode-strings"

CommonJS Syntax

const UnicodeStrings = require( "unicode-strings" );
// Alternatively
const { escapeString, unescapeString } = require( "unicode-strings" );

3. Usage

const asciiString = UnicodeStrings.escapeString("Good Morning! おはようございます 🐡");
console.log( asciiString );
// Good Morning! \u304a\u306f\u3088\u3046\u3054\u3056\u3044\u307e\u3059 \ud83d\udc21
const unicodeString = unescapeString("Good Morning! \\u304a\\u306f\\u3088\\u3046\\u3054\\u3056\\u3044\\u307e\\u3059 \\ud83d\\udc21");
console.log( unicodeString );
// Good Morning! おはようございます 🐡

API

Function	Description
`UnicodeStrings.escapeString()`	Blah
`UnicodeStrings.unescapeString()`	Blah
`UnicodeStrings.unicodeEscapeString()`	Blah

UnicodeStrings.escapeString( str )

Transforms a string. Any unicode characters will be backslash-escaped using the method producing the shortest length possible.

Escape characters encoded include:

"\b", "\f", "\n", "\r", "\t", "\v"
"\oo" - 2-digit octal-value for characters with value<0x20
"\xXX" - 2-digit hex-value for characters with value such that 0x80<=value<=0xff
"\uXXXX" - 4-digit hex-value for characters with value such that 0x100<=value<=0xffff


str	a string possibly containing unicode characters
returns	encoded string

UnicodeStrings.unescapeString( str )

Transforms a string. Any escape sequences will be converted to their character equivalents.

Escape characters decoded include:

"\b", "\f", "\n", "\r", "\t", "\v"
"\oo" - 2-digit octal-value for characters with value<0x20
"\xXX" - 2-digit hex-value for characters with value such that 0x80<=value<=0xff
"\uXXXX" - 4-digit hex-value for characters with value such that 0x100<=value<=0xffff


str	a string possibly containing escaped characters
returns	decoded string

UnicodeStrings.unicodeEscapeString( str )

Transforms a string. Only uses single character escape sequences and unicode escape sequences. This is useful because only these escape sequences are allowed in JSON formatted data.

Escape characters decoded include:

"\b", "\f", "\n", "\r", "\t", "\v"
"\uXXXX" - 4-digit hex-value for characters with value not in the range that 0x20<=value<0x80


str	a string possibly containing escaped characters
returns	decoded string

Name		Name	Last commit message	Last commit date
Latest commit History 16 Commits
src		src
.eslintignore		.eslintignore
.eslintrc.json		.eslintrc.json
.gitignore		.gitignore
License.md		License.md
License.txt		License.txt
README.md		README.md
package.json		package.json
tsconfig.json		tsconfig.json

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Licenses found

Repository files navigation

unicode-strings

About

API

UnicodeStrings.escapeString( str )

UnicodeStrings.unescapeString( str )

UnicodeStrings.unicodeEscapeString( str )

About

Licenses found

Releases

Packages

Languages

License

Licenses found

cbroad/unicode-strings

Folders and files

Latest commit

History

Repository files navigation

unicode-strings

About

API

UnicodeStrings.escapeString( str )

UnicodeStrings.unescapeString( str )

UnicodeStrings.unicodeEscapeString( str )

About

Resources

License

Licenses found

Stars

Watchers

Forks

Releases

Packages 0

Languages

Packages