New module to parse a simple markup language

2000-10-24  Havoc Pennington  <hp@pobox.com>

        * gmarkup.h, gmarkup.c: New module to parse a simple
	markup language

	* Makefile.am: add gmarkup.h, gmarkup.c

	* tests/Makefile.am: add markup-test

	* gstring.h (g_string_new_len): new function to create a string
	with a length
	(g_string_new): avoid a gratuitous realloc
This commit is contained in:
Havoc Pennington
2000-10-27 02:46:04 +00:00
committed by Havoc Pennington
parent 7ea09e4589
commit 32ef70d4b2
59 changed files with 4229 additions and 2 deletions

View File

View File

@@ -0,0 +1,2 @@
<foo>
</|foo>

View File

@@ -0,0 +1,4 @@
<foo>
<bar>
</foo>
</bar>

View File

@@ -0,0 +1 @@
</foo>

View File

@@ -0,0 +1 @@
</foo|>

View File

@@ -0,0 +1,2 @@
<foo>
<

View File

@@ -0,0 +1,3 @@
<foo>
<bar>
</bar>

View File

@@ -0,0 +1 @@
<foo/

View File

@@ -0,0 +1 @@
<fo

View File

@@ -0,0 +1 @@
<foo bar

View File

@@ -0,0 +1 @@
<foo

View File

@@ -0,0 +1 @@
<EFBFBD>ν

View File

@@ -0,0 +1 @@
<foo bar=

View File

@@ -0,0 +1 @@
<foo bar="fdsf

View File

@@ -0,0 +1 @@
<foo>

View File

@@ -0,0 +1,2 @@
<foo>
<fo

View File

@@ -0,0 +1 @@
<!-- dfklsjdf;kljsdf;ljk document ends here

View File

@@ -0,0 +1 @@
<? document ending unexpectedly

View File

@@ -0,0 +1 @@
<foo>&;</foo>

View File

@@ -0,0 +1 @@
<foo>&|;</foo>

View File

@@ -0,0 +1 @@
<foo>&am|;</foo>

View File

@@ -0,0 +1 @@
<foo>&bar;</foo>

View File

@@ -0,0 +1,49 @@
<foobar>
Παν語
This is a list of ways to say hello in various languages. Its purpose is to illustrate a number of scripts.
(Converted into UTF-8)
---------------------------------------------------------
Arabic السلام عليكم
Czech (česky) Dobrý den
Danish (Dansk) Hej, Goddag
English Hello
Esperanto Saluton
Estonian Tere, Tervist
FORTRAN PROGRAM
Finnish (Suomi) Hei
French (Français) Bonjour, Salut
German (Deutsch Nord) Guten Tag
German (Deutsch Süd) Grüß Gott
Greek (Ελληνικά) Γειά σας
Hebrew שלום
Hindi नमस्ते, <20><>मस्कार।
Italiano Ciao, Buon giorno
Maltese Ċaw, Saħħa
Nederlands, Vlaams Hallo, Dag
Norwegian (Norsk) Hei, God dag
Polish Dzień dobry, Hej
Russian (Русский) Здравствуйте!
Slovak Dobrý deň
Spanish (Español) ¡Hola!
Swedish (Svenska) Hej, Goddag
Thai (ภาษาไทย) สวัสดีครับ, สวัสดีค่ะ
Turkish (Türkçe) Merhaba
Vietnamese (Tiếng Việt) Xin Chào
Yiddish (ײַדישע) דאָס הײַזעלע
Japanese (日本語) こんにちは, コンニチハ
Chinese (中文,普通话,汉语) 你好
Cantonese (粵語,廣東話) 早晨, 你好
Korean (한글) 안녕하세요, 안녕하십니까
Difference among chinese characters in GB, JIS, KSC, BIG5:
GB -- 元气 开发
JIS -- 元気 開発
KSC -- 元氣 開發
BIG5 -- 元氣 開發
</foobar>

View File

@@ -0,0 +1 @@
<foo>&sdfkljsdsdfsdfsdfsdf</foo>

View File

@@ -0,0 +1 @@
<foo>&#34592348345343453453455645765736575865767;</foo>

View File

@@ -0,0 +1 @@
<foo>&#x10;</foo>

View File

@@ -0,0 +1 @@
<foo>&#;</foo>

View File

@@ -0,0 +1 @@
<foo>&#234234</foo>

View File

@@ -0,0 +1 @@
foo

View File

@@ -0,0 +1,2 @@
<|foo>
</|foo>

View File

@@ -0,0 +1,2 @@
<foo|>
</foo>

View File

@@ -0,0 +1,2 @@
<foo bar}"baz">
</foo>

View File

@@ -0,0 +1,2 @@
<foo/}>
</foo>

View File

@@ -0,0 +1,2 @@
<foo bar={baz">
</foo>

View File

@@ -0,0 +1,9 @@
<!-- Comment -->
<?PI ?>
<foobar>
<e1>Hi &amp; this is some text inside an element Two 'E' chars as character refs: &#69; &#x45; and some 'J': &#74; &#x4A;</e1>
<e2:foo> Text <childfree/> with some <nested>nested elements</nested> and entities &quot;&amp; &lt; &gt;&gt; &apos; and whitespace </e2:foo>
<tag ab="fo&lt;o" bar="foo" baz="blah">This element has attributes</tag>
<nochildren a="b" xyz="qrs"/>
</foobar>

View File

@@ -0,0 +1,49 @@
<foobar>
Παν語
This is a list of ways to say hello in various languages. Its purpose is to illustrate a number of scripts.
(Converted into UTF-8)
---------------------------------------------------------
Arabic السلام عليكم
Czech (česky) Dobrý den
Danish (Dansk) Hej, Goddag
English Hello
Esperanto Saluton
Estonian Tere, Tervist
FORTRAN PROGRAM
Finnish (Suomi) Hei
French (Français) Bonjour, Salut
German (Deutsch Nord) Guten Tag
German (Deutsch Süd) Grüß Gott
Greek (Ελληνικά) Γειά σας
Hebrew שלום
Hindi नमस्ते, नमस्कार।
Italiano Ciao, Buon giorno
Maltese Ċaw, Saħħa
Nederlands, Vlaams Hallo, Dag
Norwegian (Norsk) Hei, God dag
Polish Dzień dobry, Hej
Russian (Русский) Здравствуйте!
Slovak Dobrý deň
Spanish (Español) ¡Hola!
Swedish (Svenska) Hej, Goddag
Thai (ภาษาไทย) สวัสดีครับ, สวัสดีค่ะ
Turkish (Türkçe) Merhaba
Vietnamese (Tiếng Việt) Xin Chào
Yiddish (ײַדישע) דאָס הײַזעלע
Japanese (日本語) こんにちは, コンニチハ
Chinese (中文,普通话,汉语) 你好
Cantonese (粵語,廣東話) 早晨, 你好
Korean (한글) 안녕하세요, 안녕하십니까
Difference among chinese characters in GB, JIS, KSC, BIG5:
GB -- 元气 开发
JIS -- 元気 開発
KSC -- 元氣 開發
BIG5 -- 元氣 開發
</foobar>