Issue 63572 - Rubies (phonetics guide) dialog should recognize word stems
Summary: Rubies (phonetics guide) dialog should recognize word stems
Status: CONFIRMED
Alias: None
Product: Writer
Classification: Application
Component: formatting (show other issues)
Version: OOo 2.0.2
Hardware: All All
: P3 Trivial with 2 votes (vote)
Target Milestone: ---
Assignee: AOO issues mailing list
QA Contact:
URL:
Keywords:
Depends on:
Blocks:
 
Reported: 2006-03-24 11:45 UTC by attahb
Modified: 2013-02-07 22:37 UTC (History)
2 users (show)

See Also:
Issue Type: FEATURE
Latest Confirmation in: ---
Developer Difficulty: ---


Attachments
no combined stuff in the dialog (28.63 KB, image/png)
2006-08-05 00:23 UTC, lohmaier
no flags Details

Note You need to log in before you can comment on or make changes to this issue.
Description attahb 2006-03-24 11:45:07 UTC
Rubies dialog is not fully functional.

Japanese language support is on.

User enters text in Japanese, and selects it.

User invokes "Format/Asian phonetics guide..." from menu.

What's supposed to happen is that the software identifies kanji word stems
("base text") in the selected text, and puts them one to each box in the "base
text" column of the dialog box, and in the right-hand column, it should insert
the furigana ("ruby text") for each base text item. The user can edit them by
inserting an alternative pronunciation, or by deleting the furigana for some
items.  Then the user clicks "Apply", and rubies (furigana) are applied to all
the word stems accordingly.

There are several ways in which this doesn't work properly in OOo:

1) Instead of picking up just word stems, it picks up whole words as they appear
in dictionaries, so if you write 行きます, (ikimasu -- the present or future
tense of "to go"), it picks up 行き, (iki -- the gerund of "to go"), when it
should pick up 行, (i -- the stem of "to go"). It leads to incorrect positioning
of furigana. Picking up only word stems shouldn't be hard from the programming
point of view, since they are sequences of kanji characters, and can be
identified by the simple expedient of checking the unicode value of the
characters to see if they are in the range of kanji characters, as opposed to
kana, space, punctuation, or some other (or by using a dictionary subset that
contains only kanji stems, no inflected forms or words spelled in kana).

2) The software doesn't produce the furigana, so the user has to type them all
in by hand. From the way the software picks up words for the "base text" fields,
it is evident that it is using a dictionary. If that dictionary contains entries
for the pronunciation of each word contained therein, it ought to be a small
matter to look up the pronunciation and insert that into the associated "ruby
text" field. Sadly, the software doesn't bother, and the user is left with the
tedious job of doing it by hand. In some situations, this could involve looking
up a separate dictionary.

3) The system is completely fooled by punctuation marks. If the user selects an
area of text that includes a comma (、), full-stop (。), or other punctuation
mark, all the rest of the selected area, including the punctuation mark will be
inserted into a single ruby text field. This adds to the tedium of adding
furigana to text, because only one phrase can be handled at a time, rather than
a substantial passage of text being all handled at once with a single invocation
of the dialog.

4) The software ought to remember the furigana entered by the user, so that it
can be automatically used again the next time the pronunciation guide dialog box
is invoked.

If the ruby dialog were fully functional, a user could select a whole page of
text, invoke the phonetics guide once, click "Apply", and see furigana for all
the words in passage. As it stands now, the user would have to invoke the dialog
dozens of times, and type in the pronunciation for each word by hand, in a
procedure more tedious than it would be to type out the whole passage twice,
once with kanji and once without.
Comment 1 michael.ruess 2006-03-24 12:37:17 UTC
Reassigned to SBA.
Comment 2 attahb 2006-07-04 10:48:53 UTC
I am sad to see that this issue is still marked "unconfirmed" after so long. It 
appears that the OOo developers don't appreciate the importance of furigana 
(aka "rubi") to those of us who wish to write or read documents in Japanese. It 
is 
*fundamental*. If you scan through the bug list, there are lots of queries 
about why the feature doesn't appear to work properly, or seems to be 
incomplete. In one of those reports, someone incorrectly states that other WPs 
don't have proper furigana/rubi feature, so why should OOo? As a matter of 
fact, Word 2000 has the feature. It makes a HUGE difference, and it would not 
be a huge programming project. Public domain dictionaries already exist, and 
the basic framework for the feature also already exists in the code.

I would install OOo 1.x, Japanese version, but it appears that it and 2.x can't 
live side-by-side on the same PC (I need to have an English language UI 
available). Therefore, I'm giving up on OOo for the time being, and getting a 
beta copy of Word 2007. Maybe by the time the beta licence runs out, this issue 
will have been fixed, and bilingual Western/Japanese WP will be feasible with 
OOo.
Comment 3 lohmaier 2006-08-05 00:23:02 UTC
"resolved remind" isn't a good status if the problem is still an issue - however
I cannot reproduce.

When using "行きます" and selecting it and using the Ruby feature, all four
characters/symbols are entered individually in the dialog (see screenshot)

Please do not write multiple request/bugs into one single issue this makes
dealing with these issue much, much harder.
Please also keep in mind that most developers and QA-People don't have knowledge
of complex scripts or asian text at all.

Regarding 2) I don't think that OOo uses a dictionary right now.
I can reproduce 3), but please file a seperate, dedicated issue for this.

For 4) please file a seperate issue as well.
Comment 4 lohmaier 2006-08-05 00:23:47 UTC
Created attachment 38278 [details]
no combined stuff in the dialog
Comment 5 stefan.baltzer 2009-12-01 13:01:39 UTC
The issue type "defect" means "This is a bug to fix" because "Defined behavior
does not work as designed". But this one looks more like a huge pile of feature
implementations are missing. I adjusted the summary to give a hint what this
issue is about.

I change the issue type to "Feature" and confirm. Confirming means: This is a
valid request for a re-work of the ruby implementation and its UI in general. 
But... It is as valid as thousands of other existing requests that "only" wait
for developer resources to come by...

Reassigned to requirements.
Comment 6 grehtietalders 2010-10-23 15:44:14 UTC
Created attachment 72666