On Japanese Resultatives: Some Cross-linguistic ImplicationsFukushima University
This is used in the oral presentation held at the University of Queensland at 10th of August, 2012.
The aims of this presentation are to exemplify the characteristics of Japanese resultative expressions and to provide some implications to cross-linguistic and descriptive perspectives. Resultative constructions have been explored deeply in the fields of syntax, lexical semantics, constructional grammar. Japanese resultatives have also been analysed in the theoretical frameworks and Japanese linguistics. In the previous literature, however, attention has barely been paid to the fact that the ‘Product-resultative’ type is not allowed in English such as ‘*Sam baked a meat pie delicious’ and ‘*David bored a hole round through the board,’ while they are allowed in Japanese. It suggests that Japanese resultatives distribute differently more than the way concluded in the previous literature, where it was concluded that English resultatives have more types of resultatives than that of Japanese.
In this talk, I exemplify that Japanese resultatives should be analysed as a type of adverbial modification rather than secondary predication or construction, and do not follow the ‘force-dynamics’ as English resultatives do. I also point out some irrelevant phenomena on the surface such as manner-incorporation, degree modifier, and that VP quantifiers have adverbial characteristics in common. The difference between Japanese and English resultatives shown in this talk provide us some cross-linguistic implications for further research.
This is a presentation that I delivered during a Syntax class in semester 1. It was part of a series of presentations delivered in Syntax class in Culture and Linguistics Master program at Ibn Tofail University.
On Japanese Resultatives: Some Cross-linguistic ImplicationsFukushima University
This is used in the oral presentation held at the University of Queensland at 10th of August, 2012.
The aims of this presentation are to exemplify the characteristics of Japanese resultative expressions and to provide some implications to cross-linguistic and descriptive perspectives. Resultative constructions have been explored deeply in the fields of syntax, lexical semantics, constructional grammar. Japanese resultatives have also been analysed in the theoretical frameworks and Japanese linguistics. In the previous literature, however, attention has barely been paid to the fact that the ‘Product-resultative’ type is not allowed in English such as ‘*Sam baked a meat pie delicious’ and ‘*David bored a hole round through the board,’ while they are allowed in Japanese. It suggests that Japanese resultatives distribute differently more than the way concluded in the previous literature, where it was concluded that English resultatives have more types of resultatives than that of Japanese.
In this talk, I exemplify that Japanese resultatives should be analysed as a type of adverbial modification rather than secondary predication or construction, and do not follow the ‘force-dynamics’ as English resultatives do. I also point out some irrelevant phenomena on the surface such as manner-incorporation, degree modifier, and that VP quantifiers have adverbial characteristics in common. The difference between Japanese and English resultatives shown in this talk provide us some cross-linguistic implications for further research.
This is a presentation that I delivered during a Syntax class in semester 1. It was part of a series of presentations delivered in Syntax class in Culture and Linguistics Master program at Ibn Tofail University.
Spell checkers are used to detect and where possible correct spelling errors. Errors are classified as nonword errors and real-word errors. Real-word errors require the consideration of the context of the sentence to detect and correct. Setswana language has several commonly used words which are often misspelled by either separating or merging them. The misspelling results in real-word errors. In this paper we propose contextual rules that look at neighbor words to determine whether the correct word is written as two separate words or merged as one word. For some words the rules require that the parts of speech category of neighbor words be determined whereas some depend on specific neighbor words or position in a sentence. Implemented rules show that the rules are very consistent with a 88% success rate. Our tool only looks at neighbor words and therefore does not look at the context of the whole sentence. Hence, for words that require context of the whole sentence to disambiguate correctly our rules fail. This module can be incorporated into a spell checker to detect and correct real world errors for some words. That is, help users to determine the correct orthography of certain words
International Journal of Engineering Research and Applications (IJERA) is a team of researchers not publication services or private publications running the journals for monetary benefits, we are association of scientists and academia who focus only on supporting authors who want to publish their work. The articles published in our journal can be accessed online, all the articles will be archived for real time access.
Our journal system primarily aims to bring out the research talent and the works done by sciaentists, academia, engineers, practitioners, scholars, post graduate students of engineering and science. This journal aims to cover the scientific research in a broader sense and not publishing a niche area of research facilitating researchers from various verticals to publish their papers. It is also aimed to provide a platform for the researchers to publish in a shorter of time, enabling them to continue further All articles published are freely available to scientific researchers in the Government agencies,educators and the general public. We are taking serious efforts to promote our journal across the globe in various ways, we are sure that our journal will act as a scientific platform for all researchers to publish their works online.
International Journal of Humanities and Social Science Invention (IJHSSI) is an international journal intended for professionals and researchers in all fields of Humanities and Social Science. IJHSSI publishes research articles and reviews within the whole field Humanities and Social Science, new teaching methods, assessment, validation and the impact of new technologies and it will continue to provide information on the latest trends and developments in this ever-expanding subject. The publications of papers are selected through double peer reviewed to ensure originality, relevance, and readability. The articles published in our journal can be accessed online.
Spell checkers are used to detect and where possible correct spelling errors. Errors are classified as nonword errors and real-word errors. Real-word errors require the consideration of the context of the sentence to detect and correct. Setswana language has several commonly used words which are often misspelled by either separating or merging them. The misspelling results in real-word errors. In this paper we propose contextual rules that look at neighbor words to determine whether the correct word is written as two separate words or merged as one word. For some words the rules require that the parts of speech category of neighbor words be determined whereas some depend on specific neighbor words or position in a sentence. Implemented rules show that the rules are very consistent with a 88% success rate. Our tool only looks at neighbor words and therefore does not look at the context of the whole sentence. Hence, for words that require context of the whole sentence to disambiguate correctly our rules fail. This module can be incorporated into a spell checker to detect and correct real world errors for some words. That is, help users to determine the correct orthography of certain words
International Journal of Engineering Research and Applications (IJERA) is a team of researchers not publication services or private publications running the journals for monetary benefits, we are association of scientists and academia who focus only on supporting authors who want to publish their work. The articles published in our journal can be accessed online, all the articles will be archived for real time access.
Our journal system primarily aims to bring out the research talent and the works done by sciaentists, academia, engineers, practitioners, scholars, post graduate students of engineering and science. This journal aims to cover the scientific research in a broader sense and not publishing a niche area of research facilitating researchers from various verticals to publish their papers. It is also aimed to provide a platform for the researchers to publish in a shorter of time, enabling them to continue further All articles published are freely available to scientific researchers in the Government agencies,educators and the general public. We are taking serious efforts to promote our journal across the globe in various ways, we are sure that our journal will act as a scientific platform for all researchers to publish their works online.
International Journal of Humanities and Social Science Invention (IJHSSI) is an international journal intended for professionals and researchers in all fields of Humanities and Social Science. IJHSSI publishes research articles and reviews within the whole field Humanities and Social Science, new teaching methods, assessment, validation and the impact of new technologies and it will continue to provide information on the latest trends and developments in this ever-expanding subject. The publications of papers are selected through double peer reviewed to ensure originality, relevance, and readability. The articles published in our journal can be accessed online.
EXTRACTING LINGUISTIC SPEECH PATTERNS OF JAPANESE FICTIONAL CHARACTERS USING ...kevig
This study extracted and analyzed the linguistic speech patterns that characterize Japanese anime or game
characters. Conventional morphological analyzers, such as MeCab, segment words with high
performance, but they are unable to segment broken expressions or utterance endings that are not listed in
the dictionary, which often appears in lines of anime or game characters. To overcome this challenge, we
propose segmenting lines of Japanese anime or game characters using subword units that were proposed
mainly for deep learning, and extracting frequently occurring strings to obtain expressions that
characterize their utterances. We analyzed the subword units weighted by TF/IDF according to gender,
age, and each anime character and show that they are linguistic speech patterns that are specific for each
feature. Additionally, a classification experiment shows that the model with subword units outperformed
that with the conventional method
EXTRACTING LINGUISTIC SPEECH PATTERNS OF JAPANESE FICTIONAL CHARACTERS USING ...kevig
This study extracted and analyzed the linguistic speech patterns that characterize Japanese anime or game characters. Conventional morphological analyzers, such as MeCab, segment words with high performance, but they are unable to segment broken expressions or utterance endings that are not listed in the dictionary, which often appears in lines of anime or game characters. To overcome this challenge, we propose segmenting lines of Japanese anime or game characters using subword units that were proposed mainly for deep learning, and extracting frequently occurring strings to obtain expressions that characterize their utterances. We analyzed the subword units weighted by TF/IDF according to gender, age, and each anime character and show that they are linguistic speech patterns that are specific for each feature. Additionally, a classification experiment shows that the model with subword units outperformed that with the conventional method.
Mastery in Japanese Conjunctions among Indonesian Learners of JapaneseQUESTJOURNAL
ABSTRACT: This study is an analysis of usages of Japanese conjunctions ~ tekara (after), ~ toki (when) and ~ tara (if / when) which were tested to the students at USU’s and UNAND’s Japanese Literature Departments which have applied the Japanese learning phase in the levels of N5 ~ N3. The students were requested to identify the differences among the three conjunctions. The research was both quantitative and qualitative. The research data were taken from the Pre-Test, which was shaped to choose one correct answer between (~tekara/tare) and (~toki/~tara) and the Post-Test, which was provided to complete a sentence with ~tekara, ~toki and ~tara. The results showed that there was a tendency that students translated the questions during PreTest into Indonesian and ignored the Japanese grammar. As a result, they made errors. It was concluded that there were lacks of understanding and training in these conjunctions.
MORPHOLOGICAL ANALYZER USING THE BILSTM MODEL ONLY FOR JAPANESE HIRAGANA SENT...kevig
This study proposes a method to develop neural models of the morphological analyzer for Japanese Hiragana sentences using the Bi-LSTM CRF model. Morphological analysis is a technique that divides text data into words and assigns information such as parts of speech. In Japanese natural language processing systems, this technique plays an essential role in downstream applications because the Japanese language does not have word delimiters between words. Hiragana is a type of Japanese phonogramic characters, which is used for texts for children or people who cannot read Chinese characters. Morphological analysis of Hiragana sentences is more difficult than that of ordinary Japanese sentences because there is less information for dividing. For morphological analysis of Hiragana sentences, we demonstrated the effectiveness of fine-tuning using a model based on ordinary Japanese text and examined the influence of training data on texts of various genres.
M ORPHOLOGICAL A NALYZER U SING THE B I - LSTM M ODEL O NLY FOR JAPANESE H IR...kevig
This study proposes a method to develop neural models of the morphological analyzer for Japanese
Hiragana sentences using the Bi-LSTM CRF model. Morphological analysis is a technique that
divides text data into words and assigns information such as parts of speech. In Japanese natural
language processing systems, this technique plays an essential role in downstream applications
because the Japanese language does not have word delimiters between words. Hiragana is a type
of Japanese phonogramic characters, which is used for texts for children or people who cannot read
Chinese characters. Morphological analysis of Hiragana sentences is more difficult than that of
ordinary Japanese sentences because there is less information for dividing. For morphological
analysis of Hiragana sentences, we demonstrated the effectiveness of fine-tuning using a model
based on ordinary Japanese text and examined the influence of training data on texts of various
genres.
A Comparative Study of Light Verbs in Tamil and Tuluijtsrd
In South Asian languages with SOV word order as an aerial feature, a compound verb construction appears as V1 V2 where V1 is the main verb and V2 the light verb. The semantic content that a light verb has is a grammaticalised one, in a V V construction. The light verb usually contributes grammaticalised meanings, aspect aktionsart, speaker evaluation etc. The light verb has an independent lexical meaning when it occurs solely as a main verb. Light verbs are also called as Explicator verbs and Vector verbs. They are found in Compound Verb Constructions as a verb that modifies the main verb. The main verb is called as a Polar verb and the light verb is called as a Vector verb. The V V construction is variously called as a Compound verb, Serial verb etc. This paper is the first of its kind to explore the area of light verbs in Tulu. Apart from listing out the light verbs in Tulu which is definitely not an exhaustive one , this paper compares them with their counterparts in Tamil, checking meanwhile if they are at all etymologically related. The etymological relation between the Tamil and Tulu light verbs is sought after because they both belong to the Southern branch of Dravidian languages. V. Ramprashanth | Dr. R. Kumarasamy "A Comparative Study of Light Verbs in Tamil and Tulu" Published in International Journal of Trend in Scientific Research and Development (ijtsrd), ISSN: 2456-6470, Volume-7 | Issue-2 , April 2023, URL: https://www.ijtsrd.com.com/papers/ijtsrd54031.pdf Paper URL: https://www.ijtsrd.com.com/humanities-and-the-arts/other/54031/a-comparative-study-of-light-verbs-in-tamil-and-tulu/v-ramprashanth
GraphSummit Singapore | The Art of the Possible with Graph - Q2 2024Neo4j
Neha Bajwa, Vice President of Product Marketing, Neo4j
Join us as we explore breakthrough innovations enabled by interconnected data and AI. Discover firsthand how organizations use relationships in data to uncover contextual insights and solve our most pressing challenges – from optimizing supply chains, detecting fraud, and improving customer experiences to accelerating drug discoveries.
GraphRAG is All You need? LLM & Knowledge GraphGuy Korland
Guy Korland, CEO and Co-founder of FalkorDB, will review two articles on the integration of language models with knowledge graphs.
1. Unifying Large Language Models and Knowledge Graphs: A Roadmap.
https://arxiv.org/abs/2306.08302
2. Microsoft Research's GraphRAG paper and a review paper on various uses of knowledge graphs:
https://www.microsoft.com/en-us/research/blog/graphrag-unlocking-llm-discovery-on-narrative-private-data/
Threats to mobile devices are more prevalent and increasing in scope and complexity. Users of mobile devices desire to take full advantage of the features
available on those devices, but many of the features provide convenience and capability but sacrifice security. This best practices guide outlines steps the users can take to better protect personal devices and information.
LF Energy Webinar: Electrical Grid Modelling and Simulation Through PowSyBl -...DanBrown980551
Do you want to learn how to model and simulate an electrical network from scratch in under an hour?
Then welcome to this PowSyBl workshop, hosted by Rte, the French Transmission System Operator (TSO)!
During the webinar, you will discover the PowSyBl ecosystem as well as handle and study an electrical network through an interactive Python notebook.
PowSyBl is an open source project hosted by LF Energy, which offers a comprehensive set of features for electrical grid modelling and simulation. Among other advanced features, PowSyBl provides:
- A fully editable and extendable library for grid component modelling;
- Visualization tools to display your network;
- Grid simulation tools, such as power flows, security analyses (with or without remedial actions) and sensitivity analyses;
The framework is mostly written in Java, with a Python binding so that Python developers can access PowSyBl functionalities as well.
What you will learn during the webinar:
- For beginners: discover PowSyBl's functionalities through a quick general presentation and the notebook, without needing any expert coding skills;
- For advanced developers: master the skills to efficiently apply PowSyBl functionalities to your real-world scenarios.
Communications Mining Series - Zero to Hero - Session 1DianaGray10
This session provides introduction to UiPath Communication Mining, importance and platform overview. You will acquire a good understand of the phases in Communication Mining as we go over the platform with you. Topics covered:
• Communication Mining Overview
• Why is it important?
• How can it help today’s business and the benefits
• Phases in Communication Mining
• Demo on Platform overview
• Q/A
Essentials of Automations: The Art of Triggers and Actions in FMESafe Software
In this second installment of our Essentials of Automations webinar series, we’ll explore the landscape of triggers and actions, guiding you through the nuances of authoring and adapting workspaces for seamless automations. Gain an understanding of the full spectrum of triggers and actions available in FME, empowering you to enhance your workspaces for efficient automation.
We’ll kick things off by showcasing the most commonly used event-based triggers, introducing you to various automation workflows like manual triggers, schedules, directory watchers, and more. Plus, see how these elements play out in real scenarios.
Whether you’re tweaking your current setup or building from the ground up, this session will arm you with the tools and insights needed to transform your FME usage into a powerhouse of productivity. Join us to discover effective strategies that simplify complex processes, enhancing your productivity and transforming your data management practices with FME. Let’s turn complexity into clarity and make your workspaces work wonders!
Securing your Kubernetes cluster_ a step-by-step guide to success !KatiaHIMEUR1
Today, after several years of existence, an extremely active community and an ultra-dynamic ecosystem, Kubernetes has established itself as the de facto standard in container orchestration. Thanks to a wide range of managed services, it has never been so easy to set up a ready-to-use Kubernetes cluster.
However, this ease of use means that the subject of security in Kubernetes is often left for later, or even neglected. This exposes companies to significant risks.
In this talk, I'll show you step-by-step how to secure your Kubernetes cluster for greater peace of mind and reliability.
Removing Uninteresting Bytes in Software FuzzingAftab Hussain
Imagine a world where software fuzzing, the process of mutating bytes in test seeds to uncover hidden and erroneous program behaviors, becomes faster and more effective. A lot depends on the initial seeds, which can significantly dictate the trajectory of a fuzzing campaign, particularly in terms of how long it takes to uncover interesting behaviour in your code. We introduce DIAR, a technique designed to speedup fuzzing campaigns by pinpointing and eliminating those uninteresting bytes in the seeds. Picture this: instead of wasting valuable resources on meaningless mutations in large, bloated seeds, DIAR removes the unnecessary bytes, streamlining the entire process.
In this work, we equipped AFL, a popular fuzzer, with DIAR and examined two critical Linux libraries -- Libxml's xmllint, a tool for parsing xml documents, and Binutil's readelf, an essential debugging and security analysis command-line tool used to display detailed information about ELF (Executable and Linkable Format). Our preliminary results show that AFL+DIAR does not only discover new paths more quickly but also achieves higher coverage overall. This work thus showcases how starting with lean and optimized seeds can lead to faster, more comprehensive fuzzing campaigns -- and DIAR helps you find such seeds.
- These are slides of the talk given at IEEE International Conference on Software Testing Verification and Validation Workshop, ICSTW 2022.
The Art of the Pitch: WordPress Relationships and SalesLaura Byrne
Clients don’t know what they don’t know. What web solutions are right for them? How does WordPress come into the picture? How do you make sure you understand scope and timeline? What do you do if sometime changes?
All these questions and more will be explored as we talk about matching clients’ needs with what your agency offers without pulling teeth or pulling your hair out. Practical tips, and strategies for successful relationship building that leads to closing the deal.
State of ICS and IoT Cyber Threat Landscape Report 2024 previewPrayukth K V
The IoT and OT threat landscape report has been prepared by the Threat Research Team at Sectrio using data from Sectrio, cyber threat intelligence farming facilities spread across over 85 cities around the world. In addition, Sectrio also runs AI-based advanced threat and payload engagement facilities that serve as sinks to attract and engage sophisticated threat actors, and newer malware including new variants and latent threats that are at an earlier stage of development.
The latest edition of the OT/ICS and IoT security Threat Landscape Report 2024 also covers:
State of global ICS asset and network exposure
Sectoral targets and attacks as well as the cost of ransom
Global APT activity, AI usage, actor and tactic profiles, and implications
Rise in volumes of AI-powered cyberattacks
Major cyber events in 2024
Malware and malicious payload trends
Cyberattack types and targets
Vulnerability exploit attempts on CVEs
Attacks on counties – USA
Expansion of bot farms – how, where, and why
In-depth analysis of the cyber threat landscape across North America, South America, Europe, APAC, and the Middle East
Why are attacks on smart factories rising?
Cyber risk predictions
Axis of attacks – Europe
Systemic attacks in the Middle East
Download the full report from here:
https://sectrio.com/resources/ot-threat-landscape-reports/sectrio-releases-ot-ics-and-iot-security-threat-landscape-report-2024/
GraphSummit Singapore | The Future of Agility: Supercharging Digital Transfor...Neo4j
Leonard Jayamohan, Partner & Generative AI Lead, Deloitte
This keynote will reveal how Deloitte leverages Neo4j’s graph power for groundbreaking digital twin solutions, achieving a staggering 100x performance boost. Discover the essential role knowledge graphs play in successful generative AI implementations. Plus, get an exclusive look at an innovative Neo4j + Generative AI solution Deloitte is developing in-house.
DevOps and Testing slides at DASA ConnectKari Kakkonen
My and Rik Marselis slides at 30.5.2024 DASA Connect conference. We discuss about what is testing, then what is agile testing and finally what is Testing in DevOps. Finally we had lovely workshop with the participants trying to find out different ways to think about quality and testing in different parts of the DevOps infinity loop.
Why You Should Replace Windows 11 with Nitrux Linux 3.5.0 for enhanced perfor...SOFTTECHHUB
The choice of an operating system plays a pivotal role in shaping our computing experience. For decades, Microsoft's Windows has dominated the market, offering a familiar and widely adopted platform for personal and professional use. However, as technological advancements continue to push the boundaries of innovation, alternative operating systems have emerged, challenging the status quo and offering users a fresh perspective on computing.
One such alternative that has garnered significant attention and acclaim is Nitrux Linux 3.5.0, a sleek, powerful, and user-friendly Linux distribution that promises to redefine the way we interact with our devices. With its focus on performance, security, and customization, Nitrux Linux presents a compelling case for those seeking to break free from the constraints of proprietary software and embrace the freedom and flexibility of open-source computing.
A tale of scale & speed: How the US Navy is enabling software delivery from l...sonjaschweigert1
Rapid and secure feature delivery is a goal across every application team and every branch of the DoD. The Navy’s DevSecOps platform, Party Barge, has achieved:
- Reduction in onboarding time from 5 weeks to 1 day
- Improved developer experience and productivity through actionable findings and reduction of false positives
- Maintenance of superior security standards and inherent policy enforcement with Authorization to Operate (ATO)
Development teams can ship efficiently and ensure applications are cyber ready for Navy Authorizing Officials (AOs). In this webinar, Sigma Defense and Anchore will give attendees a look behind the scenes and demo secure pipeline automation and security artifacts that speed up application ATO and time to production.
We will cover:
- How to remove silos in DevSecOps
- How to build efficient development pipeline roles and component templates
- How to deliver security artifacts that matter for ATO’s (SBOMs, vulnerability reports, and policy evidence)
- How to streamline operations with automated policy checks on container images
1. Utilizing Agglutinative Features in Japanese-Uighur Machine Translation
Muhtar MAHSUT, Yasuhiro OGAWA, Kazue SUGINO, Yasuyoshi INAGAKI
Graduate School of Engineering, Nagoya University
Furo-cho, Chikusa-ku, Nagoya
Japan
{muhtar, yasuhiro}@inagaki.nuie.nagoya-u.ac.jp
Abstract
Japanese and Uighur languages are agglutinative languages and they have many syntactical and morphological similarities. And
roughly speaking, we can translate Japanese into Uighur sequentially by replacing Japanese words with corresponding Uighur ones
after morphological analysis. However, we should translate agglutinated suffixes carefully to make correct translation, because they
play important roles on both languages. In this paper, we pay attention to them and propose a Japanese-Uighur machine translation
utilizing the agglutinative features of both languages. To deal with the agglutinative features, we use the derivational grammar, which
makes the similarities clearer between both languages. This makes our system proposed here simple and systematical. We have
implemented the machine translation system and evaluated how effectively our system works.
Keywords
agglutinative languages, machine translation, derivational grammar, Japanese, Uighur
similarities between Japanese and Uighur through using
1 Introduction the derivational grammar.
Japanese and Uighur have many syntactical similarities, In this paper, we propose a Japanese-Uighur machine
especially with respect to the word order in the sentence. translation system based on the derivational grammar.
This observation suggests that we can translate Japanese First, we explain the similarities between them and show
into Uighur in such a manner as word for word replacing some illustrative examples of word-for-word translation.
after the morphological analysis of input sentences. In As a matter of course, the fact that Japanese and Uighur
addition, both languages have morphological similarities, are different languages does not allow us to apply the
which is a significant feature of agglutinative languages. word-for-word translation in a straightforward way. In
However, there is a grammatical problem, that is, Uighur fact, a straightforward word-for-word Japanese-Uighur
grammar says that Uighur has no conjugations, while translation sometimes yields unnatural Uighur sentences,
traditional Japanese grammar, which is often called school which is caused by wrong combination of Uighur
grammar, says that Japanese verbs do conjugate. morphemes. We introduce an idea of morpheme
Therefore the word-for-word translation based on school replacement to solve this problem. When unnatural
grammar needs to deal with verbal conjugations of Uighur morpheme sequence occurs, we replace an
Japanese, in which the translation of verbal endings is inappropriate Uighur morpheme with an appropriate one
very complicated to describe. determined by the information concerning morphemes
On the other hand, it is said that agglutinative language appearing before and after it. This leads us to succeed in
does not conjugate, and that verbal variants are derived generating more natural Uighur sentences.
from invariable stems by mechanically appending affixes Thus we have implemented the translation system by
to them. These two views contradict to each other. using a Japanese morphological analyser MAJO (Ogawa
To resolve this problem, G. N. Kiyose proposed the et al., 1998) which we have developed by the derivational
derivational grammar (Kiyose, 1995) based on the grammar approach. MAJO originally keeps the dictionary
phonological approach originated by Bloch (1946). This information as the list of 3-tuples <Japanese morpheme,
grammar claims that Japanese verbs do not conjugate and part-of-speech, meaning>. For translation, we replace the
Japanese verbal variants are made up by appending 3-tuples with <Japanese morpheme, part-of-speech,
suffixes to verbal stems. Because verbal conjugations Uighur morpheme >. As a result of it, outputs of the
need not to be considered, the derivational grammar morphological analysis by using MAJO become word-for-
makes morphological analysis quite simple. word translation from Japanese to Uighur. Then we apply
Since the derivational grammar pays attention to morpheme replacement to Uighur morphemes of MAJO’s
agglutinative features of languages, it is applicable to not outputs and acquire a translated Uighur sentence.
only Japanese but also other agglutinative languages such Our approach is not limited to Japanese-Uighur translation.
as Uighur. So applying the derivational grammar to the We can apply our approach to translations between other
two languages, we can make not only syntactical but also agglutinative languages successively, since the
morphological similarities between them clearer. This derivational grammar approach treats to agglutinative
approach also makes the translation simp ler and more features of languages.
systematic. This paper is organized as follows: Agglutinative
Recently, there are many literatures that discuss similarities between Japanese and Uighur with examples
translation between agglutinative languages such as of translations between them are shown in Section 2.
Japanese and Korean (Kim et al., 1998; Lee et al., 1990). Section 3 points out some problems of word-for-word
They use only syntactical similarities between them, but translation. In Section 4, we introduce morpheme
they can not make use of morphological similarities. Our replacement rules to solve them. We describe our
approach systematizes both syntactical and morphological implementation of Japanese-Uighur translation system
2. karega tobirawo aketa tobirawo karega aketa
kare ga tobira wo ake ta tobira wo kare ga ake ta
u Ø ixik ni aq di ixik ni u Ø aq di
u ixikni aqdi ixikni u aqdi
Figure 1: Word-for-word translation from Japanese to Uighur
based on the derivational grammar in Section 5. Finally, This causes a serious problem for word-for-word
we evaluate our system in Section 6. translation.
We overcome this problem by using the derivational
2 Agglutinative Similarities between grammar, which claims that Japanese verbs do not
Japanese and Uighur conjugate, and that appending suffixes to verbal stems
makes up Japanese verbal variants. We also apply the
2.1 Syntactical Similarities derivational grammar to Uighur. This makes their
morphological similarities clear as well as the syntactical
Both Japanese and Uighur languages are agglutinative and ones.
they often refer to such the languages as free word order
languages. In Japanese, for example, we can say “karega 2.2.1 Union consonant and union vowel
tobirawo aketa” as well as “tobirawo karega aketa”. The invariable part of a verb is called a verbal stem. For
Both sentences mean “he opened the door” in English. example, ‘tabe-’ in “tabe-ru” [eat] and ‘kak-’ in “kak-
What does allow such the word order changes keeping the u” [write] are both verbal stems in Japanese. There are
equivalent meaning? The answer should be found in the two sorts of verbal stems : the one ending with a vowel,
function of case suffices. The dependency relation of a e.g. ‘tabe-’ and the one ending with a consonant, e.g.
noun to other words, that is the role which plays in a ‘kak-’. We call the former a vowel stem and the latter a
sentence, is indicated by case suffixes. The Japanese case consonant stem, respectively.
suffix ‘-ga’ indicates the subjective noun of a sentence. A verbal phrase consists of a verbal stem and some verbal
That is, ‘-ga’ makes the nominative case. The case suffix suffixes. The verbal suffixes of “tabe-ru” and “kak-u”
‘-wo’ makes the accusative case and it indicates that the are ‘-ru ’ and ‘-u’, respectively. The verbal suffix ‘-ru’
prepositive noun is the object of the verb in a sentence. So follows only a vowel stem and the suffix ‘-u’ follows only
case suffixes make it possible to understand sentences a consonant stem. The derivational grammar says those
even if the position of “karega” and “tobirawo” are suffixes are the environmental variants of the suffix
exchanged. ‘-(r)u’. The consonant ‘r’ appears when it is appended to
Uighur language has the same property. Uighur accusative a vowel stem and disappears when it is appended to a
case marker is ‘-ni’ and it corresponds to Japanese case consonant stem. A consonant of this kind is called a union
suffix ‘-wo’. But in Uighur language the nominative case consonant.
is often indicated by the zero-form. We show it by ‘Ø’. Verbal phrases “tabe-na-i” [do not eat] and “kak-ana-i”
This does not mean that Uighur language has no [do not write] represent negative actions. The negative
nominative case suffixes. verbal suffixes of them are ‘-na-’ and ‘-ana-’,
Similar by to Japanese, Uighur case suffixes make it respectively. Those suffixes are also the variants of the
possible to exchange word order with no change of suffix ‘-(a)na-’. The vowel ‘a’ appears when it is
meaning. Thus, we can translate both Japanese sentences appended to a consonant stem and disappears when it is
“karega tobirawo aketa” and “tobirawo karega appended to a vowel stem. A vowel of this kind is called a
aketa” into Uighur in the manner of word-for-word as union vowel.
shown in Figure 1. Both of “u ixikni aqdi” and “ixikni u
aqdi” are natural Uighur sentences that mean “he opened
meaning Japanese Uighur
the door”.
This observation means that the case suffixes play the causative -(s)ase- -guz-
essential roles in Japanese and Uighur, and should be passive -(r)are- -(i)l-
treated carefully in Japanese-Uighur translation. potential -(r)e- -(y)ala-
polite -(i)mas- -
2.2 Morphological Similarities negative -(a)na- -ma-
It has been considered that there is a morphological desiderative -(i)ta- -gu-
difference between Japanese and Uighur. That is, although
Uighur has no verbal conjugations, the traditional
Japanese grammar says that Japanese verbs do conjugate. Table 1: Derivational suffixes in Japanese and Uighur
3. form meaning Japanese Uighur
finite non-perfective -(r)u -[i]du
form perfective -(i)ta -di
participle non-perfective -(r)u -[i]digan
form perfective -(i)ta -gan
perfective copulative -(i)te -(i)p
converb provisional conditional -(r)eba -sa
form negative copulative -(a)zu -mastin
simultaneous -(i)nagara -gaq
imperative affirmative -e,-ro -gin
form negative -(r)una -magin
Table 2: Syntactical suffixes in Japanese and Uighur
The derivational grammar summarizes these discussions In addition, there are some derivational suffixes other than
as the following two rules: these in Uighur. For example, a Uighur suffix ‘-(i)x-’
indicates cooperative meaning. If ‘kutra-’ [congratulate]
Connection rule 1: When the suffix beginning with a follows ‘-(i)x-’, it forms ‘kutrax-’ [congratulate each
union consonant is appended to a consonant stem, other]. In Japanese similar meaning indicated by ‘-(i)aw-’,
the union consonant disappears. which is not a single suffix but is combination of a
Connection rule 2: When the suffix beginning with a syntactical suffix ‘-(i)’ and a verb ‘aw -’. However, for
union vowel is appended to a vowel stem, the union translation, Japanese ‘-(i)aw-’ matches Uighur
vowel disappears. derivational suffix ‘-(i)x-’, so we take ‘-(i)aw-’ as a
derivational suffix. Similarly, we consider some
2.2.2 Derivational Suffixes and Syntactical Suffixes compound morphemes as derivational suffixes, such as
As shown in the examples of the above section, the ‘-(i)teir-’ and ‘-(i)tutuar-’. Both suffixes correspond to
negative verbal suffix ‘-(a)na-’ can be followed by other Uighur suffix ‘-(i)wat -’.
verbal suffixes, for example, ‘-i’ in “kak-ana-i” [do not Here, we should note that Uighur has another connection
write]. These examples show that the verbal suffix rule. This shows a change of a phoneme instead of
appended to verbal stems derives a new stem, which is disappearing. For example, when the syntactical suffix
called a secondary stem. A suffix that derives a new stem ‘-[i]digan’, which means non-perfective participle,
is called a derivational suffix. For example, “kak-ana-” follows a consonant stem ‘yaz-’, a phoneme ‘i’ appears
and “yazidigan” is formed. But if it follows a vowel stem
is a secondary stem and ‘-(a)na-’ is a derivational suffix.
‘yasa-’, ‘i’ changes to ‘y’ and “yasaydigan” is formed.
Other verbal suffixes that do not derive new stems are
We named such a phoneme a union half vowel. It varies
called syntactical suffixes. They make various verbal according to the following rule:
forms, which are classified into 4 forms: fin ite, participle,
converb and imperative. For example, the suffix ‘-(r)u’ in Connection rule 3: When a suffix beginning with a
“tabe-ru” is a syntactical suffix and it forms a finite form. union half vowel is appended to a consonant stem,
Japanese verbal stems can be followed by several suffixes, the union half vowel becomes ‘i’, otherwise it
but syntactical suffixes are always appended last. Note becomes ‘y’.
that Japanese has strict rules for order of verbal suffixes,
although the word order in Japanese sentence is rather 3 Problems for Japanese-Uighur Translation
flexible. There are m any similarities between Japanese and Uighur
but there are also some linguistic differences between
2.2.3 Applying the derivational grammar to Uighur them. Those differences cause serious problems to the
Uighur verbal phrases have the similar features to word-for-word translation. In this section, some examples
Japanese. We can also apply the derivational grammar to illustrate the problems. Our solutions to these problems
Uighur language. A Uighur passive derivational suffix is are descried in section 4.
‘-(i)l-’, where ‘(i)’ refers a union vowel. When ‘-(i)l-’
follows a vowel stem ‘yasa-’ [create], a union vowel ‘(i)’
3.1 Problems of verbal suffixes
disappears and ‘yasal-’ [be created] yields.
Let's consider a Uighur verbal stem ‘ az-’ [write]. In
y Table 2 shows the correspondences between Japanese and
order to express a causative meaning, ‘yaz-’ is followed Uighur syntactical suffixes. As you see, Japanese
by a causative suffix ‘-guz-’ and forms a new stem participle suffixes are the same form as finite suffixes
‘yazguz-’ [cause to write], that is, ‘-guz-’ is a while Uighur ones are not. For example, Japanese
derivational suffix. In addition, ‘-di’ completes the verbal perfective syntactical suffix ‘-(i)ta’ is used for both finite
phrase “yazguzdi” by appending to the new stem. Both and participle forms. In Uighur, however, they are
Japanese and Uighur have several derivational suffixes different forms as shown in Figure 2. For this reason, to
and the correspondence between them is summarized in translate ‘-(i)ta’, we have to decide which of ‘-di’ or
Table 2. ‘-gan’ is the correct translation.
4. Finite form Participle form
karega tobirawo aketa. karega aketa tobira
kare ga tobira wo ake ta . kare ga ake ta tobira
u Ø ixik ni aq di . u Ø aq -gan ixik
u ixikni aqdi. u aqgan ixik
[he opened the door.] [the door that he opened]
Figure 2: The difference about finite form and participle form
“köwrük din ötidu”. Japanese case suffix ‘wo’ is
3.2 Problems of case suffixes translated into ‘din’. The suffix ‘w o’ usually indicates the
We have already explained that for Japanese and Uighur object, but sometimes it indicates the place, while Uighur
case suffixes specify the role of noun phrases in the case suffix ‘ni’ and ‘din’ indicates the object and the place,
sentences . This fact allows the flexible word order in the respectively. So we have to choose ‘ni’ or ‘din’ according
sentences of these languages. But there does not exist one- to the role of ‘ wo’ in the sentence. In a fact, the case
to-one correspondence between case suffixes of Japanese suffix ‘wo’ that indicates a place often appears together
and those of Uighur. For example, “gomiwo suteru” is with such the motional verbs like “tooru ” [pass] and
translated into “éhlétni tökidu”, where Japanese case “wataru” [cross]. This suggests we can choose ‘ or ni’
suffix ‘wo’ is translated into Uighur case suffix ‘ni’. On ‘din’ according to the verbs which the noun phrases with
the other hand, the translation of “hasiwo wataru” is ‘wo’ depends on.
We have investigated the ambiguities of case suffixes by
using IPA Lexicon of the Japanese Language for
Japanese Computers (basic Japanese verbs) (IPAL, 1987), which
suffix translated Uighur suffixes and its number total contains 861 Japanese verbs and 3473 example sentences.
Ø ning gé We have translated the Japanese case suffixes that
ga 3637 77 5 3719 appeared in the sentences into Uighur ones and counted
97.7% 2.0% 0.3% the number of them. Table 3 shows the result. ‘Ø/ni’ in
ni Ø/ni din others fault the second row means that ‘wo’ in the sentence is
wo 1566 320 118 57 47 2108 translated in ‘ni’ but it is usually omitted in the
74.3% 15.2% 5.6% 2.7% 2.2% corresponding Uighur sentences. Some of example
gé dé din Ø fault sentences can not be translated directly and we classified
ni 1183 294 81 16 35 1609 such cases as faults.
73.5% 18.3% 5.0% 1.0% 2.2% We do not deal with Japanese case suffix ‘no’, which is
dé bilén usually translated Uighur case suffix ‘ning’, because ‘no’
de 638 14 652 is usually used in such a way “A no B” [B of A] and it is
97.8% 2.2% related little to verbs. We do not deal with Japanese case
dép Ø/dép bilén suffix ‘ya’, too, because of the same reason.
to 193 46 146 385
50.1% 12.0% 37.9% 4 Suffix Adjustment
din fault Now let us proceed to our discussions on realization of
kara 289 5 294 word-for-word translation from Japanese to Uighur. The
98.3% 1.7% facts we revealed so far show that the problems to be
gé solved here is how to decide verbal or case suffix
he 236 236 correspondences correctly. To overcome these problems,
100% we adopt a method to assign the default Uighur suffix to
din each Japanese suffix and then to substitute a well fitted
yori 14 14 suffix for an unnatural one under replacement rules. Since
a verbal stem and a following verbal suffix affect each
100%
other, we can choose an appropriate suffix by knowing the
Table 3: Statistical correspondence between Japanese right and left words. On the other hand, the verb which the
noun phrase with a case suffix depends on affects the
and Uighur case suffixes
suffix. So we need to decide the correct case suffix
5. Japanese default prepositive postpositive new new parts
suffix suffix word word suffix -of-speech
* end of sentence -ydu finite suffix
-(r)u -ydigan * punctuation mark -ydu finite suffix
* sentence-final particle -ydu finite suffix
* end of sentence -di finite suffix
-(i)ta -gan * punctuation mark -di finite suffix
* sentence-final particle -di finite suffix
Table 4: Replacement table of verbal suffixes
considering the verbs depended. Thus, we propose two ‘ni’ is more popular than the locative one. But the locative
types of rules for suffix replacement. case ‘ni’ is less related to verbs than the dative one. So we
translate ‘ni’ to ‘dé’ as default and it is replaced with ‘gé’
4.1 Replacement Table of Verbal Suffixes if the verb needs an indirect object.
For verbal suffixes, we make a replacement rule shown in In addition, we consider not ‘dép’ but ‘bilén’ as a default
Table 4. A Japanese suffix in the left most columns is not Uighur suffix for Japanese suffix ‘to’, which has two
used for the replacement but for the purpose of helping to meanings. One is to change the preceding sentence to a
understand. A default suffix in the second column is a quotative clause and it corresponds to ‘dép’. Another is
Uighur morpheme , which is translated directly from the called the comitative case, which expresses a co-operant
corresponding Japanese suffix in the first column. A or accompanist, and it corresponds to ‘bilén’. Traditional
prepositive word and a postpositive word represent a Japanese grammar says that both designations of ‘to’ are
condition for replacement of the default suffix. If the case suffixes. But the derivational grammar treats the
default suffix appears between the prepositive and former as a conjunctive particle and only the latter as a
postpositive words, it should be replaced by a new suffix case suffix. So we consider it is not the problem of
in the fifth column. The mark *(don't care) denotes that a translation but the one of morphological analysis to
replacement rule does not require a prepositive or distinguish two type of ‘to’. We use the morphological
postpositive word. Since words of a sentence are tagged analyzer that tags only the latter as a case suffix. So we
parts of speech by the morphological analysis, we need a decide ‘bilén’ as a default for case suffix ‘to’.
part of speech of the new suffix in the right most columns.
5 Machine translation from Japanese into
4.2 Replacement Rule for Case Suffixes Uighur
There are often several phrases between a noun phrase We have implemented the Japanese-Uighur machine
and a verbal phrase on which it depends. So we can not translation system. Our system consists of four modules:
use a replacement rules similar to above and need another MAJO, the two replacement modules and the morpheme
type of rules for case suffixes. We use dependency connection system. MAJO is a morphological analyser of
relation and add the case pattern data to the Uighur verb, Japanese based on the derivational grammar, and its
such as <öt-{wo/-din}>. The Uighur verb ‘öt-’ has a pair dictionary consists of 3-tuples <Japanese morpheme, part-
of Japanese case suffix and Uighur one and it is a of-speech, meaning>. For the translation, we replaced the
replacement rule for case suffixes. In translation, if a 3-tuples with <Japanese morpheme, part-of-speech,
Uighur verb that has such a pair appears in the sentence, Uighur morpheme >. Therefore outputs of MAJO become
the translation system searches the noun phrase depending word-for-word translations for input sentences.
on the verb and having the Japanese suffix contained in Here, a Japanese input sentence is “tukutta hasiwo
the pair. Then the old Uighur suffix is replaced by the new watatta.” [crossed the bridge that one constructed.]
one that is in the pair. shown in figure 3. Firstly, MAJO divides it into Japanese
For each Japanese case suffix, we assume the most words and yields a sequence of equivalent Uighur words.
possible one determined by Table 3 as a default Uighur Secondly, replacement rules of verbal suffixes are applied
suffix. But, exceptionally, we adopt ‘dé’ as a default to those Uighur suffixes if they match the conditions on
Uighur suffix for a Japanese suffix ‘ni’. The reason is the the replacement table. In the example, there are two ‘-tta’
following. The Japanese case suffix ‘ni’ often indicates a
in the input sentence and MAJO translates both into
position of space or time and it is said to represent the
‘-gan’. At this point, last ‘-gan’ matches the replacement
locative case. In such a case ‘ni’ should be translated
Uighur suffix ‘dé’ which designates the locative case. On rule and it is replaced with ‘-di’. Thirdly, case suffixes are
the other hand, ‘ni’ also denotes the dative case and replaced if the verbs that they depend have the
corresponds to ‘gé’. Table 3 shows that the dative case replacement rules satisfied the condition. In the example,
Japanese case suffix ga wo ni de to kara he yori
default Uighur suffix Ø ni dé dé bilén din gé din
Table 5: Default translation for case suffixes
6. input sentence tukutta hasiwo watatta.
Morphological analysis
tukur- -ta hasi -wo watar- -ta .
Word-for-word translation
sal- -gan köwrük -ni öt- -gan .
Replacement 1
sal- -gan köwrük -ni öt- -di .
Replacement 2
sal- -gan köwrük -din öt- -di .
Morpheme connection
output sentence salgan köwrükdin ötdi.
Figure 3: Translation system and an example of translation
the first step translates ‘wo’ into ‘ ni’. But the verb
- - develop a word selection method. The replacement table
‘watar’ on which the noun phrase “hasiwo” depends has of our system describe only suffixes and parts of speech.
a replacement rule <watar, consonant-verb, öt-{wo/ If we expand it to word meanings, we could solve
-din}>. So ‘-ni’ is replaced with ‘-din’. Finally, the ambiguities of word meaning.
morpheme connection system connects Uighur We have discussed only case suffixes, but there are other
morphemes according to the connection rules and suffixes in Japanese, like ‘mo’ and ‘sika’. We know
generates a Uighur output sentence. some of them correspond to Uighur suffixes. For example,
Uighur suffix ‘mu’ has similar role of Japanese ‘mo’. So
5 Experiments we should investigate their correspondence further.
We used 136 sentences that include 306 verbal phrases We are now making bigger size of experiments of our
(254 different patterns) to evaluate performance of our translation system and collecting m replacement rules.
ore
Japanese-Uighur machine translation system. We We are aiming to make our system fit for practical use.
constructed a Japanese-Uighur dictionary that had about
13,000 words including 3,800 verbs. We compared Bibliographical References
translation results between the system with/without the Bloch, B. (1946). Studies in Colloquial Japanese, Part I,
replacement rules for verbal suffixes. As a result, the Inflection, Journals of the American Oriental Society
system translated 119 verbal phrases correctly without the 66.
replacement rules, while the system with the replacement IPA L (1987). IPA Lexicon of the Japanese Language for
rules translated 212 verbal phrases correctly. Thus, the Computers (Basic Verbs), The Information-technology
precisions of translation about verbal phrases improved Promotion Agency.
from 38.9% to 69.3%. Kim, J. and Lee, J. and Lee, G. (1998). Generation of
We also evaluate translation of 295 case suffixes appeared Korean Predicates Based on Modality-Feature Ordering
the 136 sentences. The simple system translated 257 and Lexicalizing Table in Japanese-Korean Machine
suffixes correctly, while our system translates 293 case Translation, Journal of Natural Language Processing
suffixes correctly. The precisions of translation for case 5(2), pp.3-24.
suffixes improved from 87.1% to 99.3%. Kiyose, G. N. (1995). Japanese grammar --A new
approach--, Kyoto university press.
6 Conclusion Lee, H. and Nakajima, M. and Agui, T. (1990). A
Japanese-Korean Predicate Translation System Using
In this paper, we proposed a Japanese-Uighur translation the Semantic Correspondence between Their Auxiliary
system. Our system is based on the derivational grammar Predicative Expressions, Transactions of Information
and has succeeded in systematic word-for-word Processing Society of Japan 31(6), pp.801-809.
translation. In addition, it can generate a natural Uighur Ogawa, Y. and Muhtar and Toyama, K. and Inagaki, Y.
sentence by using replacement rules. (1998). Derivational Grammar Approach to
Our system has two modules for replacements and now Morphological Analysis of Japanese Sentences,
we are going to unify them. In addition, it does not take PRICAI'98, LNAI 1531, pp. 424-435.
account of amb iguities of word meaning. For example,
Japanese syntactical suffix ‘-(r)are-’ has three meanings,
which are passive, potential and honorific, but we now
consider it only as a passive suffix. Therefore, we need to