SlideShare a Scribd company logo
Dacco and qdacco
An Open Source Catalan-English Dictionary
and its GUI
Fosdem 2008 in Brussels, Belgium
English-
Catalan
Catalan-
English
Carles Pina i Estany
carles@pina.cat
2/21
What is the history of Catalan and
what is its future?
● It's a Romance language that derives from
Latin, just like Spanish, Italian, French,
Portuguese and Romanian
● The Catalan linguistic and cultural community
has its own .cat Top Level Domain
● It's an official language of Catalonia, Andorra,
Valencia and the Balearic Islands. It is also
spoken in Alghero (Sardinia) and parts of
southern France
3/21
Map of Europe
Author: Wikipedia
4/21
Where is it spoken?
Author: Josep Gallart
5/21
Speakers as % of EU population
Catalan Portuguese Dutch Spanish
0
2
4
6
8
10
12
14
16
%
Data source: http://ec.europa.eu/public_opinion/archives/ebs/ebs_243_en.pdf
9,1 Million
6/21
What is Dacco?
● English-Catalan dictionary tailored for speakers
of both languages
● All entries are written in XML
– Also exported to PDF for improved usability
– Form-based web search at www.catalandictionary.org
– Plugins available for Firefox and IE
– Gadget for iGoogle homepage
– Catalan verb conjugator
– qdacco (standalone application)
7/21
Dacco's history
● An English student of Catalan found that there
was a lack of Catalan dictionaries available
which were comprehensive and aimed at
English speakers
● This was confirmed by other students of
Catalan through a survey she distributed during
her PhD studies
● The collection of dictionary entries was begun
in 2001. The first version of Dacco was
released in 2003
8/21
● People from all walks of life: linguists but also
mathematicians, geographers, IT professionals.
Who contributes?
Lou Hevly
Josep M.
López
James
Macgill
Linda Oxnard
Oriol Vilaseca
Others:
•Joanathan Kaye
•Gill Martin
•David Gimeno
•Jaume Ortolà
•Max Wheeler
•Margarita Castañón
Leopold Palomo
9/21
The 'average' contributor
● Aged between 20 and 80
● English speaker living in UK, US, Canada or
Australia
● Catalan speaker from Barcelona, Girona,
Tarragona, Valencia or the Balearic Islands
● Common purpose: wanting to create a resource
which is not only useful to language learners
but is also free and open
10/21
Where do we get our entries from?
● We do not copy entries from any dictionary
● Agreement with TermCat to incorporate their
open source vocabulary lists into our dictionary
● Entries are often added by those who use the
dictionary in their daily tasks when they search
for a word and fail to find it
● Recently: we are matching some Catalan-Latin-
English birds list to incorporate
11/21
How does the project work?
● First step: find a new word
– Mailing list: regular contributors send suggestions
– Web form: any user can send suggestions
– 'Missing word lists': created automatically from
online search engine and manually from
suggestions sent through qdacco
● Second Step: discuss the new word's
translation equivalent
– Different meanings according to geographic area,
age, socioeconomic status, etc.
● Third Step: project admin adds word to the XML
12/21
Dacco's Special Features (1/3)
● It's free: anyone can modify, copy or redistribute
– Data (XML files): LGPL
– PDF files: CC Attribution-Share Alike 2.5
– qdacco: GPL 3
● More than 15,000 entries in each side of the
dictionary (over 200 DIN-A4 pages)
13/21
Dacco's Special Features (2/3)
● 4 dictionaries:
– Catalan-English dictionary
– English-Catalan dictionary
– Catalan-English dictionary
– English-Catalan dictionary
● Examples and usage notes tailored according
to user's native language
for Catalan speakers
for English speakers
14/21
Dacco's Special Features (3/3)
● Links to images
● Usage notes
● Examples
● Word frequency counts (from Google)
● Semantic fields: apple ⇒ fruit ⇒ food
15/21
Why is it so important that the
dictionary be open source?
● Culture should be free
● Anybody can incorporate it into their own
application or web site
● Anybody can suggest entries, though every
suggestion is examined and discussed by team
of contributors so that quality of dictionary is
never compromised. Descriptive, not normative
dictionary.
● We encourage others to create their own open
source dictionaries and are happy to share the
benefit of our experience
16/21
What is qdacco?
● Multi-platform (Unix, Linux, Windows*)
standalone application
● Available in standard Debian repositories
● qdacco has been developed since 2005
– Approximately two major releases a year
17/21
Screenshots
18/21
Why did we create qdacco?
● We wanted a reference application for Dacco
● Access to all Dacco resources (photos, links,
examples, notes, etc.)
● Reports missing words to the Dacco Project.
● Integration with Festival (Speech synthesizer)
● Auto-completion and many other features
● Offline searching - faster than looking through a
PDF file on your own.
19/21
qdacco architecture
libqdacco
qdacco textdacco
XML
InternetFestival
Send suggestions
20/21
Similar projects
● GPL German-Catalan dictionary:
– GPL Deutsch-Katalanisches Wörterbuch
– http://www.aldeaglobal.net/diccionari/index.php
● Wiktionary:
– Catalan: http://ca.wiktionary.org/wiki/Portada
– English: http://en.wiktionary.org/wiki/Main_Page
21/21
Thanks for your
attention
Questions?
?
Mail: carles@pina.cat
http://www.catalandictionary.org
22/21
Creative Commons
This work is licensed under the Creative
Commons Attribution 2.5 Spain License. To
view a copy of this license, visit
http://creativecommons.org/licenses/by/2.5/es/
or send a letter to Creative Commons, 171
Second Street, Suite 300, San Francisco,
California, 94105, USA.

More Related Content

Similar to Dacco

META-NET and META-SHARE: Language Technology for Europe
META-NET and META-SHARE: Language Technology for EuropeMETA-NET and META-SHARE: Language Technology for Europe
META-NET and META-SHARE: Language Technology for Europe
Georg Rehm
 
Europeana meeting under Finland’s Presidency of the Council of the EU - Day 2...
Europeana meeting under Finland’s Presidency of the Council of the EU - Day 2...Europeana meeting under Finland’s Presidency of the Council of the EU - Day 2...
Europeana meeting under Finland’s Presidency of the Council of the EU - Day 2...
Europeana
 
Apertium: Free/open-source rule-based machine translation and language proces...
Apertium: Free/open-source rule-based machine translation and language proces...Apertium: Free/open-source rule-based machine translation and language proces...
Apertium: Free/open-source rule-based machine translation and language proces...
TAUS - The Language Data Network
 
Apertium: a unique free/open-source MT system for related languages [but not ...
Apertium: a unique free/open-source MT system for related languages [but not ...Apertium: a unique free/open-source MT system for related languages [but not ...
Apertium: a unique free/open-source MT system for related languages [but not ...
Prompsit Language Engineering
 
Multilingualism for Digital Europe
Multilingualism for Digital EuropeMultilingualism for Digital Europe
Multilingualism for Digital Europe
Georg Rehm
 
2016 EDRLab roadmap at epubsummit
2016 EDRLab roadmap at epubsummit2016 EDRLab roadmap at epubsummit
2016 EDRLab roadmap at epubsummit
Laurent Le Meur
 
META-NET: Towards a Strategic Research Agenda for Multilingual Europe
META-NET: Towards a Strategic Research Agenda for Multilingual EuropeMETA-NET: Towards a Strategic Research Agenda for Multilingual Europe
META-NET: Towards a Strategic Research Agenda for Multilingual Europe
Georg Rehm
 
Addressing multilingual challenges at Europeana: An update - DCMI 2021
Addressing multilingual challenges at Europeana: An update - DCMI 2021Addressing multilingual challenges at Europeana: An update - DCMI 2021
Addressing multilingual challenges at Europeana: An update - DCMI 2021
Antoine Isaac
 
Celtic language technologies in the digital age
Celtic language technologies in the digital ageCeltic language technologies in the digital age
Celtic language technologies in the digital age
techiaith
 
Promoting the Use of Basque via Language Technology
Promoting the Use of Basque via Language TechnologyPromoting the Use of Basque via Language Technology
Promoting the Use of Basque via Language Technology
techiaith
 
SpeakApps presentation
SpeakApps  presentationSpeakApps  presentation
SpeakApps presentation
SpeakApps Project
 
Cracking the Language Barrier for a Multilingual Europe
Cracking the Language Barrier for a Multilingual EuropeCracking the Language Barrier for a Multilingual Europe
Cracking the Language Barrier for a Multilingual Europe
Georg Rehm
 
Global education conferencef
Global education conferencefGlobal education conferencef
Global education conferencef
elkanj
 
META-NET: Language Technology for Europe
META-NET: Language Technology for EuropeMETA-NET: Language Technology for Europe
META-NET: Language Technology for Europe
Georg Rehm
 
Human Language Technologies in a Multilingual Europe
Human Language Technologies in a Multilingual EuropeHuman Language Technologies in a Multilingual Europe
Human Language Technologies in a Multilingual Europe
Georg Rehm
 
Galician Experience with OpenOffice.org
Galician Experience with OpenOffice.orgGalician Experience with OpenOffice.org
Galician Experience with OpenOffice.org
Alexandro Colorado
 
LangMOOC project _EMMA Summer School 2015, Ischia, Italy
LangMOOC project _EMMA Summer School 2015, Ischia, ItalyLangMOOC project _EMMA Summer School 2015, Ischia, Italy
LangMOOC project _EMMA Summer School 2015, Ischia, Italy
Maria Perifanou
 
Dissemination Strategy Plan
Dissemination Strategy PlanDissemination Strategy Plan
Dissemination Strategy Plan
SpeakApps Project
 
EMMA Summer School - Maria Perifanou - Language Massive Open Online Courses
 EMMA Summer School - Maria Perifanou - Language Massive Open Online Courses EMMA Summer School - Maria Perifanou - Language Massive Open Online Courses
EMMA Summer School - Maria Perifanou - Language Massive Open Online Courses
EUmoocs
 
The Strategic Impact of META-NET on the Regional, National and International ...
The Strategic Impact of META-NET on the Regional, National and International ...The Strategic Impact of META-NET on the Regional, National and International ...
The Strategic Impact of META-NET on the Regional, National and International ...
Georg Rehm
 

Similar to Dacco (20)

META-NET and META-SHARE: Language Technology for Europe
META-NET and META-SHARE: Language Technology for EuropeMETA-NET and META-SHARE: Language Technology for Europe
META-NET and META-SHARE: Language Technology for Europe
 
Europeana meeting under Finland’s Presidency of the Council of the EU - Day 2...
Europeana meeting under Finland’s Presidency of the Council of the EU - Day 2...Europeana meeting under Finland’s Presidency of the Council of the EU - Day 2...
Europeana meeting under Finland’s Presidency of the Council of the EU - Day 2...
 
Apertium: Free/open-source rule-based machine translation and language proces...
Apertium: Free/open-source rule-based machine translation and language proces...Apertium: Free/open-source rule-based machine translation and language proces...
Apertium: Free/open-source rule-based machine translation and language proces...
 
Apertium: a unique free/open-source MT system for related languages [but not ...
Apertium: a unique free/open-source MT system for related languages [but not ...Apertium: a unique free/open-source MT system for related languages [but not ...
Apertium: a unique free/open-source MT system for related languages [but not ...
 
Multilingualism for Digital Europe
Multilingualism for Digital EuropeMultilingualism for Digital Europe
Multilingualism for Digital Europe
 
2016 EDRLab roadmap at epubsummit
2016 EDRLab roadmap at epubsummit2016 EDRLab roadmap at epubsummit
2016 EDRLab roadmap at epubsummit
 
META-NET: Towards a Strategic Research Agenda for Multilingual Europe
META-NET: Towards a Strategic Research Agenda for Multilingual EuropeMETA-NET: Towards a Strategic Research Agenda for Multilingual Europe
META-NET: Towards a Strategic Research Agenda for Multilingual Europe
 
Addressing multilingual challenges at Europeana: An update - DCMI 2021
Addressing multilingual challenges at Europeana: An update - DCMI 2021Addressing multilingual challenges at Europeana: An update - DCMI 2021
Addressing multilingual challenges at Europeana: An update - DCMI 2021
 
Celtic language technologies in the digital age
Celtic language technologies in the digital ageCeltic language technologies in the digital age
Celtic language technologies in the digital age
 
Promoting the Use of Basque via Language Technology
Promoting the Use of Basque via Language TechnologyPromoting the Use of Basque via Language Technology
Promoting the Use of Basque via Language Technology
 
SpeakApps presentation
SpeakApps  presentationSpeakApps  presentation
SpeakApps presentation
 
Cracking the Language Barrier for a Multilingual Europe
Cracking the Language Barrier for a Multilingual EuropeCracking the Language Barrier for a Multilingual Europe
Cracking the Language Barrier for a Multilingual Europe
 
Global education conferencef
Global education conferencefGlobal education conferencef
Global education conferencef
 
META-NET: Language Technology for Europe
META-NET: Language Technology for EuropeMETA-NET: Language Technology for Europe
META-NET: Language Technology for Europe
 
Human Language Technologies in a Multilingual Europe
Human Language Technologies in a Multilingual EuropeHuman Language Technologies in a Multilingual Europe
Human Language Technologies in a Multilingual Europe
 
Galician Experience with OpenOffice.org
Galician Experience with OpenOffice.orgGalician Experience with OpenOffice.org
Galician Experience with OpenOffice.org
 
LangMOOC project _EMMA Summer School 2015, Ischia, Italy
LangMOOC project _EMMA Summer School 2015, Ischia, ItalyLangMOOC project _EMMA Summer School 2015, Ischia, Italy
LangMOOC project _EMMA Summer School 2015, Ischia, Italy
 
Dissemination Strategy Plan
Dissemination Strategy PlanDissemination Strategy Plan
Dissemination Strategy Plan
 
EMMA Summer School - Maria Perifanou - Language Massive Open Online Courses
 EMMA Summer School - Maria Perifanou - Language Massive Open Online Courses EMMA Summer School - Maria Perifanou - Language Massive Open Online Courses
EMMA Summer School - Maria Perifanou - Language Massive Open Online Courses
 
The Strategic Impact of META-NET on the Regional, National and International ...
The Strategic Impact of META-NET on the Regional, National and International ...The Strategic Impact of META-NET on the Regional, National and International ...
The Strategic Impact of META-NET on the Regional, National and International ...
 

More from Carles Pina Estany

Circumnavigating the Antarctic with Python and Django during ACE 2016
Circumnavigating the Antarctic with Python and Django during ACE 2016Circumnavigating the Antarctic with Python and Django during ACE 2016
Circumnavigating the Antarctic with Python and Django during ACE 2016
Carles Pina Estany
 
ACE (Antarctic Circumnavigation Expedition) 2016 IT
ACE (Antarctic Circumnavigation Expedition) 2016 ITACE (Antarctic Circumnavigation Expedition) 2016 IT
ACE (Antarctic Circumnavigation Expedition) 2016 IT
Carles Pina Estany
 
Expedición ACE: dando la vuelta la Antártida
Expedición ACE: dando la vuelta la AntártidaExpedición ACE: dando la vuelta la Antártida
Expedición ACE: dando la vuelta la Antártida
Carles Pina Estany
 
Seal traveling - Icehack
Seal traveling - IcehackSeal traveling - Icehack
Seal traveling - Icehack
Carles Pina Estany
 
Benches
BenchesBenches
Midi madness
Midi madnessMidi madness
Midi madness
Carles Pina Estany
 
Olfactory notifications
Olfactory notificationsOlfactory notifications
Olfactory notifications
Carles Pina Estany
 
Dynamic Slides using OpenOffice.org Impress and Python
Dynamic Slides using OpenOffice.org Impress and PythonDynamic Slides using OpenOffice.org Impress and Python
Dynamic Slides using OpenOffice.org Impress and Python
Carles Pina Estany
 

More from Carles Pina Estany (8)

Circumnavigating the Antarctic with Python and Django during ACE 2016
Circumnavigating the Antarctic with Python and Django during ACE 2016Circumnavigating the Antarctic with Python and Django during ACE 2016
Circumnavigating the Antarctic with Python and Django during ACE 2016
 
ACE (Antarctic Circumnavigation Expedition) 2016 IT
ACE (Antarctic Circumnavigation Expedition) 2016 ITACE (Antarctic Circumnavigation Expedition) 2016 IT
ACE (Antarctic Circumnavigation Expedition) 2016 IT
 
Expedición ACE: dando la vuelta la Antártida
Expedición ACE: dando la vuelta la AntártidaExpedición ACE: dando la vuelta la Antártida
Expedición ACE: dando la vuelta la Antártida
 
Seal traveling - Icehack
Seal traveling - IcehackSeal traveling - Icehack
Seal traveling - Icehack
 
Benches
BenchesBenches
Benches
 
Midi madness
Midi madnessMidi madness
Midi madness
 
Olfactory notifications
Olfactory notificationsOlfactory notifications
Olfactory notifications
 
Dynamic Slides using OpenOffice.org Impress and Python
Dynamic Slides using OpenOffice.org Impress and PythonDynamic Slides using OpenOffice.org Impress and Python
Dynamic Slides using OpenOffice.org Impress and Python
 

Recently uploaded

Generating privacy-protected synthetic data using Secludy and Milvus
Generating privacy-protected synthetic data using Secludy and MilvusGenerating privacy-protected synthetic data using Secludy and Milvus
Generating privacy-protected synthetic data using Secludy and Milvus
Zilliz
 
Main news related to the CCS TSI 2023 (2023/1695)
Main news related to the CCS TSI 2023 (2023/1695)Main news related to the CCS TSI 2023 (2023/1695)
Main news related to the CCS TSI 2023 (2023/1695)
Jakub Marek
 
Serial Arm Control in Real Time Presentation
Serial Arm Control in Real Time PresentationSerial Arm Control in Real Time Presentation
Serial Arm Control in Real Time Presentation
tolgahangng
 
Deep Dive: AI-Powered Marketing to Get More Leads and Customers with HyperGro...
Deep Dive: AI-Powered Marketing to Get More Leads and Customers with HyperGro...Deep Dive: AI-Powered Marketing to Get More Leads and Customers with HyperGro...
Deep Dive: AI-Powered Marketing to Get More Leads and Customers with HyperGro...
saastr
 
Letter and Document Automation for Bonterra Impact Management (fka Social Sol...
Letter and Document Automation for Bonterra Impact Management (fka Social Sol...Letter and Document Automation for Bonterra Impact Management (fka Social Sol...
Letter and Document Automation for Bonterra Impact Management (fka Social Sol...
Jeffrey Haguewood
 
Digital Marketing Trends in 2024 | Guide for Staying Ahead
Digital Marketing Trends in 2024 | Guide for Staying AheadDigital Marketing Trends in 2024 | Guide for Staying Ahead
Digital Marketing Trends in 2024 | Guide for Staying Ahead
Wask
 
Skybuffer AI: Advanced Conversational and Generative AI Solution on SAP Busin...
Skybuffer AI: Advanced Conversational and Generative AI Solution on SAP Busin...Skybuffer AI: Advanced Conversational and Generative AI Solution on SAP Busin...
Skybuffer AI: Advanced Conversational and Generative AI Solution on SAP Busin...
Tatiana Kojar
 
Presentation of the OECD Artificial Intelligence Review of Germany
Presentation of the OECD Artificial Intelligence Review of GermanyPresentation of the OECD Artificial Intelligence Review of Germany
Presentation of the OECD Artificial Intelligence Review of Germany
innovationoecd
 
Best 20 SEO Techniques To Improve Website Visibility In SERP
Best 20 SEO Techniques To Improve Website Visibility In SERPBest 20 SEO Techniques To Improve Website Visibility In SERP
Best 20 SEO Techniques To Improve Website Visibility In SERP
Pixlogix Infotech
 
GenAI Pilot Implementation in the organizations
GenAI Pilot Implementation in the organizationsGenAI Pilot Implementation in the organizations
GenAI Pilot Implementation in the organizations
kumardaparthi1024
 
Recommendation System using RAG Architecture
Recommendation System using RAG ArchitectureRecommendation System using RAG Architecture
Recommendation System using RAG Architecture
fredae14
 
Monitoring and Managing Anomaly Detection on OpenShift.pdf
Monitoring and Managing Anomaly Detection on OpenShift.pdfMonitoring and Managing Anomaly Detection on OpenShift.pdf
Monitoring and Managing Anomaly Detection on OpenShift.pdf
Tosin Akinosho
 
Operating System Used by Users in day-to-day life.pptx
Operating System Used by Users in day-to-day life.pptxOperating System Used by Users in day-to-day life.pptx
Operating System Used by Users in day-to-day life.pptx
Pravash Chandra Das
 
HCL Notes and Domino License Cost Reduction in the World of DLAU
HCL Notes and Domino License Cost Reduction in the World of DLAUHCL Notes and Domino License Cost Reduction in the World of DLAU
HCL Notes and Domino License Cost Reduction in the World of DLAU
panagenda
 
A Comprehensive Guide to DeFi Development Services in 2024
A Comprehensive Guide to DeFi Development Services in 2024A Comprehensive Guide to DeFi Development Services in 2024
A Comprehensive Guide to DeFi Development Services in 2024
Intelisync
 
dbms calicut university B. sc Cs 4th sem.pdf
dbms  calicut university B. sc Cs 4th sem.pdfdbms  calicut university B. sc Cs 4th sem.pdf
dbms calicut university B. sc Cs 4th sem.pdf
Shinana2
 
How to Interpret Trends in the Kalyan Rajdhani Mix Chart.pdf
How to Interpret Trends in the Kalyan Rajdhani Mix Chart.pdfHow to Interpret Trends in the Kalyan Rajdhani Mix Chart.pdf
How to Interpret Trends in the Kalyan Rajdhani Mix Chart.pdf
Chart Kalyan
 
Nordic Marketo Engage User Group_June 13_ 2024.pptx
Nordic Marketo Engage User Group_June 13_ 2024.pptxNordic Marketo Engage User Group_June 13_ 2024.pptx
Nordic Marketo Engage User Group_June 13_ 2024.pptx
MichaelKnudsen27
 
Deep Dive: Getting Funded with Jason Jason Lemkin Founder & CEO @ SaaStr
Deep Dive: Getting Funded with Jason Jason Lemkin Founder & CEO @ SaaStrDeep Dive: Getting Funded with Jason Jason Lemkin Founder & CEO @ SaaStr
Deep Dive: Getting Funded with Jason Jason Lemkin Founder & CEO @ SaaStr
saastr
 
Building Production Ready Search Pipelines with Spark and Milvus
Building Production Ready Search Pipelines with Spark and MilvusBuilding Production Ready Search Pipelines with Spark and Milvus
Building Production Ready Search Pipelines with Spark and Milvus
Zilliz
 

Recently uploaded (20)

Generating privacy-protected synthetic data using Secludy and Milvus
Generating privacy-protected synthetic data using Secludy and MilvusGenerating privacy-protected synthetic data using Secludy and Milvus
Generating privacy-protected synthetic data using Secludy and Milvus
 
Main news related to the CCS TSI 2023 (2023/1695)
Main news related to the CCS TSI 2023 (2023/1695)Main news related to the CCS TSI 2023 (2023/1695)
Main news related to the CCS TSI 2023 (2023/1695)
 
Serial Arm Control in Real Time Presentation
Serial Arm Control in Real Time PresentationSerial Arm Control in Real Time Presentation
Serial Arm Control in Real Time Presentation
 
Deep Dive: AI-Powered Marketing to Get More Leads and Customers with HyperGro...
Deep Dive: AI-Powered Marketing to Get More Leads and Customers with HyperGro...Deep Dive: AI-Powered Marketing to Get More Leads and Customers with HyperGro...
Deep Dive: AI-Powered Marketing to Get More Leads and Customers with HyperGro...
 
Letter and Document Automation for Bonterra Impact Management (fka Social Sol...
Letter and Document Automation for Bonterra Impact Management (fka Social Sol...Letter and Document Automation for Bonterra Impact Management (fka Social Sol...
Letter and Document Automation for Bonterra Impact Management (fka Social Sol...
 
Digital Marketing Trends in 2024 | Guide for Staying Ahead
Digital Marketing Trends in 2024 | Guide for Staying AheadDigital Marketing Trends in 2024 | Guide for Staying Ahead
Digital Marketing Trends in 2024 | Guide for Staying Ahead
 
Skybuffer AI: Advanced Conversational and Generative AI Solution on SAP Busin...
Skybuffer AI: Advanced Conversational and Generative AI Solution on SAP Busin...Skybuffer AI: Advanced Conversational and Generative AI Solution on SAP Busin...
Skybuffer AI: Advanced Conversational and Generative AI Solution on SAP Busin...
 
Presentation of the OECD Artificial Intelligence Review of Germany
Presentation of the OECD Artificial Intelligence Review of GermanyPresentation of the OECD Artificial Intelligence Review of Germany
Presentation of the OECD Artificial Intelligence Review of Germany
 
Best 20 SEO Techniques To Improve Website Visibility In SERP
Best 20 SEO Techniques To Improve Website Visibility In SERPBest 20 SEO Techniques To Improve Website Visibility In SERP
Best 20 SEO Techniques To Improve Website Visibility In SERP
 
GenAI Pilot Implementation in the organizations
GenAI Pilot Implementation in the organizationsGenAI Pilot Implementation in the organizations
GenAI Pilot Implementation in the organizations
 
Recommendation System using RAG Architecture
Recommendation System using RAG ArchitectureRecommendation System using RAG Architecture
Recommendation System using RAG Architecture
 
Monitoring and Managing Anomaly Detection on OpenShift.pdf
Monitoring and Managing Anomaly Detection on OpenShift.pdfMonitoring and Managing Anomaly Detection on OpenShift.pdf
Monitoring and Managing Anomaly Detection on OpenShift.pdf
 
Operating System Used by Users in day-to-day life.pptx
Operating System Used by Users in day-to-day life.pptxOperating System Used by Users in day-to-day life.pptx
Operating System Used by Users in day-to-day life.pptx
 
HCL Notes and Domino License Cost Reduction in the World of DLAU
HCL Notes and Domino License Cost Reduction in the World of DLAUHCL Notes and Domino License Cost Reduction in the World of DLAU
HCL Notes and Domino License Cost Reduction in the World of DLAU
 
A Comprehensive Guide to DeFi Development Services in 2024
A Comprehensive Guide to DeFi Development Services in 2024A Comprehensive Guide to DeFi Development Services in 2024
A Comprehensive Guide to DeFi Development Services in 2024
 
dbms calicut university B. sc Cs 4th sem.pdf
dbms  calicut university B. sc Cs 4th sem.pdfdbms  calicut university B. sc Cs 4th sem.pdf
dbms calicut university B. sc Cs 4th sem.pdf
 
How to Interpret Trends in the Kalyan Rajdhani Mix Chart.pdf
How to Interpret Trends in the Kalyan Rajdhani Mix Chart.pdfHow to Interpret Trends in the Kalyan Rajdhani Mix Chart.pdf
How to Interpret Trends in the Kalyan Rajdhani Mix Chart.pdf
 
Nordic Marketo Engage User Group_June 13_ 2024.pptx
Nordic Marketo Engage User Group_June 13_ 2024.pptxNordic Marketo Engage User Group_June 13_ 2024.pptx
Nordic Marketo Engage User Group_June 13_ 2024.pptx
 
Deep Dive: Getting Funded with Jason Jason Lemkin Founder & CEO @ SaaStr
Deep Dive: Getting Funded with Jason Jason Lemkin Founder & CEO @ SaaStrDeep Dive: Getting Funded with Jason Jason Lemkin Founder & CEO @ SaaStr
Deep Dive: Getting Funded with Jason Jason Lemkin Founder & CEO @ SaaStr
 
Building Production Ready Search Pipelines with Spark and Milvus
Building Production Ready Search Pipelines with Spark and MilvusBuilding Production Ready Search Pipelines with Spark and Milvus
Building Production Ready Search Pipelines with Spark and Milvus
 

Dacco

  • 1. Dacco and qdacco An Open Source Catalan-English Dictionary and its GUI Fosdem 2008 in Brussels, Belgium English- Catalan Catalan- English Carles Pina i Estany carles@pina.cat
  • 2. 2/21 What is the history of Catalan and what is its future? ● It's a Romance language that derives from Latin, just like Spanish, Italian, French, Portuguese and Romanian ● The Catalan linguistic and cultural community has its own .cat Top Level Domain ● It's an official language of Catalonia, Andorra, Valencia and the Balearic Islands. It is also spoken in Alghero (Sardinia) and parts of southern France
  • 4. 4/21 Where is it spoken? Author: Josep Gallart
  • 5. 5/21 Speakers as % of EU population Catalan Portuguese Dutch Spanish 0 2 4 6 8 10 12 14 16 % Data source: http://ec.europa.eu/public_opinion/archives/ebs/ebs_243_en.pdf 9,1 Million
  • 6. 6/21 What is Dacco? ● English-Catalan dictionary tailored for speakers of both languages ● All entries are written in XML – Also exported to PDF for improved usability – Form-based web search at www.catalandictionary.org – Plugins available for Firefox and IE – Gadget for iGoogle homepage – Catalan verb conjugator – qdacco (standalone application)
  • 7. 7/21 Dacco's history ● An English student of Catalan found that there was a lack of Catalan dictionaries available which were comprehensive and aimed at English speakers ● This was confirmed by other students of Catalan through a survey she distributed during her PhD studies ● The collection of dictionary entries was begun in 2001. The first version of Dacco was released in 2003
  • 8. 8/21 ● People from all walks of life: linguists but also mathematicians, geographers, IT professionals. Who contributes? Lou Hevly Josep M. López James Macgill Linda Oxnard Oriol Vilaseca Others: •Joanathan Kaye •Gill Martin •David Gimeno •Jaume Ortolà •Max Wheeler •Margarita Castañón Leopold Palomo
  • 9. 9/21 The 'average' contributor ● Aged between 20 and 80 ● English speaker living in UK, US, Canada or Australia ● Catalan speaker from Barcelona, Girona, Tarragona, Valencia or the Balearic Islands ● Common purpose: wanting to create a resource which is not only useful to language learners but is also free and open
  • 10. 10/21 Where do we get our entries from? ● We do not copy entries from any dictionary ● Agreement with TermCat to incorporate their open source vocabulary lists into our dictionary ● Entries are often added by those who use the dictionary in their daily tasks when they search for a word and fail to find it ● Recently: we are matching some Catalan-Latin- English birds list to incorporate
  • 11. 11/21 How does the project work? ● First step: find a new word – Mailing list: regular contributors send suggestions – Web form: any user can send suggestions – 'Missing word lists': created automatically from online search engine and manually from suggestions sent through qdacco ● Second Step: discuss the new word's translation equivalent – Different meanings according to geographic area, age, socioeconomic status, etc. ● Third Step: project admin adds word to the XML
  • 12. 12/21 Dacco's Special Features (1/3) ● It's free: anyone can modify, copy or redistribute – Data (XML files): LGPL – PDF files: CC Attribution-Share Alike 2.5 – qdacco: GPL 3 ● More than 15,000 entries in each side of the dictionary (over 200 DIN-A4 pages)
  • 13. 13/21 Dacco's Special Features (2/3) ● 4 dictionaries: – Catalan-English dictionary – English-Catalan dictionary – Catalan-English dictionary – English-Catalan dictionary ● Examples and usage notes tailored according to user's native language for Catalan speakers for English speakers
  • 14. 14/21 Dacco's Special Features (3/3) ● Links to images ● Usage notes ● Examples ● Word frequency counts (from Google) ● Semantic fields: apple ⇒ fruit ⇒ food
  • 15. 15/21 Why is it so important that the dictionary be open source? ● Culture should be free ● Anybody can incorporate it into their own application or web site ● Anybody can suggest entries, though every suggestion is examined and discussed by team of contributors so that quality of dictionary is never compromised. Descriptive, not normative dictionary. ● We encourage others to create their own open source dictionaries and are happy to share the benefit of our experience
  • 16. 16/21 What is qdacco? ● Multi-platform (Unix, Linux, Windows*) standalone application ● Available in standard Debian repositories ● qdacco has been developed since 2005 – Approximately two major releases a year
  • 18. 18/21 Why did we create qdacco? ● We wanted a reference application for Dacco ● Access to all Dacco resources (photos, links, examples, notes, etc.) ● Reports missing words to the Dacco Project. ● Integration with Festival (Speech synthesizer) ● Auto-completion and many other features ● Offline searching - faster than looking through a PDF file on your own.
  • 20. 20/21 Similar projects ● GPL German-Catalan dictionary: – GPL Deutsch-Katalanisches Wörterbuch – http://www.aldeaglobal.net/diccionari/index.php ● Wiktionary: – Catalan: http://ca.wiktionary.org/wiki/Portada – English: http://en.wiktionary.org/wiki/Main_Page
  • 21. 21/21 Thanks for your attention Questions? ? Mail: carles@pina.cat http://www.catalandictionary.org
  • 22. 22/21 Creative Commons This work is licensed under the Creative Commons Attribution 2.5 Spain License. To view a copy of this license, visit http://creativecommons.org/licenses/by/2.5/es/ or send a letter to Creative Commons, 171 Second Street, Suite 300, San Francisco, California, 94105, USA.

Editor's Notes

  1. No parlar (excepte Carles)
  2. German case
  3. Version for catalan and english Castell diference
  4. Tota la peña del qui-som Non-academic people = carles and I
  5. Mencionar rapid els primers punts, per relacionar-ho amb les fotos del redere
  6. no, except termcat because Free license
  7. Carles Why LGPL Why Creative Commons
  8. Ensaimada example (una imatge val més que 1000 paraules)
  9. as beer should be
  10. libqdacco, textdacco, qdacco, newarchitecture 0.7 windows*
  11. oxford genie
  12. German culture again