Back to LanguageTool Homepage - Privacy - Imprint

[pt] Portuguese rule contribution/discussion


(Marco A.G.Pinto) #1

@tiagosantos

Hello!

While I was sleeping this came to mind:
Is it different to write “chocar com” from “chocar contra”?

“choquei com a parede”

Not 100% sure.


[pt] Some false positives - 2019-01-18
(Tiago F. Santos) #2

Not sure either, but verbal regency rules are on the plans. If you confirm that with an URL, please, share the rule so it can be added.

This is also a great rule, within the same category (verbal regency), so likewise.

Can we make this the new [pt] thread, and place all [pt] rule contributions/discussions here?

Having a single thread, makes everything tidier and easier to reference. I feel discouraged when searching for old threads in the dozens of posts this forum has, so potencially good ideas get lost this way.


(Marco A.G.Pinto) #3

Sure, @tiagosantos.

Could you just change the topic subject by clicking in the pencil so that people know this is the topic?

Or maybe only the person who creates the topic can edit its name? If so, which name should be given?

Thanks!


(Tiago F. Santos) #4

It has to be you.
[pt] Portuguese rule contribution
[pt] Portuguese rule discussion
or anything else you see fit.


(Marco A.G.Pinto) #5

Done!


(Tiago F. Santos) #6

Many thanks.


(Marco A.G.Pinto) #7

Possible rule:
DEPOIS DE/APÓS + VERB 3rd PERSON + À/AO -> suggest VERB INFINITIVE

Hello @tiagosantos

I was reading this article:
http://sicnoticias.sapo.pt/mundo/2017-01-30-Viver-no-espaco-altera-os-genes-revela-estudo-da-NASA-sobre-gemeos-astronautas

They wrote:

Houve, já se sabe, alterações que no entanto foram revertidas, como foi o caso de Scott Kelly ter crescido 3,81 centímetros, mas depois de regressa à Terra voltou à altura que tinha.

“depois de REGRESSA à” should suggest “REGRESSAR” in infinite.

Great possible rule, eh? I have seen several persons committing this mistake.

Thanks!

Kind regards,
>Marco A.G.Pinto
------------------------


(Tiago F. Santos) #8

DONE

[spoiler]

[quote=“marcoagpinto, post:4, topic:1340”]
For what I can think:[1] a [2] = [1] à [2]

[1]hojeamanhãsegundasegunda-feiraterçaterça-feiraquartaquarta-feiraquintaquinta-feirasextasexta-feirasábadodomingo

[2]tardenoite
[/quote][/spoiler]

All added to the list. Keep’em coming.


(Tiago F. Santos) #9

Since the Dicionários Natura is not releasing updated versions of the dictionaries I though on moving on with my own dictionary releases in GitHub. The catch is, I need someone to review it.
My changes are only derivations (prefixes and sufixes), and they are based in general morphological rules. I have reviewed them summarily, but they require extra checking.
If you cooperate on this task, I will push the first version as a new GitHub project later this week. First step is only plurals and only AO90.


(Marco A.G.Pinto) #10

Sure,

I will test it with my thesis + dissertation + book :slight_smile:


(Marco A.G.Pinto) #11

@tiagosantos

I have a deep interest in the prereform pt_PT speller as I still write without the AO90.

Is there a way to make it official?

I mean, to transfer the maintenance from Minho University to you, and use it in OpenOffice + LibreOffice?

I could search for the e-mails I sent to Minho University years ago suggesting many hundreds of words, none of which were added.

:frowning:


(Marco A.G.Pinto) #12

@tiagosantos

A couple of years ago or so, I was in Freenode annoying the LibreOffice guys to accept both pre and post reform pt_PT spellers and they said there was already a code to differentiate both (or something like that) in the official languages somewhere site where there are the standards.

Maybe it could be possible to release both, pre+post, in the same OXT and LO allow to choose between them? Or maybe that would need to be improved in LO 5.4? :slight_smile:


(Tiago F. Santos) #13

Awesome. Many thanks. It is not a easy task.

Do not get me wrong, Marco. I do not wish to fork the project, nor take that extra responsability, but I believe that having the improvements tested here before integration in the “mainstream” would be a win-win situation. LibreOffice could also use it later, after sufficient triage (one release cycle, maybe).

This is great.
One thing that may be easier is to convince them to push a pre-AO90 dictionary to pt-AO.
Although they already have the locale, they do not have a dictionary pre-installed for Angola, and it is equivalent. Another win-win situation.

To make Timar understand the subject, point him here and to this link that may help:
http://svepotranslations.se/?p=809

Toggle solutions will not be easy.


(Tiago F. Santos) #14

Released. I made it an integrated extension and with hyphenator, thesaurus and both dictionary versions. Try it here:

It has changed roughly 30k dictionary entries, so some may need a fix. Extensive review is required before integration in LO, but I believe that in a week we can integrate in LanguageTool for wide usage testing.


(Tiago F. Santos) #15

For testing convenience, I added a Firefox dictionary add-on. Only Pos-AO90.
https://github.com/TiagoSantos81/PortugueseLibreOfficeExtension/raw/master/FirefoxDictionary.xpi


(Marco A.G.Pinto) #16

Tiago, I still haven’t had the time to check anything.

Could you do a Firefox add-on pre-AO90?

:slight_smile:


(Tiago F. Santos) #17

It would have to be set as a pt-AO locale to avoid reporting conflict. The pre-AO version is also more conservative. It is limited to improvements in gender and number variations.
Good enough?

Any news on this, or on LibreOffice add-on testing?

NOTE: All feedback is welcome. This thread is not an “internal discussion” so all users testing the new dictionary version are welcome to provide constructive criticism.


(Marco A.G.Pinto) #18

Sorry… I spent the day coding to fix some issues in the PhD project (software).

Moments ago I installed the OXT in LO 5.3 and here are most of the words that appear as typos in the thesis (most probably already suggested by me to Minho University) (notice that I used M$ Word 2016 pre-AO to write it):

  1. Jorge Canelhas
  2. co-orientado
  3. Teresa Baptista
  4. Isabel Pita
  5. SeaMonkey
  6. Edma
  7. Ortins de Bettencourt
  8. Assembly Z80
  9. shareware
  10. freeware
  11. Commodore
  12. PCs
  13. hobby
  14. para Windows, Linux e Mac
  15. beta-tester
  16. desenvolvedor + desenvolvedores
  17. Hunspell
  18. SeaMonkey
  19. far-se-á
  20. disruptir
  21. hacking
  22. autoproclamada
  23. insurgências
  24. sobrelotados
  25. botnets
  26. zombies
  27. multiagente
  28. polímata
  29. XYZ
  30. subnacionais
  31. Osama Bin Laden
  32. destabilizando
  33. Sadam Hussein
  34. franchisings
  35. proibitivos
  36. hackers
  37. cracks
  38. crackers
  39. Khadafi
  40. sobrelotadas
  41. raides aéreos
  42. Análise Bayesiana
  43. grupal
  44. interorganizacional
  45. know-how
  46. sememas
  47. interrelacionadas
  48. difundível
  49. ataques DoS/DDoS
  50. interconectividade
  51. interconectados
  52. SPAM (check if it can be written in lowercase)
  53. token + tokens
  54. hash
  55. interrelacionam
  56. experienciámos
  57. cluster
  58. inspiracional
  59. feedback
  60. robots + robot
  61. pseudo-inteligência
  62. co-fundador
  63. superinteligência
  64. pseudocódigo
  65. GUI (Interface gráfico)
  66. pixéis (não sei se leva acento)
  67. semiautónomos
  68. impassáveis
  69. aleatoriedade
  70. circunjacente
  71. impassável
  72. passável
  73. baseamo-nos
  74. ID
  75. gadget + gadgets
  76. recomputado
  77. DirectX
  78. Pentium (processador)
  79. SSD (disco)
  80. CPU
  81. rastreamento
  82. background
  83. screenshot
  84. lossless
  85. co-senos
  86. subopções
  87. reset
  88. AutoCAD
  89. pixelização
  90. Excel
  91. cache
  92. browser + browsers
  93. Bin Laden

Here are most of the words that appear as typos… again, notice that some may be pre-AO.

As you can see, even I didn’t want to, I had to use M$ Office because the pt_PT speller doesn’t recognise tons of words.

@tiagosantos
I have also tons of false positive grammar suggestions reported by LanguageTool, which I will make a list when I have some more free time.

Thanks for dedicating all this effort and time to the projects.


(Tiago F. Santos) #19

Well, those are words not included. Not actual dictionary errors. The revision has to be on false negatives, i.e. word derivations included that are incorrect.
Using your examples. If ‘rastreamenta’ (feminin of rastreamento) was considered correct, that would be a dicionary error.
Moreover, most words are foreign words or proper names (foreign or rare). See our replacement tables or the VOP for more information. If you really require them, you can add them to spelling.txt here on LT. Most are barbarisms that you should avoid, and that have no place in a proofreader. See:

Sure. Just try to show the “regular one”. We have daily regression testing, so we can avoid odd test cases like the ones in:


agreement issues with proper names, and false negatives in brands.

Dedidate your time to finish your PhD. I already have people looking into it. Thanks.


(Tiago F. Santos) #20

Pushed:

For usage examples, see:


Please, keep these thread for all portuguese matters. It is much easier to reference.