Data
-
WebExtension Manifest.json permissions options list
WebExtension Manifest.json permissions options list is quite long. Let us see what we can access with our plugin for FF. This hsould inspire You to write Your own for quality of life and ease of doing things… or avoid them all together. For avoidance i would highly recommend to look into prper ad block filter and creating / adding your own. Rule : Do not scare users Remember when Your app requires and asks for access. You either explain to user in every detail what for and go with bare minimum. Just as a rule of thumb of not beeing accused of getting and selling info. It is always scary…
-
Data contracts. Building universal data access proxy api
Data contracts usually requires us to build around them a universal data access proxy API for users to consume. API utilizing proper data contracts, negotiated with different teams, acts as a unified gateway providing the necesities. Allows access to data sources like databases, REST APIs, GraphQL endpoints or other file systems. One api to rule them all Steps to build data contracts for proxy api You could try and adopt a similar flow for creating such access points, even make a template in JIRa so You will know where to get proper data and how to acquire it… or maybe expose the library and just aprove properly looking merge requests……
-
When to use Value Objects in database
Question when to use value objects in database is usually discussed during storing a new set of data. Usage of value objects should be considered while modeling abstractions that are constant / immutable and do not require unique identity. Examples can be: geospatial data, estate addresses or any categories that belong to an entity. VO also encapsulates validation logic and business rules. Helps to avoid raw strings or magic numbers. When to use it and its benefits How to map a value object in db Scenario Mapping Benefit Embed Columns in parent table (Street, City in Address) Fast reads, no joins Complex nesting JSONB/PostgreSQL or owned entities Flexible schema, validation…
-
Burner email
Burner email is number one technique to keep You sane of Your private data… beeing private. This is not about just loosing Your account, it is about loosing, breaching, leaking Your accoun, Your email, profile data You provied and password to the general public. Naughty hackers included. Burner mail address is a smart privacy and security strategy (checkout OWASP ) . It is a temporary, disposable address, and and account You make based on that email, that keeps your true, real life data separated from anything else. Crucially, you should use something You can always loose.. better yet.. Your worst enemy can get their hands on… Burner email services Name…
-
How to check if Your email and password was leaked
Why You need a strong passwords and check if it was breached How to check if Your email and password was leaked ? Just check the security blogs or institutions like https://databreach.com/. Google the title of this blog post. Fix your breached credentials or You will be sorry ! Passwords are Your first line of defense. Multiple factor authentication (MFA) like sms, email verification or authentication application should also be something You do! How to make a good password ? Many different ways to make it happend. I would recommend to consider : This is a simple algorith You can use to remember all of those passwords and every one…
-
Adr vs readme vs changelog
Adr, readme and changelog are the most valueable pieces of documentation that a project could? / should? maintain. They provide us with crucial, core informations we need to star and continue effective development. Let us look at them closely. Usually we can keep everything in plain text or better yet use a Markdown format. You can even try and use some linter, highlgihts for easier writing. ADR (Architectural Decision Record) ADR (Architectural Decision Record): Document that holds a single, significant architectural decision. Includes the context, options that were considered, chosen solution with consequences. Serves as a traceable rationale for future reference as well as new team members. Readme txt or…
-
The Golden Byte. Most valuable data
The Golden Byte. Most valuable data In data engineering, every byte has a cost but not all bytes are made to be equal ( read Animal Farm by George Orwell). We collect terabytes of data in the form of logs, metrics, cookies, text, pictorues and transactions. Yet only a small portion of this information is truly crucial and drives business outcomes. That fraction is what can call the Golden Byte, single most valuable unit of data that fuels strategic insight and decision-making. Data tiers architecture The Golden Byte embodies the essence of a gold layer in modern data architecture: raw ,curated , aggregated, and business-ready information. It is the outcome…
-
Popular LLMs training data, what do they use ?
Popular LLMs training data seems to be universal and generic. This is why such models are so popular, they more or less know an answer to everything. But how do they come about to those answers ? What is the source of that ? Where do they get the data from ? Let`s search the web the old fashioned way and find out. Popular LLMs training data types The training data for these models come from all around the world. We humans are the ones that provide it. It is our work that is pushed into a model. LLMs training data reflects carefully curated huge datasets designed to provide high…
-
Know Your data. Cost per byte vs value per byte
Cost per byte vs. Value per byte: Rethinking Data Efficiency We are living in an era where nothing gets erased (just archived). Let us dwell on cost per byte vs value per byte of such data. Every byte you store, move, or process has a cost. We focus on cost saving. Data engineering isn’t just about hoarding everything, it’s a calculated risk about understanding whether those bytes are worth to store them. Pro hint – do not fall into the trap of ‘let us grab everything and think about it later’. It does make sense until you figure out what is what but then remember to delete it ? Oh…


















