Registered since September 28th, 2017
Has a total of 3359 bookmarks.
Showing top Tags within 48 bookmarks
Tag selected: document.
Saved by uncleflo on January 4th, 2019.
As an entrepreneur, one of the most important tasks you can perform is getting your ideas out of your head and into a tangible format so that you can communicate them to others. In the past, this usually meant a well-researched business plan that would take weeks (more like months) to create. I turn to the Lean Canvas to help me quickly formulate possible business models, product launches, campaigns and variations of these, and to communicate them to my stakeholders for my Lead Flow Method work. Having the Lean Canvas as a visual guide made this part, “communicating the model/idea”, so much more effective, and I think that is the most valuable function of the tool. The problem with business plans for startups and entrepreneurs is that they are a waste of time. Don’t get me wrong: a well-researched business plan is important, but only at the right stage of your business (usually when you’re in growth/investment). This quote from Steve Blank sums up the point: “Business Plan: a document investors make you write, that they don’t read”. The key fundamental of Lean methodology is the elimination of waste; this includes time, processes, inventory and more. So as a lean startup you need a quicker way to get ideas out of your head, and you need to stay lean and avoid waste. So, it’s time to introduce the Lean Canvas.
lean optimist canvas startup unfair funnel formulate recommend quicker insider validation blank communicating proposition customer entrepreneur pricing calculate communicate invest advantage solving methodology business money return howto analysis analytical guide method waste research important model launch variation document investor tool process inventory quick idea communication pitch poster start
Saved by uncleflo on December 23rd, 2018.
There are many tools in the developer’s toolbox when it comes to automatic data extraction. A good example is the TF-IDF algorithm (Term Frequency – Inverse Document Frequency), which helps the system understand the importance of keywords extracted using OCR. Here’s how TF-IDF can be used for invoice and receipt recognition. In this article we focus on other techniques in order to make this text file “understandable” to a computer. For this purpose, we must delve into the world of NLP, or Natural Language Processing. We will focus mainly on how we can transform our file of raw text into a format that our algorithm can easily understand. In a nutshell, TF-IDF is a technique for understanding how important a word is in a document, and it is often used as a weighting factor in numerous use cases. TF-IDF takes into consideration how frequently a word appears in a single document in relation to how frequent that word is in general. Search engines can use TF-IDF to determine which results are the most relevant for a search query.
bigram tf-idf toolbox categorical algorithm classify assign vocabulary document extraction words procedure frequency count extracted word numerical development technical analysis article blog consider language process exraction important explanation
Saved by uncleflo on December 23rd, 2018.
A central question in text mining and natural language processing is how to quantify what a document is about. Can we do this by looking at the words that make up the document? One measure of how important a word may be is its term frequency (tf), how frequently a word occurs in a document. There are words in a document, however, that occur many times but may not be important; in English, these are probably words like “the”, “is”, “of”, and so forth. We might take the approach of adding words like these to a list of stop words and removing them before analysis, but it is possible that some of these words might be more important in some documents than others. A list of stop words is not a sophisticated approach to adjusting term frequency for commonly used words. Another approach is to look at a term’s inverse document frequency (idf), which decreases the weight for commonly used words and increases the weight for words that are not used very much in a collection of documents. This can be combined with term frequency to calculate a term’s tf-idf, the frequency of a term adjusted for how rarely it is used. It is intended to measure how important a word is to a document in a collection (or corpus) of documents. It is a rule-of-thumb or heuristic quantity; while it has proved useful in text mining, search engines, etc., its theoretical foundations are considered less than firm by information theory experts.
quantify tidy calculate corpus document words frequency calculating verbs numerical examine occur text weight quantity approach mining collection keyword tag analyse development howto data principle useful technical analysis developer code explanation article
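The tf and idf quantities described above can be sketched in a few lines of Python. This is a minimal illustration, not the method of the bookmarked chapter: it assumes tf is the term count divided by the document length and idf is ln(N / df), which is one common convention among several. The function name `tf_idf` and the toy corpus are made up for the example.

```python
import math
from collections import Counter

def tf_idf(docs):
    """Return one {term: tf-idf} dict per document.

    docs is a list of token lists. tf is count / document length and
    idf is ln(number of documents / documents containing the term).
    """
    n_docs = len(docs)
    # Document frequency: in how many documents does each term appear?
    df = Counter(term for doc in docs for term in set(doc))
    scores = []
    for doc in docs:
        counts = Counter(doc)
        scores.append({
            term: (count / len(doc)) * math.log(n_docs / df[term])
            for term, count in counts.items()
        })
    return scores

corpus = [
    "the cat sat on the mat".split(),
    "the dog sat on the log".split(),
    "the cat chased the dog".split(),
]
weights = tf_idf(corpus)
print(weights[0]["the"])                      # 0.0, "the" occurs in every document
print(weights[0]["mat"] > weights[0]["cat"])  # True, "mat" is rarer in the corpus
```

Under this convention a word that appears in every document gets weight zero without any stop-word list, which is the adjustment the excerpt describes.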
Saved by uncleflo on December 23rd, 2018.
I am working on text classification using SVM. In a paper (Fuzzy Support vector machine for multi-class text categorization) the author has reduced the features (words) by applying the following criteria: "Eliminate the words that are ICF>log2, Uni<0.2 and TF_IDF<26". My question is: how can we find the TF_IDF value of a word? TF is a local measure and IDF is a global measure. TF_IDF gives a different value for a word in each document. TF-IDF is the acronym for Term Frequency–Inverse Document Frequency. This metric aims at estimating how important a keyword is, not only in a particular document but in a whole collection of documents (corpus). In fact, a lot of common words such as articles or conjunctions may appear several times in a document, but they are not relevant as key concepts to be indexed or searched. TF (Term Frequency) provides a measure of how frequently a term occurs in a document.
Saved by uncleflo on December 23rd, 2018.
In information retrieval, tf–idf or TFIDF, short for term frequency–inverse document frequency, is a numerical statistic that is intended to reflect how important a word is to a document in a collection or corpus. It is often used as a weighting factor in searches of information retrieval, text mining, and user modeling. The tf–idf value increases proportionally to the number of times a word appears in the document and is offset by the number of documents in the corpus that contain the word, which helps to adjust for the fact that some words appear more frequently in general. Tf–idf is one of the most popular term-weighting schemes today; 83% of text-based recommender systems in digital libraries use tf–idf. Variations of the tf–idf weighting scheme are often used by search engines as a central tool in scoring and ranking a document's relevance given a user query. tf–idf can be successfully used for stop-words filtering in various subject fields, including text summarization and classification. One of the simplest ranking functions is computed by summing the tf–idf for each query term; many more sophisticated ranking functions are variants of this simple model.
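The simple ranking function mentioned at the end of this excerpt, summing tf–idf over the query terms, might look like the following sketch. It assumes tf = count / document length and idf = ln(N / df); real search engines use more elaborate variants. The function name `rank` and the sample documents are hypothetical.

```python
import math
from collections import Counter

def rank(query, docs):
    """Order document indices by the sum of tf-idf over the query terms."""
    n_docs = len(docs)
    # Number of documents each term appears in.
    df = Counter(term for doc in docs for term in set(doc))
    results = []
    for index, doc in enumerate(docs):
        counts = Counter(doc)
        score = sum(
            (counts[term] / len(doc)) * math.log(n_docs / df[term])
            for term in query
            if term in counts
        )
        results.append((score, index))
    # Highest score first; ties keep the lower index first.
    return sorted(results, key=lambda pair: (-pair[0], pair[1]))

documents = [
    "tf idf weights terms by rarity".split(),
    "search engines rank documents for a query".split(),
    "a query about search engines and search ranking".split(),
]
ranking = rank(["search", "query"], documents)
print([index for _, index in ranking])  # [2, 1, 0]
```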
Saved by uncleflo on December 20th, 2018.
This form of Bill of Sale is produced by the RYA for use by personal members for the transfer of an unregistered boat or a boat registered on the Part III (Small Ships) Register. Transfers of boats registered under Part I of the Central Register of British Ships should be evidenced using the Bill of Sale prescribed by the Maritime and Coastguard Agency obtainable from the UK Ship Register (RSS). In order to re-register or de-register your boat, please contact the UK Ship Register (RSS), PO Box 420, Cardiff, CF24 5XR.
Saved by uncleflo on November 27th, 2018.
The process to follow when applying to the CAA for an operational authorisation (a permission or an exemption) depends on the relative complexity of the authorisation that you are seeking. Applications for relatively ‘simple’ operational authorisations, which are commonly referred to as a ‘Standard Permission’, are addressed via an online application process. A ‘Standard Permission’ enables a person to conduct commercial operations with a small unmanned aircraft (drone) and also permits operations within a congested area. Applicants will be required to provide all documents and payment online (credit/debit card only) before submitting their application. Applications for more complex operational authorisations, commonly referred to as ‘Non-Standard Permissions’, or those that require a specific Exemption from the Air Navigation Order, are addressed via a more traditional ‘paper based’ application form, the SRG 1320.
application aircraft operation authorisation person document payment permission process depend complex seek standard addres commercial operation drone permit operation area metropolitan law allowance submit howto caa approval
Saved by uncleflo on June 29th, 2018.
Usually, programs are written and documented in English, and use English at execution time to interact with users. This is true not only of GNU software, but also of a great deal of proprietary and free software. Using a common language is quite handy for communication between developers, maintainers and users from all countries. On the other hand, most people are less comfortable with English than with their own native language, and would prefer to use their mother tongue for day-to-day work, as far as possible. Many would simply love to see their computer screen showing a lot less English, and far more of their own language. However, to many people, this dream might appear so far-fetched that they may believe it is not even worth spending time thinking about it. They have no confidence at all that the dream might ever come true. Yet some have not lost hope, and have organized themselves. The Translation Project is a formalization of this hope into a workable structure, which has a good chance of getting all of us nearer to the achievement of a truly multi-lingual set of programs. GNU gettext is an important step for the Translation Project, as it is an asset on which we may build many other steps. This package offers programmers, translators and even users a well-integrated set of tools and documentation. Specifically, the GNU gettext utilities are a set of tools that provides a framework within which other free packages may produce multi-lingual messages.
gnu gettext utility integration project translation package messge formal organize description reference official howto program software communication developer language document lingual user translater content website development administration
Saved by uncleflo on June 1st, 2018.
Saved by uncleflo on May 29th, 2018.
Every once in a while I get on a computer and I need to count the number of lines in a file. My first instinct is to open my text editor (editplus) and hit ctrl+end to get to the bottom of the document. Then I can view the status bar, which will tell me the line number. This works fine when I am on my computer but not when I am on another computer that does not have editplus installed. My next option might be to open this file in notepad and do the exact same thing. This will work fine if the file is not too large. The problem is I often deal with very large files. I need a quicker way to produce the same results. This is where DOS comes into play. I can use the following command and let DOS quickly tell me the number of lines in the file.
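The excerpt ends before the DOS command itself, and the sketch below does not try to reconstruct it. As a portable alternative for the same task, the following Python snippet counts lines in a large file by reading it in fixed-size chunks instead of loading it all at once. The function name `count_lines` and the chunk size are assumptions made for the example; it needs Python 3.8 or later.

```python
import os
import tempfile

def count_lines(path, chunk_size=1024 * 1024):
    """Count newline characters in a file, reading it in fixed-size chunks."""
    total = 0
    with open(path, "rb") as handle:
        # Read until an empty bytes object signals end of file.
        while chunk := handle.read(chunk_size):
            total += chunk.count(b"\n")
    return total

# Demonstration on a small temporary file.
with tempfile.NamedTemporaryFile("w", suffix=".txt", delete=False) as tmp:
    tmp.write("first\nsecond\nthird\n")
print(count_lines(tmp.name))  # 3
os.remove(tmp.name)
```

Note that this counts newline characters, so a final line without a trailing newline is not included in the total.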
Saved by uncleflo on February 7th, 2018.
This document specifies an Internet standards track protocol for the Internet community, and requests discussion and suggestions for improvements. Please refer to the current edition of the "Internet Official Protocol Standards" (STD 1) for the standardization state and status of this protocol. Distribution of this memo is unlimited. Lightweight Directory Access Protocol (LDAP) search filters are transmitted in the LDAP protocol using a binary representation that is appropriate for use on the network. This document defines a human-readable string representation of LDAP search filters that is appropriate for use in LDAP URLs (RFC 4516) and in other applications.
ldap document directory protocol string rfc standard technology community request discussion improvement improve specification binary representation search filter url application howto reference lookup internet suggestion administration
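When a filter string of this kind is assembled from user input, certain characters in the value have to be escaped. The sketch below assumes the escaping rules of RFC 4515 (a backslash followed by two hex digits for `*`, `(`, `)`, `\` and NUL); the helper names `escape_filter_value` and `equality_filter` are hypothetical and not part of any LDAP library.

```python
def escape_filter_value(value):
    """Escape the characters that must not appear literally in a filter value."""
    replacements = {
        "\\": r"\5c",
        "*": r"\2a",
        "(": r"\28",
        ")": r"\29",
        "\x00": r"\00",
    }
    # Character-by-character replacement avoids escaping a backslash twice.
    return "".join(replacements.get(char, char) for char in value)

def equality_filter(attribute, value):
    """Build a simple (attribute=value) filter with the value escaped."""
    return f"({attribute}={escape_filter_value(value)})"

print(equality_filter("cn", "Babs (Jensen)*"))  # (cn=Babs \28Jensen\29\2a)
```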
Saved by uncleflo on December 27th, 2017.
The Document Library includes a framework of specifications, tools, measurements and support resources to help organizations ensure the safe handling of cardholder information at every step. Guidance for PCI DSS Scoping and Segmentation: Framework for a robust payment card data security process; PCI DSS v3.2: Framework for a robust payment card data security process; SAQ Documents: Self-validation tool for merchants and service providers
document library framework specification tool measurement support resource help organization safe handling cardholder guidance data security security official standard council validation provide service industry payment development howto merchant data guide information knowledge tutorial
Saved by uncleflo on July 10th, 2017.
The OAuth 2.0 authorization framework enables a third-party application to obtain limited access to an HTTP service, either on behalf of a resource owner by orchestrating an approval interaction between the resource owner and the HTTP service, or by allowing the third-party application to obtain access on its own behalf. This specification replaces and obsoletes the OAuth 1.0 protocol described in RFC 5849.
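As an illustration of the approval interaction, a client using the authorization code grant starts by sending the resource owner to the authorization endpoint with a handful of query parameters. The sketch below builds such a URL with the standard library; the endpoint, client identifier, redirect URI and scope values are placeholders, and the function name `build_authorization_url` is an assumption for this example.

```python
import secrets
from urllib.parse import parse_qs, urlencode, urlsplit

def build_authorization_url(authorize_endpoint, client_id, redirect_uri, scope):
    """Return (url, state) for an authorization code request.

    Assumes authorize_endpoint has no query string of its own.
    """
    # The state value lets the client match the later redirect to this request.
    state = secrets.token_urlsafe(16)
    params = {
        "response_type": "code",
        "client_id": client_id,
        "redirect_uri": redirect_uri,
        "scope": scope,
        "state": state,
    }
    return f"{authorize_endpoint}?{urlencode(params)}", state

url, state = build_authorization_url(
    "https://auth.example.com/authorize",   # placeholder endpoint
    "my-client-id",                         # placeholder client identifier
    "https://app.example.com/callback",     # placeholder redirect URI
    "profile",
)
query = parse_qs(urlsplit(url).query)
print(query["response_type"])  # ['code']
```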
Saved by uncleflo on June 30th, 2017.
Qualys is the leading provider of integrated asset discovery, network security, threat protection, compliance monitoring and web application security solutions. SSL Labs is a collection of documents, tools and thoughts related to SSL. It's an attempt to better understand how SSL is deployed, and an attempt to make it better. I hope that, in time, SSL Labs will grow into a forum where SSL will be discussed and improved. SSL Labs is a non-commercial research effort, and we welcome participation from any individual and organization interested in SSL.
Saved by uncleflo on June 23rd, 2017.
Software that lets you edit a shared document together with others over the internet can be really useful if you’re working in a team, especially a distributed one. Nowadays, there is a lot of web-based software for simultaneous collaborative editing, no matter if you’re working on rich-text documents, spreadsheets, presentations or source code. Let’s take a moment to define more precisely what we mean by the term realtime collaborative editing. What we want is that multiple people working at different computers can make changes to a document hosted on a server at any time. These changes are synchronized immediately with the other peers, in contrast to version control systems like Git, where one usually works on a feature all by oneself and eventually merges the changes back to the project. No client should have to communicate with the server or any other client before making a change. In particular, there is no need to acquire a lock from the server to make an edit, and concurrent edits can occur. After all changes have been synchronized, every client should see the exact same document. It turns out that implementing this kind of real-time collaboration is far from trivial. The most common solution goes by the name Operational Transformation (usually abbreviated OT). It originated from a research paper published in 1989 but was popularized more recently by Google Wave. Today, it powers many collaborative editors such as.
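The core idea behind OT can be sketched for the simplest case: two clients concurrently insert text into the same document, and each transforms the other's operation before applying it so that both end up with the same result. This is a toy illustration restricted to insert operations, not the algorithm of any particular editor; the `Insert` class, the `site` tie-breaker and the function names are assumptions made for the example.

```python
from dataclasses import dataclass

@dataclass(frozen=True)
class Insert:
    position: int
    text: str
    site: int  # hypothetical client identifier, used only to break ties

def apply(document, op):
    """Apply an insert operation to a string."""
    return document[:op.position] + op.text + document[op.position:]

def transform(op, other):
    """Rewrite op so it can be applied after the concurrent insert `other`."""
    before = op.position < other.position
    tie_won = op.position == other.position and op.site < other.site
    if before or tie_won:
        return op
    # The other insert landed at or before our position: shift right.
    return Insert(op.position + len(other.text), op.text, op.site)

base = "abc"
a = Insert(1, "X", site=1)  # made by client 1
b = Insert(1, "Y", site=2)  # made concurrently by client 2

at_site_1 = apply(apply(base, a), transform(b, a))
at_site_2 = apply(apply(base, b), transform(a, b))
print(at_site_1, at_site_2)  # aXYbc aXYbc
```

Both clients apply their own edit immediately, without asking the server for a lock, and still converge once the transformed remote edit arrives.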
Saved by uncleflo on May 6th, 2017.
This page contains general information posted by the FPS Foreign Affairs. If you are residing abroad, please verify in this list whether your embassy or consulate has posted specific information for your place of residence on its website. A passport is an official travel document, recognized and usable worldwide, in the format of a booklet issued by the relevant authority in a particular State to its nationals. One must make a distinction between a passport and an identity card which, as its name implies, is just a single card. Belgian passports are valid for 7 years (5 years for minors). Any Belgian citizen can be issued his or her own passport starting at birth. It is possible to travel to around 50 countries using an identity card, but a passport allows its holder to travel to any country in the world, subject to fulfilling any applicable visa requirements.
Saved by uncleflo on December 2nd, 2016.
This document specifies an Internet standards track protocol for the Internet community, and requests discussion and suggestions for improvements. Please refer to the current edition of the "Internet Official Protocol Standards" (STD 1) for the standardization state and status of this protocol. Distribution of this memo is unlimited.
Saved by uncleflo on December 2nd, 2016.
One of the most interesting domains of future Internet applications will be located in the area of electronic commerce, where online products and services are offered. Products of value are accounted by monetary transactions, involving a bank or credit card service to handle the electronic cash flow. To achieve a reliable and consistent flow of information in these applications, the concept of transactions has been proven to be the right choice.
transaction tip protocol description example test internet practice www application online multiple adminsitration research development security analysis reference paper document howto pdf download useful
Saved by uncleflo on November 17th, 2016.
Web Service Atomic Transaction is an OASIS standard. To achieve all-or-nothing property for a group of services, it defines three protocols (completion, volatile two-phase commit, and durable two-phase commit), and a set of services. These protocols and services together ensure automatic activation, registration, propagation and atomic termination of Web services. The protocols are implemented via the WS-Coordination context management framework and emulate ACID transaction properties. Following the standard, a distributed transaction has a coordinator, an initiator, and one or more participants. This document describes the WS-AtomicTransaction namespace. It also contains a directory of links to related resources using Resource Directory Description Language (RDDL) 2.0.
atomic atomicity transaction data service webservice oasis web complete commit rollback online activation phase document docs protocol register propagate implementation development administration framework communication standard manage
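The all-or-nothing property that the two-phase commit protocols aim for can be illustrated with a generic, in-memory sketch. This is not the WS-AtomicTransaction message exchange, and it ignores failures, timeouts and the volatile/durable distinction; the `Participant` class and `run_transaction` function are hypothetical names for this example.

```python
class Participant:
    """A resource that votes in phase one and obeys the outcome in phase two."""

    def __init__(self, name, can_commit=True):
        self.name = name
        self.can_commit = can_commit
        self.state = "active"

    def prepare(self):
        # Vote yes only if the work can be made permanent.
        self.state = "prepared" if self.can_commit else "aborted"
        return self.can_commit

    def commit(self):
        self.state = "committed"

    def rollback(self):
        self.state = "aborted"

def run_transaction(participants):
    """Coordinator: commit everywhere only if every participant votes yes."""
    votes = [participant.prepare() for participant in participants]  # phase 1
    if all(votes):
        for participant in participants:                             # phase 2
            participant.commit()
        return "committed"
    for participant in participants:
        participant.rollback()
    return "aborted"

print(run_transaction([Participant("orders"), Participant("billing")]))
# committed
print(run_transaction([Participant("orders"), Participant("billing", can_commit=False)]))
# aborted
```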
Saved by uncleflo on October 21st, 2016.
WSDL is an XML format for describing network services as a set of endpoints operating on messages containing either document-oriented or procedure-oriented information. The operations and messages are described abstractly, and then bound to a concrete network protocol and message format to define an endpoint. Related concrete endpoints are combined into abstract endpoints (services). WSDL is extensible to allow description of endpoints and their messages regardless of what message formats or network protocols are used to communicate, however, the only bindings described in this document describe how to use WSDL in conjunction with SOAP 1.1, HTTP GET/POST, and MIME.
xml standard document procedure information wsdl international development learn service endpoint description definition w3 consortium network protocol application administration explanation detail reference
Saved by uncleflo on July 23rd, 2016.
PowerPay's deep expertise and wide range of services mean we can say "yes" to all kinds of customers, whether they need software, eCommerce solutions, card processing or merchant account services. Agent Partnerships offer generous revenue share programs and unique opportunities that benefit merchant accounts and agents alike.
merchant account agent partner power pay career leadership team contact customer application digital signature installation resource service document software processing personal support market trend revenue stream competition integration fund country solution technical capability comprehensive industry money payment
Saved by uncleflo on March 11th, 2016.
Saved by uncleflo on June 13th, 2015.
Welcome to the SVGBasics tutorials. Scalable Vector Graphics (SVG) is a standard from the W3C which is built on top of XML. The SVG standard describes ways that graphics can be drawn for use on the web. I've assembled here some examples that you can freely modify and reuse. The idea is to cover the basics with numerous examples that people can experiment with to get an idea of what SVG can do. The Scalable Vector Graphics (SVG) recommendation comes from the W3C. The recommendation describes how to create vector graphics using a mark up language. Because SVG is based on XML, tools that already know how to interpret XML will be able to interpret SVG. Having this well-established standard as a foundation saves a lot of the work in defining the language. It also means that there are numerous tools around which can check the structure of an SVG document or read it to pull out interesting information.
svg graphics browser vector scale hand tutorial xml howto development step-by-step description structure document language tool information font curve line transformation shape basic text light image source
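The point that XML tools can already read SVG can be tried directly with a general-purpose XML parser. The sketch below parses a small, made-up SVG document with Python's standard `xml.etree.ElementTree` module and lists the shapes it contains; the drawing itself is only an illustration.

```python
import xml.etree.ElementTree as ET

SVG_NS = "http://www.w3.org/2000/svg"

svg_source = """<svg xmlns="http://www.w3.org/2000/svg" width="120" height="80">
  <rect x="10" y="10" width="100" height="60" fill="steelblue"/>
  <circle cx="60" cy="40" r="20" fill="white"/>
</svg>"""

root = ET.fromstring(svg_source)

# ElementTree reports tags as "{namespace}name"; keep only the local name.
shapes = [element.tag.split("}", 1)[1] for element in root]
print(shapes)                                 # ['rect', 'circle']
print(root.get("width"), root.get("height"))  # 120 80
```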
Saved by uncleflo on February 22nd, 2015.
From blogging to log analysis and search engine optimisation (SEO), people are looking for scripts that can parse web pages and RSS feeds from other websites, to see where their traffic is coming from among other things. Parsing your own HTML should be no problem, assuming that you use consistent formatting, but once you set your sights on parsing other people's HTML the frustration really sets in. This page presents some regular expressions and a commentary that will hopefully point you in the right direction.
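As a taste of the kind of expression involved, the sketch below pulls link targets and link text out of an HTML fragment with a regular expression. It is not one of the expressions from the bookmarked page, and it is deliberately simple: it only handles quoted `href` values and will miss or mangle unusual markup, which is exactly the frustration the excerpt describes. The names `LINK_PATTERN` and `extract_links` are made up for the example.

```python
import re

# An <a ...> tag with a quoted href, then everything up to the closing </a>.
LINK_PATTERN = re.compile(
    r"""<a\s[^>]*?href\s*=\s*(["'])(.*?)\1[^>]*>(.*?)</a>""",
    re.IGNORECASE | re.DOTALL,
)

def extract_links(html):
    """Return (href, text) pairs, with nested tags stripped from the text."""
    return [
        (match.group(2), re.sub(r"<[^>]+>", "", match.group(3)).strip())
        for match in LINK_PATTERN.finditer(html)
    ]

sample = (
    '<p><A HREF="/feed.rss">RSS <b>feed</b></A> and '
    "<a class=\"x\" href='https://example.org/'>home</a></p>"
)
print(extract_links(sample))
# [('/feed.rss', 'RSS feed'), ('https://example.org/', 'home')]
```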